
I have a web application backed by a Postgres (v11) database, with a main table where each row may be seen as an object and each column as a field of that object.

So we have:

| id | name | field1 | field2 | ... | field100 |
|----|------|--------|--------|-----|----------|
| 1  | foo  | 12.2   | blue   | ... | 13.7     |
| 2  | bar  | 22.1   | green  | ... | 78.0     |

The table was created using:

CREATE TABLE records(
  id VARCHAR(50) PRIMARY KEY,
  name VARCHAR(50),
  field1 NUMERIC,
  field2 VARCHAR(355),
  -- ... fields 3 through 99 elided ...
  field100 NUMERIC);

Now I have an audit table which stores updates to every field of every object. It holds rows like:

| timestamp | objid | fieldname | oldval | newval |
|-----------|-------|-----------|--------|--------|
| 1234      | 1     | field2    | white  | blue   |
| 1367      | 1     | field1    | "11.5" | "12.2" |
| 1372      | 2     | field1    | "11.9" | "22.1" |
| 1387      | 1     | name      | baz    | foo    |

The table was created using:

CREATE TABLE audit_log(
  timestamp TIMESTAMP,
  objid VARCHAR(50) REFERENCES records(id),
  fieldname VARCHAR(50) NOT NULL,
  oldval VARCHAR(355),
  newval VARCHAR(355));

oldval/newval are kept as VARCHAR since they are purely for auditing purposes, so the actual data type does not really matter.

For obvious reasons, this table has gotten huge over the last few years, so I want to delete some old data. Someone suggested keeping only the last 5 updates for every object (i.e. the UI can then show the last 5 updates from the audit table).

I understand you can get this using a GROUP BY and a LIMIT, but the problem is that I have a million+ objects, some of which have been updated a thousand+ times while others have had barely a few updates over multiple years. And the audit log is very read/write-heavy (as can be expected).
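For reference, the per-object read the UI needs is simple (a sketch, using the sample id from above):

SELECT *
FROM   audit_log
WHERE  objid = '1'
ORDER  BY timestamp DESC
LIMIT  5;

The hard part is pruning everything below that cutoff for a million+ objects in one go.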

What would be the best way to delete all entries which are older than the 5th latest update for each object (of course, ideally I'll move that out to some secondary storage)?

navinpai
  • Please provide your Postgres version and `CREATE TABLE` statements with only the relevant columns, but showing data types and constraints. – Erwin Brandstetter Feb 22 '19 at 22:00
  • Added details requested (I'm guessing you wanted to ask about the audit table datatypes, so added the same). – navinpai Feb 22 '19 at 22:08
  • I asked for `CREATE TABLE` statements, which are the *one* true source of information (including data types and constraints). You can see those in pgAdmin III or 4 for instance. – Erwin Brandstetter Feb 22 '19 at 22:11
  • Updated again @ErwinBrandstetter – navinpai Feb 22 '19 at 22:23
  • No PK in `audit_log`? Is it `objid` or `id` - and I suppose `integer`? – Erwin Brandstetter Feb 22 '19 at 22:34
  • @ErwinBrandstetter No there's no PK in `audit_log`. What would it even be? The id is the object id (it's a uuid with a prefix string). `only the last 5 updates for every object` is correct. If the last 5 updates for an object were all on the same field, that is fine. If last 5 were across different fields, that's also fine. Basically trying to look at it as last 5 modifications to the record. – navinpai Feb 24 '19 at 22:47
  • I see, 5 rows per object, not per field. PK: in a normalized schema *every* table has a PK. The `ctid` is a poor man's substitute that works for this purpose (but not for certain others); it's a Postgres-specific (not standard SQL) implementation detail. It's always preferable to use the PK to identify rows (and there is an index on it automatically). `objid` <-> `id` - there's a mismatch in the question. – Erwin Brandstetter Feb 24 '19 at 23:03

3 Answers


There are a few ingredients to the solution:

  • The PostgreSQL row_number function. Unfortunately this is a "window function" and can't be used in the where clause.
  • A common table expression (CTE): "with T as (...some SQL...) ...do something with T..."
  • The PostgreSQL ctid field, which uniquely identifies a row in a table.

You use the CTE to create a logical table that includes both ctid and row_number. Then you reference it from a delete statement. Something like this:

with t as (
    select ctid, row_number() over (partition by objid)  -- no ORDER BY: which 5 rows survive per object is arbitrary
    from the_audit_table
)
delete from the_audit_table
where ctid in (select ctid from t where row_number > 5);

If you're concerned about the effect of doing this all at once, then just run a lot of smaller transactions, each over some subset of the objid space, as sketched below. Or (if you're going to wind up deleting 99% of the rows) create a new table, change row_number > 5 to row_number <= 5, turn the statement into an insert into the new table, and then replace the old table with the new one.
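One such batch might look like this (a sketch: the slice bounds 'a' and 'b' are hypothetical and would come from whatever loop drives the batches; partitioning by objid within a slice is still correct, since all rows of an object fall in the same slice):

with t as (
    select ctid, row_number() over (partition by objid)
    from the_audit_table
    where objid >= 'a' and objid < 'b'  -- one slice of the id space
)
delete from the_audit_table
where ctid in (select ctid from t where row_number > 5);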

Test in QA first! :-)

Willis Blackburn

If you are going to keep just 5 records in groups that might contain thousands, a more efficient approach would be to use a temporary table.

First, create a new table on the fly by selecting the records that you want to retain, using the CREATE TABLE AS syntax. Window functions make it easy to select the records.

CREATE TABLE audit_log_backup AS
SELECT timestamp, objid, fieldname, oldval, newval
FROM (
    SELECT a.*, ROW_NUMBER() OVER(PARTITION BY objid ORDER BY timestamp DESC) rn
    FROM audit_log a
) x WHERE rn <= 5;

Then, just TRUNCATE the original table and reinsert the saved data:

TRUNCATE audit_log;
INSERT INTO audit_log SELECT * FROM audit_log_backup;
-- and, once you no longer need the backup:
DROP TABLE audit_log_backup;

As explained in the documentation, truncating a big table is much more efficient than deleting from it:

TRUNCATE quickly removes all rows from a set of tables. It has the same effect as an unqualified DELETE on each table, but since it does not actually scan the tables it is faster. Furthermore, it reclaims disk space immediately, rather than requiring a subsequent VACUUM operation. This is most useful on large tables.

One thing to be careful about, as commented by Erwin Brandstetter, is that this technique creates a race condition: records added (or updated) after the copy starts will not be taken into account. One solution is to perform all operations in a single transaction, while locking the table:

BEGIN WORK;
LOCK TABLE audit_log IN SHARE ROW EXCLUSIVE MODE;
CREATE TABLE audit_log_backup AS ...;
TRUNCATE audit_log;
INSERT INTO audit_log SELECT * FROM audit_log_backup;
COMMIT WORK;

The downside is that any session trying to access the table will be made to wait while the transaction is in progress.
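A variant suggested in the comments below is to stage the kept rows in a TEMPORARY table instead, which is not WAL-logged and so faster to fill (a sketch; audit_log_keep is a hypothetical name):

BEGIN WORK;
LOCK TABLE audit_log IN SHARE ROW EXCLUSIVE MODE;
CREATE TEMPORARY TABLE audit_log_keep ON COMMIT DROP AS
SELECT timestamp, objid, fieldname, oldval, newval
FROM (
    SELECT a.*, ROW_NUMBER() OVER(PARTITION BY objid ORDER BY timestamp DESC) rn
    FROM audit_log a
) x WHERE rn <= 5;
TRUNCATE audit_log;
INSERT INTO audit_log SELECT * FROM audit_log_keep;
COMMIT WORK;  -- ON COMMIT DROP removes the staging table automatically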


Disclaimer: whatever you do, make sure to properly back up the whole table before you start purging it!

GMB
  • Much faster than deleting in place. Be aware of a race condition, though. The table is "very read/write heavy". Every row written concurrently between the start of your `CREATE TABLE` statement and the `TRUNCATE` is lost. Plus, `TRUNCATE` requires an exclusive lock. The (only) safe way would be to do this in a transaction and start with an exclusive lock on the table - which might be a problem for a read/write heavy table. Catch 22. – Erwin Brandstetter Feb 22 '19 at 22:50
  • @ErwinBrandstetter : thanks for pointing this out, I updated my answer with more details ! – GMB Feb 23 '19 at 00:27
  • I like your answer now. You might even use a `TEMPORARY` table for speed. See: https://stackoverflow.com/a/8290958/939860 – Erwin Brandstetter Feb 23 '19 at 00:51

You could use a plain row_number(), similar to what @Willis suggested, improved with ORDER BY:

WITH cte AS (
    SELECT ctid
         , row_number() OVER (PARTITION BY objid ORDER BY timestamp DESC) AS rn
    FROM   audit_log
   )
DELETE FROM audit_log
USING  cte
WHERE  cte.ctid = audit_log.ctid
AND    cte.rn > 5;

That's going to take a long time for your big table. You can make it faster with a multicolumn index on audit_log(objid, timestamp DESC).
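A sketch of that index (the name is up to you):

CREATE INDEX audit_log_objid_timestamp_idx ON audit_log (objid, timestamp DESC);

With the index in place, use this query: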

WITH del AS (
   SELECT x.ctid
   FROM   records r
   CROSS  JOIN LATERAL (
      SELECT a.ctid
      FROM   audit_log a
      WHERE  a.objid = r.id
      ORDER  BY a.timestamp DESC
      OFFSET 5  -- excluding the first 5 per object
      ) x
   )
DELETE FROM audit_log
USING  del
WHERE  del.ctid = audit_log.ctid;

Or:

DELETE FROM audit_log
WHERE  ctid NOT IN (
   SELECT x.ctid
   FROM   records r
   CROSS  JOIN LATERAL (
      SELECT a.ctid
      FROM   audit_log a
      WHERE  a.objid = r.id
      ORDER  BY a.timestamp DESC
      LIMIT  5  -- the inverse selection here
      ) x
   );

The latter may be faster with the supporting index.

Writing a new table with just the top 5 rows for each object would be much faster. You can use the subquery from the last query for this, as sketched below. (And see GMB's answer.) It produces a pristine table without bloat. But I ruled that out because the table is very read/write heavy: if you cannot afford the necessary exclusive lock for some time, that's a no-go.
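A minimal sketch of that route, reusing the LATERAL subquery (audit_log_new is a hypothetical name; swapping it in for the old table needs its own brief exclusive lock):

CREATE TABLE audit_log_new AS
SELECT x.*
FROM   records r
CROSS  JOIN LATERAL (
   SELECT a.*
   FROM   audit_log a
   WHERE  a.objid = r.id
   ORDER  BY a.timestamp DESC
   LIMIT  5
   ) x;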

Your timestamp column is not defined NOT NULL, so you may need NULLS LAST in the ORDER BY: with DESC, NULLs sort first by default and would count among the newest 5 rows.

Erwin Brandstetter