Skip to main content

System Logs Dataset Specification

Updated over a month ago

System Logs Dataset Specification

Audit Log

The "Audit Log" dataset serves as a detailed record of actions and changes made to entities within the system. It tracks various operations such as creation, modification, deletion, and viewing of records. Each entry in the log includes the entity’s unique identifier, the type of action taken, the user responsible for the action, and a timestamp of when the action occurred. The dataset provides transparency and traceability for all transactions, making it an essential component for ensuring data integrity, security, and compliance within the system.

Table Structure: Audit Log

Table Name

Column Name

Is Nullable

Data Type

Description

Audit_log

Entity_ID

YES

STRING

Unique identifier for the entity or record being audited.

Audit_log

Entity_Name

YES

STRING

Name of the entity or record type that is being logged in the audit.

Audit_log

Action

YES

STRING

The specific action taken on the entity, such as create, update, delete, or view.

Audit_log

Record_Last_Modified_Timestamp

YES

TIMESTAMP

Timestamp indicating the last time the record was modified or updated.

Audit_log

Transaction_User_ID

YES

STRING

The ID of the user who performed the action or transaction being logged.

Audit_log

Payload

YES

STRING

The detailed data or content related to the action performed, often containing before-and-after states or changes made.


Extract Runs Log

The "Extract Runs Log" dataset provides detailed information about the execution of data extraction processes within the system. It records metadata about each extract run, including the table being processed, the time intervals for extraction, and the status of the run. This dataset is critical for monitoring and troubleshooting data pipeline operations.

Table Structure: Extract Runs Log

Table Name

Column Name

Is Nullable

Data Type

Description

extracts_run_logs

Dag_Run_Id

NO

STRING(100)

The unique identifier for the Directed Acyclic Graph (DAG) run in the data pipeline.

extracts_run_logs

Timestamp_Id

NO

TIMESTAMP

A timestamp identifier associated with this extract run.

extracts_run_logs

Table_Name

NO

STRING(100)

The name of the table being extracted or processed.

extracts_run_logs

Start_Interval

YES

TIMESTAMP

The start time of the interval for which data is being extracted.

extracts_run_logs

End_Interval

YES

TIMESTAMP

The end time of the interval for which data is being extracted.

extracts_run_logs

Grace_Period_Min

YES

INT64

The grace period in minutes allowed for late-arriving data.

extracts_run_logs

Status

NO

STRING(50)

The current status of the extract run (e.g., success, failed, in progress).

extracts_run_logs

Number_Of_Files

YES

INT64

The number of files processed during this extract run.

extracts_run_logs

Number_Of_Entries

YES

INT64

The number of data entries or rows processed during this extract run.

extracts_run_logs

Last_Step_On_Failed

YES

STRING(100)

The last step that was executed before a failure occurred, if applicable.

extracts_run_logs

Error_Message

YES

STRING(250)

Any error message associated with a failed extract run.

extracts_run_logs

Is_Shadow

YES

BOOL

Indicates whether this is a shadow run (test run) or a production run.

extracts_run_logs

Create_Timestamp

NO

TIMESTAMP

The timestamp when this extract run log entry was created.

Did this answer your question?