For large databases (> 600 GB), we noticed that the resulting dumps are inconsistent across tables. Some tables contain rows that were inserted after the dump started, which results in inconsistent data between tables.
We traced this to how gdpr-dump runs the dump queries: it starts a transaction per table instead of a single transaction for the entire dump, which results in an inconsistent snapshot.
After tracing the code, this seems to be an issue with MySQLDump-PHP and not gdpr-dump itself, but I am opening the issue here to check whether this is a known issue or limitation.
We'll be happy to contribute to this project to get this fixed 🙏🏻
Preconditions
GdprDump Version: 3.0.0
PHP Version: 7.4
Database Version: MySQL 5.7 (AWS RDS)
Steps to reproduce
1. Choose a database with enough data that the dump command takes at least 2 to 3 minutes.
2. Create two empty tables, a_test and z_test.
3. Run gdpr-dump with a configuration whose tables_whitelist contains:
   tables_whitelist:
   - a_test
   - <a bigger table that takes 2~3 minutes to process>
   - z_test
   Note that tables are dumped in alphabetical order.
4. While gdpr-dump is running, insert some data into a_test and z_test.
5. Restore the generated dump.
Expected result
a_test and z_test should be empty.
Actual result
a_test is empty, z_test is not.
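To illustrate the difference between the two transaction strategies, here is a minimal sketch that models the reproduction steps above. It is not gdpr-dump or MySQLDump-PHP code: it uses Python's built-in sqlite3 module in WAL mode as a stand-in for MySQL, and the helper names (`count_rows`, `simulate_dump`) are hypothetical. With MySQL/InnoDB, the analogous approach would be to read every table inside one REPEATABLE READ transaction opened with `START TRANSACTION WITH CONSISTENT SNAPSHOT`, which is what mysqldump's `--single-transaction` option does.

```python
import os
import sqlite3
import tempfile

TABLES = ["a_test", "z_test"]  # read in alphabetical order, as in the report


def count_rows(conn, table):
    """Count the rows of one table and fully finish the statement."""
    cur = conn.execute(f"SELECT COUNT(*) FROM {table}")
    rows = cur.fetchall()  # exhaust the cursor so no read stays pending
    cur.close()
    return rows[0][0]


def simulate_dump(single_transaction):
    """Read both tables while a concurrent writer inserts into them.

    The writer runs after the first table has been read, mimicking
    inserts that arrive while a long dump is in progress.
    """
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.db")

        # WAL mode (assumption of this sketch) lets a reader keep its
        # snapshot while another connection commits writes.
        setup = sqlite3.connect(path, isolation_level=None)
        setup.execute("PRAGMA journal_mode=WAL")
        for table in TABLES:
            setup.execute(f"CREATE TABLE {table} (id INTEGER)")
        setup.close()

        # isolation_level=None: transactions are controlled explicitly.
        reader = sqlite3.connect(path, isolation_level=None)
        writer = sqlite3.connect(path, isolation_level=None)
        try:
            counts = {}
            if single_transaction:
                reader.execute("BEGIN")  # one transaction for the whole dump
            for index, table in enumerate(TABLES):
                if not single_transaction:
                    reader.execute("BEGIN")  # one transaction per table
                counts[table] = count_rows(reader, table)
                if not single_transaction:
                    reader.execute("COMMIT")
                if index == 0:
                    # Concurrent inserts while the dump is still running.
                    for target in TABLES:
                        writer.execute(f"INSERT INTO {target} VALUES (1)")
            if single_transaction:
                reader.execute("COMMIT")
            return counts
        finally:
            reader.close()
            writer.close()


if __name__ == "__main__":
    print("per-table transactions:", simulate_dump(single_transaction=False))
    print("single transaction:    ", simulate_dump(single_transaction=True))
```

In this model, per-table transactions reproduce the reported result (a_test empty, z_test not), while a single transaction keeps both tables at the state they had when the first read happened.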