This feature is in preview and subject to change. To share feedback and/or issues, contact Support.
MOLT Verify checks for data discrepancies between a source database and CockroachDB during a database migration.
The tool performs the following verifications to ensure data integrity during a migration:
- Table Verification: Check that the structure of tables between the source database and the target database are the same.
- Column Definition Verification: Check that the column names, data types, constraints, nullability, and other attributes between the source database and the target database are the same.
- Row Value Verification: Check that the actual data in the tables is the same between the source database and the target database.
For a demo of MOLT Verify, watch the following video:
Supported databases
The following source databases are currently supported:
- PostgreSQL 12-16
- MySQL 5.7, 8.0 and later
- Oracle Database 19c (Enterprise Edition) and 21c (Express Edition)
Installation
To install MOLT, download the binary that matches your architecture and source database:
| Operating System | Architecture | PostgreSQL/MySQL | Oracle (Preview) | 
|---|---|---|---|
| Windows | AMD 64-bit | Download | N/A | 
| ARM 64-bit | Download | N/A | |
| Linux | AMD 64-bit | Download | Download | 
| ARM 64-bit | Download | N/A | |
| Mac | AMD 64-bit | Download | N/A | 
| ARM 64-bit | Download | N/A | 
The following binaries are included:
- molt
- replicator
Both molt and replicator must be in your current working directory. To use replication features, replicator must be located either in the same directory as molt or in a directory directly beneath molt. For example, either of the following would be valid:
/migration-project/          # Your current working directory
├── molt                     # MOLT binary
└── replicator               # Replicator binary
/migration-project/          # Your current working directory
├── molt                     # MOLT binary
└── bin/                     # Subdirectory
    └── replicator           # Replicator binary
To display the current version of each binary, run molt --version and replicator --version.
For previous binaries, refer to the MOLT version manifest. For release details, refer to the MOLT changelog.
Docker image
Docker multi-platform images containing both the AMD and ARM binaries are available. To pull the latest image for PostgreSQL and MySQL:
docker pull cockroachdb/molt
To pull a specific version (e.g., 1.1.3):
docker pull cockroachdb/molt:1.1.3
To pull the latest image for Oracle (note that only linux/amd64 is supported):
docker pull cockroachdb/molt:oracle-latest
For details on running in Docker, refer to Docker usage.
Setup
Complete the following items before using MOLT Verify:
- The SQL user running MOLT Verify must have the - SELECTprivilege on both the source and target CockroachDB tables.
- Percent-encode the connection strings for the source database and CockroachDB. This ensures that the MOLT tools can parse special characters in your password. - Given a password - a$52&, pass it to the- molt escape-passwordcommand with single quotes:- molt escape-password --password 'a$52&'- Substitute the following encoded password in your original connection url string: a%2452%26
- Use the encoded password in your connection string. For example: - postgres://postgres:a%2452%26@localhost:5432/replicationload
 
Flags
| Flag | Description | 
|---|---|
| --source | (Required) Connection string for the source database. | 
| --target | (Required) Connection string for the target database. | 
| --concurrency | Number of threads to process at a time when reading the tables. Default: 16 For faster verification, set this flag to a higher value. | 
| --continuous | Verify tables in a continuous loop. Default: false | 
| --live | Retry verification on rows before emitting warnings or errors. This is useful during live data import, when temporary mismatches can occur. Default: false | 
| --log-file | Write messages to the specified log filename. If no filename is provided, messages write to verify-{datetime}.log. If"stdout"is provided, messages write tostdout. | 
| --metrics-listen-addr | Address of the metrics endpoint, which has the path {address}/metrics.Default: '127.0.0.1:3030' | 
| --row-batch-size | Number of rows to get from a table at a time. Default: 20000 | 
| --schema-filter | Verify schemas that match a specified regular expression. Default: '.*' | 
| --table-filter | Verify tables that match a specified regular expression. Default: '.*' | 
Usage
molt verify takes two SQL connection strings as --source and --target arguments.
To compare a PostgreSQL database with a CockroachDB database:
molt verify \
  --source 'postgresql://{username}:{password}@{host}:{port}/{database}' \
  --target 'postgresql://{username}:{password}@{host}:{port}/{database}?sslmode=verify-full'
To compare a MySQL database with a CockroachDB database:
molt verify \
  --source 'mysql://{username}:{password}@{protocol}({host}:{port})/{database}' \
  --target 'postgresql://{username}:{password}@{host}:{port}/{database}?sslmode=verify-full'
Use the optional flags to customize the verification results.
When verification completes, the output displays a summary message like the following:
{"level":"info","type":"summary","table_schema":"public","table_name":"common_table","num_truth_rows":6,"num_success":3,"num_conditional_success":0,"num_missing":2,"num_mismatch":1,"num_extraneous":2,"num_live_retry":0,"num_column_mismatch":0,"message":"finished row verification on public.common_table (shard 1/1)"}
- num_missingis the number of rows that are missing on the target database. You can add any missing data to the target database and run- molt verifyagain.
- num_mismatchis the number of rows with mismatched values on the target database.
- num_extraneousis the number of extraneous tables on the target database.
- num_column_mismatchis the number of columns with mismatched types on the target database, preventing- molt verifyfrom comparing the column's rows. For example, if your source table uses an auto-incrementing ID, MOLT Verify will identify a mismatch with CockroachDB's- UUIDtype. In such cases, you might fix the mismatch by creating a composite type on CockroachDB that uses the auto-incrementing ID.
- num_successis the number of rows that matched.
- num_conditional_successis the number of rows that matched while having a column mismatch due to a type difference. This value indicates that all other columns that could be compared have matched successfully. You should manually review the warnings and errors in the output to determine whether the column mismatches can be ignored.
Docker usage
For details on pulling Docker images, see Docker image.
Performance
MOLT Fetch and Verify are likely to run more slowly in a Docker container than on a local machine. To improve performance, increase the memory or compute resources, or both, on your Docker container.
Local connection strings
When testing locally, specify the host as follows:
- For macOS, use host.docker.internal. For example:
--source 'postgres://postgres:postgres@host.docker.internal:5432/molt?sslmode=disable'
--target "postgres://root@host.docker.internal:26257/molt?sslmode=disable"
- For Linux and Windows, use 172.17.0.1. For example:
--source 'postgres://postgres:postgres@172.17.0.1:5432/molt?sslmode=disable'
--target "postgres://root@172.17.0.1:26257/molt?sslmode=disable"
Known limitations
- MOLT Verify compares 20,000 rows at a time by default, and row values can change between batches, potentially resulting in temporary inconsistencies in data. If --livemode is enabled, MOLT Verify retries verification on these rows. To configure the row batch size, use the--row_batch_sizeflag.
- MOLT Verify checks for collation mismatches on primary key columns. This may cause validation to fail when a STRINGis used as a primary key and the source and target databases are using different collations.
- MOLT Verify might give an error in case of schema changes on either the source or target database.
- Geospatial types cannot yet be compared.
The following limitation is specific to MySQL:
- MOLT Verify only supports comparing one MySQL database to a whole CockroachDB schema (which is assumed to be public).