
# Debezium MySQL Connector

Capture changes from a MySQL database.
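
As with other Camel components, the endpoint is addressed by a unique connector name, with configuration appended as query options (a sketch of the general form; the available options are listed under Endpoint Properties below):

```
debezium-mysql:name[?options]
```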

## Metadata

| Property | Value |
| --- | --- |
| Scheme | debezium-mysql |
| Support Level | Stable |
| Labels | database |
| Version | 4.10.2 |

## Maven Dependency

```xml
<dependency>
    <groupId>org.apache.camel</groupId>
    <artifactId>camel-debezium-mysql</artifactId>
    <version>4.10.2</version>
</dependency>
```
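
A consumer route then only needs the endpoint URI. The following is a minimal sketch: hostname, credentials, server ID, and file paths are placeholders, and the file-based schema history class name (`io.debezium.storage.file.history.FileSchemaHistory`) is an assumption based on recent Debezium storage modules; by default the component uses the Kafka-backed store listed under `schemaHistoryInternal` below.

```java
import org.apache.camel.builder.RouteBuilder;

public class MySqlCdcRoute extends RouteBuilder {
    @Override
    public void configure() throws Exception {
        // Stream row-level changes from the MySQL binlog and log each event.
        // Host, credentials, server id, and file paths are placeholders.
        from("debezium-mysql:my-connector"
                + "?databaseHostname=localhost"
                + "&databasePort=3306"
                + "&databaseUser=debezium"
                + "&databasePassword=dbz"
                + "&databaseServerId=184054"
                + "&topicPrefix=my-app"
                // Persist offsets and schema history in local files instead of Kafka.
                + "&offsetStorageFileName=/tmp/offsets.dat"
                + "&schemaHistoryInternal=io.debezium.storage.file.history.FileSchemaHistory"
                + "&schemaHistoryInternalFileFilename=/tmp/schema-history.dat")
            .log("Change event: ${body}");
    }
}
```

The file-based offset and schema-history stores keep the sketch self-contained; production deployments typically point both at durable storage.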

## Endpoint Properties

Most endpoint options are camelCase forms of the underlying Debezium connector properties, which is why the descriptions below refer to dotted names such as 'snapshot.mode' and 'offset.storage'.

| Name | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| name | string | | | Unique name for the connector. Attempting to register again with the same name will fail. |
| additionalProperties | object | | | Additional properties for Debezium components in case they can't be set directly on the Camel configuration (e.g. Kafka Connect properties needed by the Debezium engine, such as setting KafkaOffsetBackingStore). The properties have to be prefixed with `additionalProperties.`, e.g. `additionalProperties.transactional.id=12345&additionalProperties.schema.registry.url=http://localhost:8811/avro` |
| internalKeyConverter | string | | org.apache.kafka.connect.json.JsonConverter | The Converter class that should be used to serialize and deserialize key data for offsets. The default is the JSON converter. |
| internalValueConverter | string | | org.apache.kafka.connect.json.JsonConverter | The Converter class that should be used to serialize and deserialize value data for offsets. The default is the JSON converter. |
| offsetCommitPolicy | string | | | The name of the Java class of the commit policy. It defines when offset commits have to be triggered based on the number of events processed and the time elapsed since the last commit. This class must implement the 'OffsetCommitPolicy' interface. The default is a periodic commit policy based on time intervals. |
| offsetCommitTimeoutMs | duration | | 5000 | Maximum number of milliseconds to wait for records to flush and partition offset data to be committed to offset storage before cancelling the process and restoring the offset data to be committed in a future attempt. The default is 5 seconds. |
| offsetFlushIntervalMs | duration | | 60000 | Interval at which to try committing offsets. The default is 1 minute. |
| offsetStorage | string | | org.apache.kafka.connect.storage.FileOffsetBackingStore | The name of the Java class that is responsible for persistence of connector offsets. |
| offsetStorageFileName | string | | | Path to the file where offsets are to be stored. Required when offset.storage is set to the FileOffsetBackingStore. |
| offsetStoragePartitions | integer | | | The number of partitions used when creating the offset storage topic. Required when offset.storage is set to the KafkaOffsetBackingStore. |
| offsetStorageReplicationFactor | integer | | | Replication factor used when creating the offset storage topic. Required when offset.storage is set to the KafkaOffsetBackingStore. |
| offsetStorageTopic | string | | | The name of the Kafka topic where offsets are to be stored. Required when offset.storage is set to the KafkaOffsetBackingStore. |
| bridgeErrorHandler | boolean | | false | Allows for bridging the consumer to the Camel routing Error Handler, which means any exceptions (if possible) that occur while the Camel consumer is trying to pick up incoming messages will be processed as a message and handled by the routing Error Handler. Important: this is only possible if the 3rd-party component allows Camel to be alerted when an exception is thrown. Some components handle this internally only, so bridgeErrorHandler is not possible there. In other situations we may improve the Camel component to hook into the 3rd-party component and make this possible in future releases. By default, the consumer uses the org.apache.camel.spi.ExceptionHandler to deal with exceptions; they are logged at WARN or ERROR level and ignored. |
| exceptionHandler | object | | | To let the consumer use a custom ExceptionHandler. Note that if the bridgeErrorHandler option is enabled, this option is not in use. By default, the consumer deals with exceptions itself; they are logged at WARN or ERROR level and ignored. |
| exchangePattern | object | | | Sets the exchange pattern when the consumer creates an exchange. |
| bigintUnsignedHandlingMode | string | | long | Specify how BIGINT UNSIGNED columns should be represented in change events: 'precise' uses java.math.BigDecimal to represent values, which are encoded in the change events using a binary representation and Kafka Connect's 'org.apache.kafka.connect.data.Decimal' type; 'long' (the default) represents values using Java's 'long', which may not offer the precision but is far easier to use in consumers. |
| binlogBufferSize | integer | | 0 | The size of a look-ahead buffer used by the binlog reader to decide whether the transaction in progress is going to be committed or rolled back. Use 0 to disable look-ahead buffering. Defaults to 0 (i.e. buffering is disabled). |
| columnExcludeList | string | | | Regular expressions matching columns to exclude from change events. |
| columnIncludeList | string | | | Regular expressions matching columns to include in change events. |
| columnPropagateSourceType | string | | | A comma-separated list of regular expressions matching fully-qualified names of columns whose original type and length are added as parameters to the corresponding field schemas in the emitted change records. |
| connectKeepAlive | boolean | | true | Whether a separate thread should be used to ensure the connection is kept alive. |
| connectKeepAliveIntervalMs | duration | | 1m | Interval for connection checking if the keep-alive thread is used, given in milliseconds. Defaults to 1 minute (60,000 ms). |
| connectTimeoutMs | duration | | 30s | Maximum time to wait after trying to connect to the database before timing out, given in milliseconds. Defaults to 30 seconds (30,000 ms). |
| converters | string | | | Optional list of custom converters to use instead of the default ones. The converters are defined using the `<converter.prefix>.type` config option and configured using options `<converter.prefix>.<option>`. |
| customMetricTags | string | | | Custom metric tags, given as key-value pairs, customize the MBean object name; they are appended to the end of the regular name. Each key represents a tag for the MBean object name, and the corresponding value is that tag's value. For example: k1=v1,k2=v2 |
| databaseExcludeList | string | | | A comma-separated list of regular expressions that match database names to be excluded from monitoring. |
| databaseHostname | string | | | Resolvable hostname or IP address of the database server. |
| databaseIncludeList | string | | | The databases for which changes are to be captured. |
| databaseInitialStatements | string | | | A semicolon-separated list of SQL statements to be executed when a JDBC connection (not the binlog-reading connection) to the database is established. Note that the connector may establish JDBC connections at its own discretion, so this should typically be used only for configuring session parameters, not for executing DML statements. Use a doubled semicolon (';;') to use a semicolon as a character rather than as a delimiter. |
| databaseJdbcDriver | string | | com.mysql.cj.jdbc.Driver | JDBC driver class name used to connect to the MySQL database server. |
| databasePassword | string | | | Password of the database user to be used when connecting to the database. |
| databasePort | integer | | 3306 | Port of the database server. |
| databaseProtocol | string | | jdbc:mysql | JDBC protocol to use with the driver. |
| databaseQueryTimeoutMs | duration | | 10m | Time to wait for a query to execute, given in milliseconds. Defaults to 600 seconds (600,000 ms); zero means there is no limit. |
| databaseServerId | integer | | | A numeric ID of this database client, which must be unique across all currently running database processes in the cluster. This connector joins the database cluster as another server (with this unique ID) so it can read the binlog. |
| databaseServerIdOffset | integer | | 10000 | Only relevant if parallel snapshotting is configured. During parallel snapshotting, multiple (4) connections are opened to the database, and each needs its own unique connection ID. This offset is used to generate those IDs from the base configured cluster ID. |
| databaseSslKeystore | string | | | The location of the key store file. This is optional and can be used for two-way authentication between the client and the database. |
| databaseSslKeystorePassword | string | | | The password for the key store file. This is optional and only needed if 'database.ssl.keystore' is configured. |
| databaseSslMode | string | | preferred | Whether to use an encrypted connection to the database. Options include: 'disabled' to use an unencrypted connection; 'preferred' (the default) to establish a secure (encrypted) connection if the server supports secure connections, but fall back to an unencrypted connection otherwise; 'required' to use a secure (encrypted) connection, and fail if one cannot be established; 'verify_ca', like 'required' but additionally verify the server TLS certificate against the configured Certificate Authority (CA) certificates, or fail if no valid matching CA certificates are found; or 'verify_identity', like 'verify_ca' but additionally verify that the server certificate matches the host to which the connection is attempted. |
| databaseSslTruststore | string | | | The location of the trust store file for server certificate verification. |
| databaseSslTruststorePassword | string | | | The password for the trust store file. Used to check the integrity of the truststore and to unlock it. |
| databaseUser | string | | | Name of the database user to be used when connecting to the database. |
| datatypePropagateSourceType | string | | | A comma-separated list of regular expressions matching database-specific data type names; the original type and length of matching types are added as parameters to the corresponding field schemas in the emitted change records. |
| decimalHandlingMode | string | | precise | Specify how DECIMAL and NUMERIC columns should be represented in change events: 'precise' (the default) uses java.math.BigDecimal to represent values, which are encoded in the change events using a binary representation and Kafka Connect's 'org.apache.kafka.connect.data.Decimal' type; 'string' uses string to represent values; 'double' represents values using Java's 'double', which may not offer the precision but is far easier to use in consumers. |
| enableTimeAdjuster | boolean | | true | MySQL allows the user to insert year values as either 2-digit or 4-digit. Two-digit values are automatically mapped into the range 1970-2069. 'false' delegates the implicit conversion to the database; 'true' (the default) has Debezium make the conversion. |
| errorsMaxRetries | integer | | -1 | The maximum number of retries on connection errors before failing (-1 = no limit, 0 = disabled, > 0 = number of retries). |
| eventDeserializationFailureHandlingMode | string | | fail | Specify how failures during deserialization of binlog events (i.e. when encountering a corrupted event) should be handled: 'fail' (the default) raises an exception indicating the problematic event and its binlog position, causing the connector to stop; 'warn' logs the problematic event and its binlog position and skips the event; 'ignore' skips the problematic event. |
| eventProcessingFailureHandlingMode | string | | fail | Specify how failures during processing of events (i.e. when encountering a corrupted event) should be handled: 'fail' (the default) raises an exception indicating the problematic event and its position, causing the connector to stop; 'warn' logs the problematic event and its position and skips the event; 'ignore' skips the problematic event. |
| gtidSourceExcludes | string | | | The source UUIDs used to exclude GTID ranges when determining the starting position in the MySQL server's binlog. |
| gtidSourceFilterDmlEvents | boolean | | true | When set to true, only produce DML events for transactions that were written on the server with GTIDs matching gtid.source.includes or gtid.source.excludes, if they were specified. |
| gtidSourceIncludes | string | | | The source UUIDs used to include GTID ranges when determining the starting position in the MySQL server's binlog. |
| heartbeatActionQuery | string | | | The query executed with every heartbeat. |
| heartbeatIntervalMs | duration | | 0ms | Length of the interval, in milliseconds, at which the connector periodically sends heartbeat messages to a heartbeat topic. Use 0 to disable heartbeat messages. Disabled by default. |
| heartbeatTopicsPrefix | string | | __debezium-heartbeat | The prefix that is used to name heartbeat topics. Defaults to __debezium-heartbeat. |
| includeQuery | boolean | | false | Whether the connector should include the original SQL query that generated the change event. Note: this option requires the database to be configured with the server option binlog_rows_query_log_events (MySQL) or binlog_annotate_row_events (MariaDB) set to ON. The query will not be present for events generated from a snapshot. WARNING: enabling this option may expose tables or fields that are explicitly excluded or masked, by including the original SQL statement in the change event. For this reason the default value is 'false'. |
| includeSchemaChanges | boolean | | true | Whether the connector should publish changes in the database schema to a Kafka topic with the same name as the database server ID. Each schema change is recorded using a key that contains the database name and a value that includes a logical description of the new schema and optionally the DDL statement(s). The default is 'true'. This is independent of how the connector internally records database schema history. |
| includeSchemaComments | boolean | | false | Whether the connector should parse table and column comments into metadata objects. Note: enabling this option has implications on memory usage. The number and size of ColumnImpl objects is what largely impacts how much memory is consumed by the Debezium connectors, and adding a String to each of them can potentially be quite heavy. The default is 'false'. |
| inconsistentSchemaHandlingMode | string | | fail | Specify how binlog events that belong to a table missing from the internal schema representation (i.e. the internal representation is not consistent with the database) should be handled: 'fail' (the default) raises an exception indicating the problematic event and its binlog position, causing the connector to stop; 'warn' logs the problematic event and its binlog position and skips the event; 'skip' skips the problematic event. |
| incrementalSnapshotAllowSchemaChanges | boolean | | false | Detect schema changes during an incremental snapshot and re-select the current chunk to avoid locking DDLs. Note that changes to a primary key are not supported and can cause incorrect results if performed during an incremental snapshot. Another limitation is that if a schema change affects only columns' default values, the change won't be detected until the DDL is processed from the binlog stream. This doesn't affect the snapshot events' values, but the schema of snapshot events may have outdated defaults. |
| incrementalSnapshotChunkSize | integer | | 1024 | The maximum size of a chunk (number of documents/rows) for incremental snapshotting. |
| incrementalSnapshotWatermarkingStrategy | string | | INSERT_INSERT | Specify the strategy used for watermarking during an incremental snapshot: 'insert_insert' (the default) writes both the open and the close signal into the signal data collection; 'insert_delete' writes only the open signal into the signal data collection, and the close deletes the corresponding open signal. |
| maxBatchSize | integer | | 2048 | Maximum size of each batch of source records. Defaults to 2048. |
| maxQueueSize | integer | | 8192 | Maximum size of the queue for change events read from the database log but not yet recorded or forwarded. Defaults to 8192, and should always be larger than the maximum batch size. |
| maxQueueSizeInBytes | integer | | 0 | Maximum size of the queue in bytes for change events read from the database log but not yet recorded or forwarded. Defaults to 0, meaning the feature is not enabled. |
| messageKeyColumns | string | | | A semicolon-separated list of expressions that match fully-qualified tables and column(s) to be used as the message key. Each expression must match the pattern `<fully-qualified table name>:<key columns>`, where the table name can be given as DB_NAME.TABLE_NAME or SCHEMA_NAME.TABLE_NAME depending on the specific connector, and the key columns are a comma-separated list of columns representing the custom key. For any table without an explicit key configuration, the table's primary key column(s) will be used as the message key. Example: dbserver1.inventory.orderlines:orderId,orderLineId;dbserver1.inventory.orders:id |
| minRowCountToStreamResults | integer | | 1000 | The number of rows a table must contain for the connector to stream results rather than pulling everything into memory during snapshots. Defaults to 1,000. Use 0 to stream all results and completely avoid checking the size of each table. |
| notificationEnabledChannels | string | | | List of notification channel names that are enabled. |
| notificationSinkTopicName | string | | | The name of the topic for notifications. Required when 'sink' is in the list of enabled channels. |
| pollIntervalMs | duration | | 500ms | Time to wait for new change events to appear after receiving no events, given in milliseconds. Defaults to 500 ms. |
| postProcessors | string | | | Optional list of post processors. The processors are defined using the `<post.processor.prefix>.type` config option and configured using options `<post.processor.prefix>.<option>`. |
| provideTransactionMetadata | boolean | | false | Enables transaction metadata extraction together with event counting. |
| queryFetchSize | integer | | 0 | The maximum number of records that should be loaded into memory while streaming. A value of '0' uses the default JDBC fetch size. |
| retriableRestartConnectorWaitMs | duration | | 10s | Time to wait before restarting the connector after a retriable exception occurs. Defaults to 10000 ms. |
| schemaHistoryInternal | string | | io.debezium.storage.kafka.history.KafkaSchemaHistory | The name of the SchemaHistory class that should be used to store and recover database schema changes. The configuration properties for the history are prefixed with the 'schema.history.internal.' string. |
| schemaHistoryInternalFileFilename | string | | | The path to the file that will be used to record the database schema history. |
| schemaHistoryInternalSkipUnparseableDdl | boolean | | false | Controls the action Debezium takes when it meets a DDL statement in the binlog that it cannot parse. By default the connector stops operating, but by changing the setting it can skip the statements it cannot parse. If skipping is enabled, Debezium may miss metadata changes. |
| schemaHistoryInternalStoreOnlyCapturedDatabasesDdl | boolean | | false | Controls which DDL Debezium stores in the database schema history. By default (true) only DDL that manipulates a table from a captured schema/database is stored. If set to false, Debezium stores all incoming DDL statements. |
| schemaHistoryInternalStoreOnlyCapturedTablesDdl | boolean | | false | Controls which DDL Debezium stores in the database schema history. By default (false) Debezium stores all incoming DDL statements. If set to true, only DDL that manipulates a captured table is stored. |
| schemaNameAdjustmentMode | string | | none | Specify how schema names should be adjusted for compatibility with the message converter used by the connector: 'avro' replaces the characters that cannot be used in the Avro type name with underscore; 'avro_unicode' replaces the underscore or characters that cannot be used in the Avro type name with the corresponding unicode escape, like _uxxxx (note: _ is an escape sequence, like backslash in Java); 'none' (the default) does not apply any adjustment. |
| signalDataCollection | string | | | The name of the data collection that is used to send signals/commands to Debezium. Signaling is disabled when not set. |
| signalEnabledChannels | string | | source | List of channel names that are enabled. The source channel is enabled by default. |
| signalPollIntervalMs | duration | | 5s | Interval for looking for new signals in registered channels, given in milliseconds. Defaults to 5 seconds. |
| skippedOperations | string | | t | The comma-separated list of operations to skip during streaming, defined as: 'c' for inserts/create; 'u' for updates; 'd' for deletes; 't' for truncates; and 'none' to indicate nothing is skipped. By default, only truncate operations are skipped. |
| snapshotDelayMs | duration | | 0ms | A delay period before a snapshot begins, given in milliseconds. Defaults to 0 ms. |
| snapshotFetchSize | integer | | | The maximum number of records that should be loaded into memory while performing a snapshot. |
| snapshotIncludeCollectionList | string | | | Must be set to specify the list of tables/collections whose snapshot is to be taken on creating or restarting the connector. |
| snapshotLockingMode | string | | minimal | Controls how long the connector holds onto the global read lock while it is performing a snapshot. The default is 'minimal', which means the connector holds the global read lock (and thus prevents any updates) for just the initial portion of the snapshot while the database schemas and other metadata are being read. The remaining work in a snapshot involves selecting all rows from each table, and this can be done using the snapshot process's REPEATABLE READ transaction even when the lock is no longer held and other operations are updating the database. However, in some cases it may be desirable to block all writes for the entire duration of the snapshot; in such cases set this property to 'extended'. A value of 'none' prevents the connector from acquiring any table locks during the snapshot process. This mode can only be used in combination with snapshot.mode values of 'schema_only' or 'schema_only_recovery' and is only safe to use if no schema changes are happening while the snapshot is taken. |
| snapshotLockTimeoutMs | duration | | 10s | The maximum number of milliseconds to wait for table locks at the beginning of a snapshot. If locks cannot be acquired in this time frame, the snapshot is aborted. Defaults to 10 seconds. |
| snapshotMaxThreads | integer | | 1 | The maximum number of threads used to perform the snapshot. Defaults to 1. |
| snapshotMode | string | | initial | The criteria for running a snapshot upon startup of the connector. Select one of the following snapshot options: 'when_needed': on startup, the connector runs a snapshot if one is needed; 'schema_only': if the connector does not detect any offsets for the logical server name, it runs a snapshot that captures only the schema (table structures) but no table data, and after the snapshot completes it begins to stream changes from the binlog; 'schema_only_recovery': the connector performs a snapshot that captures only the database schema history and then transitions back to streaming; use this setting to restore a corrupted or lost database schema history topic, but do not use it if the database schema was modified after the connector stopped; 'initial' (the default): if the connector does not detect any offsets for the logical server name, it runs a snapshot that captures the current full state of the configured tables, and after the snapshot completes it begins to stream changes from the binlog; 'initial_only': the connector performs a snapshot as it does for 'initial', but after completing the snapshot it stops and does not stream changes from the binlog; 'never': the connector does not run a snapshot, and upon first startup immediately begins reading from the beginning of the binlog; use 'never' with care, and only when the binlog is known to contain all history. |
| snapshotModeConfigurationBasedSnapshotData | boolean | | false | When 'snapshot.mode' is set to configuration_based, this setting specifies whether the data should be snapshotted. |
| snapshotModeConfigurationBasedSnapshotOnDataError | boolean | | false | When 'snapshot.mode' is set to configuration_based, this setting specifies whether the data should be snapshotted in case of error. |
| snapshotModeConfigurationBasedSnapshotOnSchemaError | boolean | | false | When 'snapshot.mode' is set to configuration_based, this setting specifies whether the schema should be snapshotted in case of error. |
| snapshotModeConfigurationBasedSnapshotSchema | boolean | | false | When 'snapshot.mode' is set to configuration_based, this setting specifies whether the schema should be snapshotted. |
| snapshotModeConfigurationBasedStartStream | boolean | | false | When 'snapshot.mode' is set to configuration_based, this setting specifies whether streaming should start after the snapshot. |
| snapshotModeCustomName | string | | | When 'snapshot.mode' is set to custom, this setting must specify the name of the custom implementation as provided by its 'name()' method. The implementation must implement the 'Snapshotter' interface and is called on each app boot to determine whether to do a snapshot. |
| snapshotQueryMode | string | | select_all | Controls the query used during the snapshot. |
| snapshotQueryModeCustomName | string | | | When 'snapshot.query.mode' is set to custom, this setting must specify the name of the custom implementation as provided by its 'name()' method. The implementation must implement the 'SnapshotterQuery' interface and is called to determine how to build queries during the snapshot. |
| snapshotSelectStatementOverrides | string | | | This property contains a comma-separated list of fully-qualified tables (DB_NAME.TABLE_NAME or SCHEMA_NAME.TABLE_NAME, depending on the specific connector). Select statements for the individual tables are specified in further configuration properties, one for each table, identified by the id 'snapshot.select.statement.overrides.DB_NAME.TABLE_NAME' or 'snapshot.select.statement.overrides.SCHEMA_NAME.TABLE_NAME', respectively. The value of those properties is the select statement to use when retrieving data from the specific table during snapshotting. A possible use case for large append-only tables is setting a specific point where to start (resume) snapshotting, in case a previous snapshotting was interrupted. |
| snapshotTablesOrderByRowCount | string | | disabled | Controls the order in which tables are processed in the initial snapshot. A 'descending' value orders the tables by row count descending; an 'ascending' value orders the tables by row count ascending; a value of 'disabled' (the default) disables ordering by row count. |
| sourceinfoStructMaker | string | | io.debezium.connector.mysql.MySqlSourceInfoStructMaker | The name of the SourceInfoStructMaker class that returns the SourceInfo schema and struct. |
| streamingDelayMs | duration | | 0ms | A delay period after the snapshot is completed and before streaming begins, given in milliseconds. Defaults to 0 ms. |
| tableExcludeList | string | | | A comma-separated list of regular expressions that match the fully-qualified names of tables to be excluded from monitoring. |
| tableIgnoreBuiltin | boolean | | true | Flag specifying whether built-in tables should be ignored. |
| tableIncludeList | string | | | The tables for which changes are to be captured. |
| timePrecisionMode | string | | adaptive_time_microseconds | Time, date, and timestamp values can be represented with different kinds of precision: 'adaptive_time_microseconds' bases the precision of date and timestamp values on the database column's precision, but time fields always use microsecond precision; 'connect' always represents time, date, and timestamp values using Kafka Connect's built-in representations for Time, Date, and Timestamp, which use millisecond precision regardless of the database columns' precision. |
| tombstonesOnDelete | boolean | | false | Whether delete operations should be represented by a delete event and a subsequent tombstone event (true), or only by a delete event (false). Emitting the tombstone event allows Kafka to completely delete all events pertaining to the given key once the source record is deleted. |
| topicNamingStrategy | string | | io.debezium.schema.SchemaTopicNamingStrategy | The name of the TopicNamingStrategy class that should be used to determine the topic name for data change, schema change, transaction, heartbeat, and other events. |
| topicPrefix | string | | | Topic prefix that identifies and provides a namespace for the particular database server/cluster that is capturing changes. The topic prefix should be unique across all other connectors, since it is used as a prefix for all Kafka topic names that receive events emitted by this connector. Only alphanumeric characters, hyphens, dots and underscores may be used. |
| transactionMetadataFactory | string | | io.debezium.pipeline.txmetadata.DefaultTransactionMetadataFactory | Class used to create the transaction context and the transaction struct/schemas. |
| useNongracefulDisconnect | boolean | | false | Whether to use socket.setSoLinger(true, 0) when the BinaryLogClient keepalive thread triggers a disconnect for a stale connection. |
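
To illustrate how the options above compose, the following sketch restricts capture to two tables, snapshots table structures only, and passes a raw property through the `additionalProperties.` prefix. All hosts, names, and the registry URL are illustrative, and the file-based schema history class is the same assumption as in the earlier sketch.

```java
import org.apache.camel.builder.RouteBuilder;

public class InventoryCdcRoute extends RouteBuilder {
    @Override
    public void configure() throws Exception {
        from("debezium-mysql:inventory-connector"
                + "?databaseHostname=mysql.internal"
                + "&databaseUser=debezium"
                + "&databasePassword=dbz"
                + "&databaseServerId=223344"
                + "&topicPrefix=inventory"
                // Capture only the 'inventory' database and two of its tables.
                + "&databaseIncludeList=inventory"
                + "&tableIncludeList=inventory.orders,inventory.customers"
                // Snapshot table structures only; data changes come from the binlog.
                + "&snapshotMode=schema_only"
                // File-based offset and schema-history stores, as in the earlier sketch.
                + "&offsetStorageFileName=/tmp/inventory-offsets.dat"
                + "&schemaHistoryInternal=io.debezium.storage.file.history.FileSchemaHistory"
                + "&schemaHistoryInternalFileFilename=/tmp/inventory-schema-history.dat"
                // Anything the endpoint does not expose directly can be passed through
                // with the additionalProperties. prefix (this registry URL is illustrative).
                + "&additionalProperties.schema.registry.url=http://localhost:8811/avro")
            .to("log:cdc?showHeaders=true");
    }
}
```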