airbyte-cdk/bulk/toolkits/legacy-task-load-parquet/README.md
DEPRECATED: This toolkit is for legacy (non-dataflow) connectors that use Parquet format only.
This toolkit contains Parquet format utilities for connectors that use the legacy task-based architecture. It provides Parquet schema generation, value conversion via Avro, and the MapperPipeline pattern that predates the modern dataflow pipeline.
New connectors should use core-load with the dataflow pipeline. For Iceberg/Parquet destinations, use load-iceberg-parquet instead. This toolkit should only be used and updated for the existing connectors that depend on it.
To use this toolkit, add it to your connector's toolkits list along with useLegacyTaskLoader:
airbyteBulkConnector {
core = 'load'
toolkits = ['legacy-task-load-parquet']
useLegacyTaskLoader = true
}
ParquetMapperPipelineFactory - Legacy MapperPipeline for Parquet formattingConnectors should migrate to the modern dataflow pipeline when possible. For Iceberg-based destinations, use load-iceberg-parquet. The dataflow architecture provides better performance, cleaner separation of concerns, and is the actively maintained code path.