Hugging Face - Datasets
Imports datasets from Hugging Face (https://huggingface.co/datasets)
Only datasets with Parquet exports can be imported with this connector.
Configuration
Input | Type | Description | Default Value |
---|---|---|---|
dataset_name | string | Dataset Name. | |
dataset_subsets | array | Dataset Subsets. Dataset Subsets to import. Will import all of them if nothing is provided (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details) | |
dataset_splits | array | Dataset Splits. Splits to import. Will import all of them if nothing is provided (see https://huggingface.co/docs/dataset-viewer/en/configs_and_splits for more details) |
Streams
Stream Name | Primary Key | Pagination | Supports Full Sync | Supports Incremental |
---|---|---|---|---|
rows | DefaultPaginator | ✅ | ❌ | |
splits | No pagination | ✅ | ❌ |
Changelog
Expand to review
Version | Date | Pull Request | Subject |
---|---|---|---|
0.0.1 | 2024-11-28 | Initial release by @michel-tricot via Connector Builder |