HFTransformerPipelineDataset
HFTransformerPipelineDataset loads pretrained Hugging Face transformers using the transformers library.
kedro_datasets.huggingface.HFTransformerPipelineDataset
HFTransformerPipelineDataset(
    *,
    task=None,
    model_name=None,
    pipeline_kwargs=None,
    metadata=None
)
Bases: AbstractDataset
HFTransformerPipelineDataset loads pretrained Hugging Face transformers
using the transformers library (https://pypi.org/project/transformers).
Examples:
Using the YAML API:
summarizer_model:
  type: huggingface.HFTransformerPipelineDataset
  task: summarization

fill_mask_model:
  type: huggingface.HFTransformerPipelineDataset
  task: fill-mask
  model_name: Twitter/twhin-bert-base
Using the Python API:
>>> from kedro_datasets.huggingface import HFTransformerPipelineDataset
>>>
>>> dataset = HFTransformerPipelineDataset(
...     task="text-classification", model_name="prajjwal1/bert-tiny"
... )
>>> model = dataset.load()
>>> assert model("Hello world")[0]["label"].startswith("LABEL_")
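The YAML and Python examples above configure the same class. As a rough sketch of how the two relate, and assuming the usual Kedro convention that every key of a catalog entry other than type is passed to the dataset constructor as a keyword argument, the fill_mask_model entry can be translated by hand. The helper names catalog_entry_to_kwargs and build_dataset are hypothetical and exist only for this illustration; they are not part of kedro_datasets.

```python
from typing import Any


def catalog_entry_to_kwargs(entry: dict[str, Any]) -> dict[str, Any]:
    """Return the constructor keyword arguments implied by a catalog entry.

    Assumption: every key except "type" maps directly to a keyword
    argument of the dataset constructor.
    """
    return {key: value for key, value in entry.items() if key != "type"}


def build_dataset(entry: dict[str, Any]):
    """Hypothetical helper: build the dataset from a catalog-style dict.

    The import is deferred so the rest of this sketch runs even when
    kedro-datasets and transformers are not installed.
    """
    from kedro_datasets.huggingface import HFTransformerPipelineDataset

    return HFTransformerPipelineDataset(**catalog_entry_to_kwargs(entry))


# The fill_mask_model entry from the YAML example, written as a dict.
fill_mask_entry = {
    "type": "huggingface.HFTransformerPipelineDataset",
    "task": "fill-mask",
    "model_name": "Twitter/twhin-bert-base",
}

print(catalog_entry_to_kwargs(fill_mask_entry))
# {'task': 'fill-mask', 'model_name': 'Twitter/twhin-bert-base'}
```

With kedro-datasets and transformers installed, build_dataset(fill_mask_entry).load() should then behave like the Python API example, although it loads a different model.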
Source code in kedro_datasets/huggingface/transformer_pipeline_dataset.py
_describe
_describe()
Source code in kedro_datasets/huggingface/transformer_pipeline_dataset.py
load
load()
Source code in kedro_datasets/huggingface/transformer_pipeline_dataset.py
save
save(pipeline)
Source code in kedro_datasets/huggingface/transformer_pipeline_dataset.py