kedro.datasets

Description

kedro_datasets is where you can find all of Kedro’s data connectors.

Classes

kedro.datasets.api.APIDataSet(url[, method, …])

APIDataSet loads the data from HTTP(S) APIs.

kedro.datasets.biosequence.BioSequenceDataSet(…)

BioSequenceDataSet loads and saves data to a sequence file.

kedro.datasets.dask.ParquetDataSet(filepath)

ParquetDataSet loads and saves data to parquet file(s).

kedro.datasets.email.EmailMessageDataSet(…)

EmailMessageDataSet loads/saves an email message from/to a file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.geopandas.GeoJSONDataSet(filepath)

GeoJSONDataSet loads/saves data to a GeoJSON file using an underlying filesystem (eg: local, S3, GCS).

kedro.datasets.holoviews.HoloviewsWriter(…)

HoloviewsWriter saves Holoviews objects to image file(s) in an underlying filesystem (e.g.

kedro.datasets.json.JSONDataSet(filepath[, …])

JSONDataSet loads/saves data from/to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.matplotlib.MatplotlibWriter(…)

MatplotlibWriter saves one or more Matplotlib objects as image files to an underlying filesystem (e.g.

kedro.datasets.networkx.GMLDataSet(filepath)

GMLDataSet loads and saves graphs to a GML file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.networkx.GraphMLDataSet(filepath)

GraphMLDataSet loads and saves graphs to a GraphML file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.networkx.JSONDataSet(filepath)

NetworkX JSONDataSet loads and saves graphs to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.CSVDataSet(filepath[, …])

CSVDataSet loads/saves data from/to a CSV file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.ExcelDataSet(filepath)

ExcelDataSet loads/saves data from/to a Excel file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.FeatherDataSet(filepath)

FeatherDataSet loads and saves data to a feather file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.GBQQueryDataSet([sql, …])

GBQQueryDataSet loads data from a provided SQL query from Google BigQuery.

kedro.datasets.pandas.GBQTableDataSet(…[, …])

GBQTableDataSet loads and saves data from/to Google BigQuery.

kedro.datasets.pandas.GenericDataSet(…[, …])

pandas.GenericDataSet loads/saves data from/to a data file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.HDFDataSet(filepath, key)

HDFDataSet loads/saves data from/to a hdf file using an underlying filesystem (e.g.

kedro.datasets.pandas.JSONDataSet(filepath)

JSONDataSet loads/saves data from/to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.ParquetDataSet(filepath)

ParquetDataSet loads/saves data from/to a Parquet file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pandas.SQLQueryDataSet([sql, …])

SQLQueryDataSet loads data from a provided SQL query.

kedro.datasets.pandas.SQLTableDataSet(…[, …])

SQLTableDataSet loads data from a SQL table and saves a pandas dataframe to a table.

kedro.datasets.pandas.XMLDataSet(filepath[, …])

XMLDataSet loads/saves data from/to a XML file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pickle.PickleDataSet(filepath)

PickleDataSet loads/saves data from/to a Pickle file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.pillow.ImageDataSet(filepath)

ImageDataSet loads/saves image data as numpy from an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.plotly.JSONDataSet(filepath)

JSONDataSet loads/saves a plotly figure from/to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.plotly.PlotlyDataSet(…[, …])

PlotlyDataSet generates a plot from a pandas DataFrame and saves it to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.redis.PickleDataSet(key[, …])

PickleDataSet loads/saves data from/to a Redis database.

kedro.datasets.spark.DeltaTableDataSet(filepath)

DeltaTableDataSet loads data into DeltaTable objects.

kedro.datasets.spark.SparkDataSet(filepath)

SparkDataSet loads and saves Spark dataframes.

kedro.datasets.spark.SparkHiveDataSet(…[, …])

SparkHiveDataSet loads and saves Spark dataframes stored on Hive.

kedro.datasets.spark.SparkJDBCDataSet(url, table)

SparkJDBCDataSet loads data from a database table accessible via JDBC URL url and connection properties and saves the content of a PySpark DataFrame to an external database table via JDBC.

kedro.datasets.svmlight.SVMLightDataSet(filepath)

SVMLightDataSet loads/saves data from/to a svmlight/libsvm file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.tensorflow.TensorFlowModelDataset(…)

TensorflowModelDataset loads and saves TensorFlow models.

kedro.datasets.text.TextDataSet(filepath[, …])

TextDataSet loads/saves data from/to a text file using an underlying filesystem (e.g.: local, S3, GCS)

kedro.datasets.tracking.JSONDataSet(filepath)

JSONDataSet saves data to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.tracking.MetricsDataSet(filepath)

MetricsDataSet saves data to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).

kedro.datasets.video.VideoDataSet(filepath)

VideoDataSet loads / save video data from a given filepath as sequence of PIL.Image.Image using OpenCV.

kedro.datasets.yaml.YAMLDataSet(filepath[, …])

YAMLDataSet loads/saves data from/to a YAML file using an underlying filesystem (e.g.: local, S3, GCS).