
Welcome to Kedro’s documentation!
Introduction
Get started
Tutorial
Kedro project setup
Data Catalog
- The Data Catalog
- Using the Data Catalog within Kedro configuration
- Specifying the location of the dataset
- Data Catalog *_args parameters
- Using the Data Catalog with the YAML API (see the sketch after this section)
- Creating a Data Catalog YAML configuration file via CLI
- Adding parameters
- Feeding in credentials
- Loading multiple datasets that have similar configuration
- Transcoding datasets
- Transforming datasets
- Versioning datasets and ML models
- Using the Data Catalog with the Code API
- Kedro IO
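
To make the YAML and Code API entries above concrete, here is a minimal sketch of a single catalog entry. The dataset name bikes and its filepath are hypothetical; DataCatalog.from_config consumes the same dictionary structure that conf/base/catalog.yml declares in YAML.

```python
from kedro.io import DataCatalog

# The same structure that conf/base/catalog.yml declares in YAML.
# "bikes" and its filepath are hypothetical names for illustration.
config = {
    "bikes": {
        "type": "pandas.CSVDataSet",          # resolved to kedro.extras.datasets.pandas.CSVDataSet
        "filepath": "data/01_raw/bikes.csv",  # local path; could equally be s3://, gcs://, ...
    }
}

catalog = DataCatalog.from_config(config)
bikes = catalog.load("bikes")  # reads the CSV into a pandas DataFrame (the file must exist)
```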
Nodes and pipelines
Extend Kedro
- Common use cases
- Hooks
- Custom datasets
- Scenario
- Project setup
- The anatomy of a dataset
- Implement the _load method with fsspec
- Implement the _save method with fsspec
- Implement the _describe method
- The complete example (see the sketch after the Extend Kedro list)
- Integration with PartitionedDataSet
- Versioning
- Thread-safety
- How to handle credentials and different filesystems
- How to contribute a custom dataset implementation
- Kedro plugins
- Create a Kedro starter
- Dataset transformers (deprecated)
- Decorators (deprecated)
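
The custom-dataset entries above follow one pattern: subclass AbstractDataSet and route _load, _save and _describe through fsspec so a single class works across local and remote filesystems. A minimal sketch, assuming a hypothetical plain-text dataset (the class name TextDataSet is illustrative; the helpers live in kedro.io.core):

```python
from pathlib import PurePosixPath
from typing import Any, Dict

import fsspec

from kedro.io import AbstractDataSet
from kedro.io.core import get_filepath_str, get_protocol_and_path


class TextDataSet(AbstractDataSet):
    """Hypothetical dataset that loads and saves plain text via fsspec."""

    def __init__(self, filepath: str):
        # Split e.g. "s3://bucket/file.txt" into protocol ("s3") and path.
        protocol, path = get_protocol_and_path(filepath)
        self._protocol = protocol
        self._filepath = PurePosixPath(path)
        self._fs = fsspec.filesystem(self._protocol)

    def _load(self) -> str:
        load_path = get_filepath_str(self._filepath, self._protocol)
        with self._fs.open(load_path, mode="r") as f:
            return f.read()

    def _save(self, data: str) -> None:
        save_path = get_filepath_str(self._filepath, self._protocol)
        with self._fs.open(save_path, mode="w") as f:
            f.write(data)

    def _describe(self) -> Dict[str, Any]:
        return dict(filepath=self._filepath, protocol=self._protocol)
```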
Logging
Development
Deployment
- Deployment guide
- Single-machine deployment
- Distributed deployment
- Deployment with Argo Workflows
- Deployment with Prefect
- Deployment with Kubeflow Pipelines
- Deployment with AWS Batch
- Deployment to a Databricks cluster
- How to integrate Amazon SageMaker into your Kedro pipeline
- How to deploy your Kedro pipeline with AWS Step Functions
- How to deploy your Kedro pipeline on Apache Airflow with Astronomer
Tools integration
- Build a Kedro pipeline with PySpark
- Centralise Spark configuration in conf/base/spark.yml
- Initialise a SparkSession in custom project context class
- Use Kedro’s built-in Spark datasets to load and save raw data
- Spark and Delta Lake interaction
- Use MemoryDataSet for intermediary DataFrame
- Use MemoryDataSet with copy_mode="assign" for non-DataFrame Spark objects (see the sketch after this list)
- Tips for maximising concurrency using ThreadRunner
- Use Kedro with IPython and Jupyter Notebooks/Lab
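
As a hint at the copy_mode="assign" entry above: Spark objects generally cannot be deep-copied, so a MemoryDataSet holding one must hand back the object itself rather than a copy. A minimal sketch with a hypothetical dataset name:

```python
from kedro.io import DataCatalog, MemoryDataSet

# "assign" stores and returns the same object instead of copying it,
# which is what non-copyable Spark objects need between nodes.
# The dataset name "spark_model_input" is hypothetical.
catalog = DataCatalog({"spark_model_input": MemoryDataSet(copy_mode="assign")})
```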
FAQs
- Frequently asked questions
- What is Kedro?
- Who maintains Kedro?
- What are the primary advantages of Kedro?
- How does Kedro compare to other projects?
- What is the data engineering convention?
- How do I upgrade Kedro?
- How can I use a development version of Kedro?
- How can I find out more about Kedro?
- How can I cite Kedro?
- How can I get my question answered?
- Kedro architecture overview
- Kedro Principles
Resources
API documentation
Kedro is a framework that makes it easy to build robust and scalable data pipelines by providing uniform project templates, data abstraction, configuration and pipeline assembly.
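
For orientation before diving into the sections above, a minimal, self-contained sketch of the data-abstraction and pipeline-assembly ideas in code: a node, a pipeline assembled from it, a Data Catalog feeding it, and a runner executing it. All names are hypothetical.

```python
from kedro.io import DataCatalog, MemoryDataSet
from kedro.pipeline import Pipeline, node
from kedro.runner import SequentialRunner


def count_words(text: str) -> int:
    return len(text.split())


# Assemble a one-node pipeline: "text" in, "n_words" out.
pipeline = Pipeline([node(count_words, inputs="text", outputs="n_words")])

# The catalog supplies the input dataset.
catalog = DataCatalog({"text": MemoryDataSet("hello kedro world")})

# Run it; outputs not registered in the catalog are returned in a dict.
print(SequentialRunner().run(pipeline, catalog))  # {'n_words': 3}
```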