Kedro – An open-source framework for data science code
Kedro is an open-source Python framework that enhances data science code development, offering features like pipeline visualization, a Data Catalog, and flexible deployment, successfully used by companies like Telkomsel and Beamery.
Read original articleKedro is an open-source Python framework designed to enhance the development of production-ready data science code by integrating software engineering best practices. It provides a structured approach to managing complex data and machine-learning pipelines, allowing data scientists to focus on problem-solving rather than the intricacies of code management. Key features include pipeline visualization through Kedro-Viz, a versatile Data Catalog for managing various data formats and sources, and support for numerous integrations with tools like Amazon SageMaker and Apache Spark. Kedro also offers a project template for standardizing project organization, dedicated IDE support for Visual Studio Code, and a dataset-driven workflow that simplifies task management within pipelines. Additionally, it emphasizes coding standards through test-driven development and provides flexible deployment options across various platforms. Kedro has been successfully implemented in production environments, such as at Telkomsel and Beamery, where it has significantly improved workflow efficiency and data handling capabilities. The framework is maintained by the Linux Foundation and has a supportive community for users.
- Kedro is an open-source framework for building production-ready data science code.
- It standardizes project organization and simplifies complex data pipeline management.
- Key features include pipeline visualization, a versatile Data Catalog, and extensive integration options.
- Kedro supports coding standards and offers flexible deployment strategies.
- It has been successfully used in production by companies like Telkomsel and Beamery.
Related
Show HN: Open-source CLI coding framework using Claude
The GitHub repository for "Dravid (DRD) - AI-Powered CLI Coding Framework" streamlines coding with AI. It aids in project setup, code generation, and file management. The README covers features, installation, usage, and support.
Show HN: Create how-to videos and guides fast
Kroto is a versatile platform for creating and sharing guides and tutorials. It features AI-enhanced video tutorials, customizable branding, multilingual support, and 24/7 customer service. Security measures are robust, and a mobile app is in progress.
Show HN: Why your link management tools fail, even with Notion, Pocket, etc.
Cokeep is a collaborative bookmark manager that uses AI for organization and sharing, allowing users to create visual boards, comment on bookmarks, and manage various media types securely.
Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL
Cobol REKT is a toolkit for reverse engineering legacy Cobol code, offering flowchart generation, Neo4J integration, execution tracing, and static analysis, with planned features for code detection and knowledge integration.
Show HN: Denormalized – Embeddable Stream Processing in Rust and DataFusion
Denormalized is a developing stream processing engine based on Apache DataFusion, supporting Kafka. Users can start with Docker and Rust/Cargo, with future features planned for enhanced functionality.
Related
Show HN: Open-source CLI coding framework using Claude
The GitHub repository for "Dravid (DRD) - AI-Powered CLI Coding Framework" streamlines coding with AI. It aids in project setup, code generation, and file management. The README covers features, installation, usage, and support.
Show HN: Create how-to videos and guides fast
Kroto is a versatile platform for creating and sharing guides and tutorials. It features AI-enhanced video tutorials, customizable branding, multilingual support, and 24/7 customer service. Security measures are robust, and a mobile app is in progress.
Show HN: Why your link management tools fail, even with Notion, Pocket, etc.
Cokeep is a collaborative bookmark manager that uses AI for organization and sharing, allowing users to create visual boards, comment on bookmarks, and manage various media types securely.
Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL
Cobol REKT is a toolkit for reverse engineering legacy Cobol code, offering flowchart generation, Neo4J integration, execution tracing, and static analysis, with planned features for code detection and knowledge integration.
Show HN: Denormalized – Embeddable Stream Processing in Rust and DataFusion
Denormalized is a developing stream processing engine based on Apache DataFusion, supporting Kafka. Users can start with Docker and Rust/Cargo, with future features planned for enhanced functionality.