Pathway is a high-performance Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. The Rust-powered engine treats data as a continuous stream of changes rather than static snapshots — making it a natural fit for AI applications that need to stay in sync with live data sources.
Pathway connects to PostgreSQL, Kafka, S3, and live APIs, monitoring them for changes and automatically processing updates while incrementally maintaining vector databases. A unique capability: mixing batch and streaming logic in the same workflow, so systems can be continuously trained with new streaming data and revised without requiring full batch reuploads. The framework supports stateless and stateful transformations (joins, windowing, sorting), with many transformations implemented in Rust.
Pathway provides dedicated LLM tooling for live LLM/RAG pipelines, with wrappers for common LLM services. Used in production at NATO and Intel for real-time streaming AI workloads. Recently crossed 50K GitHub stars on the strength of its 'fresh data for AI' positioning — a deployment-first architecture that solves the real-time data challenge other RAG frameworks struggle with.
Free trial available
Data engineering teams building real-time AI/RAG pipelines that need to stay in sync with live data sources
Top companies in RAG Frameworks you can use instead of Pathway.
Companies from adjacent layers in the AI stack that work well with Pathway.
Last verified: April 29, 2026