Unlock Real-Time Analytics with Project Antalya: ClickHouse and Data Lakes
Project Antalya is an open-source branch of the ClickHouse code that brings real-time queries to data lakes. It provides a single SQL endpoint for data held in native ClickHouse tables and data stored in a data lake.
What is Project Antalya and Why Should You Care?
Project Antalya extends ClickHouse to work efficiently with data lakes, offering real-time analytics on shared data used by AI and data science applications. No need to duplicate data or use separate systems!
- Unified Access: Access data seamlessly across native ClickHouse and data lakes.
- Open Formats: Leverage open table formats for easy access from various applications.
- Scalable Compute: Scale compute resources independently for ingestion, transformation, and querying.
Key Features and Benefits of Project Antalya
Project Antalya focuses on optimizing ClickHouse performance with Parquet files in S3-compatible storage and adapting ClickHouse to use Iceberg tables as shared storage.
Here's how Antalya achieves its goals:
- Optimized Query Performance: Run fast queries directly on Parquet files stored in object storage.
- Swarm Clusters: Scale query and insert operations linearly with stateless server pools on shared object storage.
- Iceberg Table Integration: Seamlessly integrate Iceberg tables for shared storage.
- Unlimited Storage Extension: Extend existing ClickHouse tables onto unlimited Iceberg storage with transparent querying.
- Simplified Backup & DR: Use Iceberg snapshots for easy backup and disaster recovery.
- Full Compatibility: Maintain compatibility with upstream ClickHouse features and bug fixes.
Quick Start: Get Up and Running with Project Antalya
The Docker Quick Start walks you through setting up Antalya in minutes using Docker Compose on your laptop.
Scalable Swarm Cluster Example:
For a fully functional swarm cluster implementation, explore the Kubernetes example. It demonstrates how to deploy swarm clusters that query a large blockchain dataset stored in Parquet format.
Access Project Antalya Binaries and Containers
Get started with Project Antalya using pre-built packages and containers.
Packages
Find ClickHouse server and keeper packages on the builds.altinity.cloud page.
Containers
Project Antalya ClickHouse server and ClickHouse keeper containers are available on Docker Hub. Use these Docker commands to get started:
docker run altinity/clickhouse-server:latest
docker run altinity/clickhouse-keeper:latest
Always check Docker Hub for the latest build.
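As a minimal sketch of the commands above, the helpers below build the image reference from a component and a tag, so a specific build from Docker Hub can be substituted for latest. The function names image_ref and start_server and the container name antalya-server are hypothetical and chosen only for illustration. The published ports 8123 and 9000 are the ClickHouse default HTTP and native protocol ports.

```shell
#!/bin/sh
# Hypothetical helper: print the image reference for a component and a tag.
# $1 = component ("server" or "keeper"), $2 = tag (defaults to "latest").
image_ref() {
  case "$1" in
    server|keeper) ;;
    *) echo "unknown component: $1" >&2; return 1 ;;
  esac
  printf 'altinity/clickhouse-%s:%s\n' "$1" "${2:-latest}"
}

# Hypothetical helper: start a detached server container.
# $1 = tag (defaults to "latest"), $2 = container name (assumed default).
start_server() {
  docker run -d \
    --name "${2:-antalya-server}" \
    -p 8123:8123 \
    -p 9000:9000 \
    "$(image_ref server "$1")"
}

# Example usage (not executed here):
#   start_server latest
#   docker exec antalya-server clickhouse-client --query "SELECT version()"
```

Passing an explicit tag keeps a setup reproducible, while latest follows whatever build was most recently published.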
Project Antalya Documentation and Code
Dive deeper with comprehensive documentation and readily available code.
Documentation
Find detailed documentation in the docs directory, including a project concepts guide, feature status updates, and command/configuration references.
Code
Access the Project Antalya code with these commands:
git clone git@github.com:Altinity/ClickHouse.git Altinity-ClickHouse
cd Altinity-ClickHouse
git branch
You’ll be in the antalya branch by default.
Build and Contribute to Project Antalya
Contribute to Project Antalya! Contact us in Slack to join the project and help shape the future of real-time analytics on data lakes.
Need Community or Commercial Support?
- Community Support: Need help or want to share ideas? Join the AltinityDB Slack Workspace or log an issue.
- Commercial Support: Altinity offers enterprise-class support for ClickHouse, including data lake integrations. Contact Altinity for questions, issues, or a free consultation with a ClickHouse expert.
Jumpstart your real-time data lake analytics journey today!