What is Curvine
Curvine is a distributed caching system implemented in Rust, designed for high concurrency, high throughput, low latency, and low resource consumption. Unlike key-value systems such as Redis or TiKV, Curvine provides file caching only. It is a caching layer rather than a storage system: data persistence still relies on the underlying file system or object storage system.
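The caching-layer idea above can be illustrated with a minimal read-through sketch. This is not Curvine's API: `BackingStore`, `MemoryStore`, and `FileCache` are hypothetical names invented for this example, and the in-memory store merely stands in for a real file system or object store. The sketch only shows that the cache can drop data at any time because the underlying store remains the source of truth.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the underlying file system or object store,
/// which remains the source of truth for persisted data.
pub trait BackingStore {
    fn read(&self, path: &str) -> Option<Vec<u8>>;
}

/// In-memory backing store used only for this illustration.
pub struct MemoryStore {
    pub files: HashMap<String, Vec<u8>>,
}

impl BackingStore for MemoryStore {
    fn read(&self, path: &str) -> Option<Vec<u8>> {
        self.files.get(path).cloned()
    }
}

/// Read-through file cache: serves hits locally and fills misses
/// from the backing store.
pub struct FileCache<S: BackingStore> {
    store: S,
    cached: HashMap<String, Vec<u8>>,
    pub hits: u64,
    pub misses: u64,
}

impl<S: BackingStore> FileCache<S> {
    pub fn new(store: S) -> Self {
        FileCache { store, cached: HashMap::new(), hits: 0, misses: 0 }
    }

    /// Return the file contents, loading them from the store on a miss.
    pub fn read(&mut self, path: &str) -> Option<Vec<u8>> {
        let hit = self.cached.get(path).cloned();
        if let Some(data) = hit {
            self.hits += 1;
            return Some(data);
        }
        self.misses += 1;
        // A miss falls through to the store; unknown paths yield None.
        let data = self.store.read(path)?;
        self.cached.insert(path.to_string(), data.clone());
        Some(data)
    }

    /// Evicting a cached file loses nothing: the store still holds it.
    pub fn evict(&mut self, path: &str) {
        self.cached.remove(path);
    }
}
```

A first read of a path counts as a miss and populates the cache, a repeated read is a hit, and a read after `evict` simply goes back to the store.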
What problem does it solve
- Large-scale data I/O performance bottlenecks
- Single-machine cache capacity limitations
Which practical scenarios are a good fit for Curvine acceleration?

Fig. 1: Curvine Application Scenarios.
 As shown in the figure above, Curvine is designed for the following five core scenarios:
- Accelerating intermediate data processing in big data shuffle operations
- Caching hot table data for faster big data analytics
- Boosting AI training efficiency through dataset caching
- Accelerating model file distribution via caching layer
- Cross-cloud data caching to mitigate performance bottlenecks of dedicated cloud connections
These use cases are just the beginning. Put simply, Curvine addresses the growing conflict between escalating computational demands and the I/O bottlenecks of distributed cache systems.
Architecture Overview
Curvine's architecture follows three design principles: Simplicity, Excellence, and Universality.

Fig. 2: Curvine Application Scenarios.
Simplicity: A lightweight design with only two roles in the caching service, master and worker. Non-performance-critical modules reuse open-source or existing technologies wherever possible, which keeps code complexity low.
Excellence: Components that are critical to performance (e.g., the underlying RPC communication framework and the FUSE implementation) are designed and optimized independently with a performance-first mindset.
Universality: Compatible with multiple existing access modes. The underlying storage layer supports mainstream distributed file systems and object storage systems, ensuring versatility and ease of use.
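To make the two-role design more concrete, here is a minimal single-process sketch. It rests on assumptions that the text above does not state: that the master keeps only metadata (which worker holds which block) while workers hold the cached bytes, and that blocks are placed round-robin. `Master`, `Worker`, `Cluster`, and all method names are hypothetical and do not reflect Curvine's actual types or placement policy.

```rust
use std::collections::HashMap;

/// Worker role (hypothetical type): holds cached block data.
#[derive(Default)]
pub struct Worker {
    blocks: HashMap<u64, Vec<u8>>,
}

/// Master role (hypothetical type): holds metadata only, mapping a file
/// path to an ordered list of (worker index, block id) pairs.
#[derive(Default)]
pub struct Master {
    locations: HashMap<String, Vec<(usize, u64)>>,
    next_block_id: u64,
}

/// A toy cluster that wires one master to several workers in-process.
pub struct Cluster {
    pub master: Master,
    pub workers: Vec<Worker>,
}

impl Cluster {
    pub fn new(worker_count: usize) -> Self {
        assert!(worker_count > 0, "at least one worker is required");
        Cluster {
            master: Master::default(),
            workers: (0..worker_count).map(|_| Worker::default()).collect(),
        }
    }

    /// Split `data` into blocks and spread them across workers round-robin
    /// (an assumed placement policy, chosen only for simplicity).
    pub fn write(&mut self, path: &str, data: &[u8], block_size: usize) {
        assert!(block_size > 0, "block size must be positive");
        let mut locs = Vec::new();
        for chunk in data.chunks(block_size) {
            let id = self.master.next_block_id;
            self.master.next_block_id += 1;
            let w = (id as usize) % self.workers.len();
            self.workers[w].blocks.insert(id, chunk.to_vec());
            locs.push((w, id));
        }
        self.master.locations.insert(path.to_string(), locs);
    }

    /// Ask the master where the blocks live, then fetch them from workers.
    pub fn read(&self, path: &str) -> Option<Vec<u8>> {
        let locs = self.master.locations.get(path)?;
        let mut out = Vec::new();
        for (w, id) in locs {
            out.extend_from_slice(self.workers[*w].blocks.get(id)?);
        }
        Some(out)
    }

    /// Number of blocks currently held by the given worker.
    pub fn blocks_on(&self, worker: usize) -> usize {
        self.workers[worker].blocks.len()
    }
}
```

In a real deployment the roles would run as separate processes and communicate over RPC; the sketch collapses them into one process to keep the division of responsibilities visible.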
On Open Source
Internally, deploying Curvine in high-concurrency, high-throughput big data scenarios has delivered significant performance gains. We now aim to work with external partners to co-build this solution and collectively accelerate the transition of infrastructure to Rust.
 https://github.com/curvineio/curvine
 Powered by OPPO Bigdata.