# Spring-Cloud-Data-Flow **Repository Path**: noodle-framework/Spring-Cloud-Data-Flow ## Basic Information - **Project Name**: Spring-Cloud-Data-Flow - **Description**: Spring Cloud Data Flow 是基于原生云对 Spring XD 的重新设计,该项目目标是简化大数据应用的开发 - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 7 - **Created**: 2023-08-30 - **Last Updated**: 2023-08-30 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README
*Spring Cloud Data Flow* is a microservices-based toolkit for building streaming and batch data processing pipelines in Cloud Foundry and Kubernetes. Data processing pipelines consist of Spring Boot apps, built using the [Spring Cloud Stream](https://github.com/spring-cloud/spring-cloud-stream) or [Spring Cloud Task](https://github.com/spring-cloud/spring-cloud-task) microservice frameworks. This makes Spring Cloud Data Flow ideal for a range of data processing use cases, from import/export to event streaming and predictive analytics. ---- ## Components **Architecture**: The Spring Cloud Data Flow Server is a Spring Boot application that provides RESTful API and REST clients (Shell, Dashboard, Java DSL). A single Spring Cloud Data Flow installation can support orchestrating the deployment of streams and tasks to Local, Cloud Foundry, and Kubernetes. Familiarize yourself with the Spring Cloud Data Flow [architecture](https://dataflow.spring.io/docs/concepts/architecture/) and [feature capabilities](https://dataflow.spring.io/features/). **Deployer SPI**: A Service Provider Interface (SPI) is defined in the [Spring Cloud Deployer](https://github.com/spring-cloud/spring-cloud-deployer) project. The Deployer SPI provides an abstraction layer for deploying the apps for a given streaming or batch data pipeline and managing the application lifecycle. Spring Cloud Deployer Implementations: * [Local](https://github.com/spring-cloud/spring-cloud-deployer-local) * [Cloud Foundry](https://github.com/spring-cloud/spring-cloud-deployer-cloudfoundry) * [Kubernetes](https://github.com/spring-cloud/spring-cloud-deployer-kubernetes) **Domain Model**: The Spring Cloud Data Flow [domain module](https://github.com/spring-cloud/spring-cloud-dataflow/tree/master/spring-cloud-dataflow-core) includes the concept of a *stream* that is a composition of Spring Cloud Stream applications in a linear data pipeline from a *source* to a *sink*, optionally including *processor* application(s) in between. The domain also includes the concept of a *task*, which may be any process that does not run indefinitely, including [Spring Batch](https://github.com/spring-projects/spring-batch) jobs. **Application Registry**: The [App Registry](https://github.com/spring-cloud/spring-cloud-dataflow/tree/master/spring-cloud-dataflow-registry) maintains the metadata of the catalog of reusable applications. For example, if relying on Maven coordinates, an application URI would be of the format: `maven://