mirrors_databricks



Repositories: 219 · Members: 1

Popular

This module sets up a multi-workspace model registry between an Azure Databricks development (dev) workspace, a staging workspace, and a production (prod) workspace, allowing READ access from the dev and staging workspaces to the staging and prod model registries. It also links pre-existing Azure Active Directory (AAD) applications to the service principals.

3
0
0

This repo provides a customizable stack for starting new ML projects on Databricks that follow production best practices out of the box.

3
0
0

BTrace - a safe, dynamic tracing tool for the Java platform

3
0
0

Databricks SQL Connector for Node.js

3
0
0

This module sets up a multi-workspace model registry between a Databricks AWS development (dev) workspace, a staging workspace, and a production (prod) workspace, allowing READ access from the dev and staging workspaces to the staging and prod model registries.

3
0
0

Bazel rules for building Protobuf and gRPC code and libraries from proto_library targets

3
0
0

3

0

0

genomics-pipelines

Secondary analysis pipelines parallelized with Apache Spark

Last updated: 8 days ago

3

0

0

jarjar-abrams

An experimental Scala extension of Jar Jar Links

Last updated: 8 days ago

2

0

0

tableau-connector

Last updated: 8 days ago

2

0

0

build-tooling

Databricks Education department's curriculum build tool chain

Last updated: 8 days ago

2

0

0

sjsonnet

Last updated: 8 days ago

2

0

1

drunken-data-quality-1

Spark package for checking data quality

Last updated: 8 days ago

2

0

0

spark-tfocs

A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)

Last updated: 8 days ago

2

0

0

benchmarks

A place in which we publish scripts for reproducible benchmarks.

Last updated: 8 days ago

2

0

0

spark-csv

CSV Data Source for Apache Spark 1.x

Last updated: 8 days ago

2

0

0

simr

Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure

Last updated: 8 days ago

2

0

0

als-benchmark-scripts

Scripts to benchmark distributed Alternating Least Squares (ALS)

Last updated: 8 days ago

2

0

0

dagster

A data orchestrator for machine learning, analytics, and ETL.

Last updated: 8 days ago

3

0

0

upickle

uPickle: a simple, fast, dependency-free JSON & Binary (MessagePack) serialization library for Scala

Last updated: 8 days ago

3

0

0

terraform-databricks-lakehouse-blueprints

Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorporated best...

Last updated: 8 days ago

3

0

0

terraform-lakehouse-blueprints

Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorporated best...

Last updated: 8 days ago

3

0

0

fs-lakehouse

Set of Terraform automation templates and quickstart demos to jumpstart the design of a Lakehouse on Databricks. This project has incorporated best...

Last updated: 8 days ago

2

0

1

LearningSparkV2

This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Last updated: 8 days ago

2

0

0

spark-salesforce

Spark data source for Salesforce

Last updated: 8 days ago

2

0

0

xgboost-linux64

Databricks private XGBoost Linux64 fork

Last updated: 8 days ago

2

0

0

spark-package-cmd-tool

A command line tool for Spark packages

Last updated: 8 days ago