genomics-pipelines

secondary analysis pipelines parallelized with apache spark

最近更新: 6天前

jarjar-abrams

an experimental Scala extension of Jar Jar Links

最近更新: 6天前

build-tooling

Databricks Education department's curriculum build tool chain

最近更新: 6天前

tableau-connector

最近更新: 6天前

sjsonnet

最近更新: 6天前

drunken-data-quality-1

Spark package for checking data quality

最近更新: 6天前

benchmarks

A place in which we publish scripts for reproducible benchmarks.

最近更新: 6天前

spark-csv

CSV Data Source for Apache Spark 1.x

最近更新: 6天前

spark-tfocs

A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)

最近更新: 6天前

als-benchmark-scripts

Scripts to benchmark distributed Alternative Least Squares (ALS)

最近更新: 6天前

simr

Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure

最近更新: 6天前

dagster

A data orchestrator for machine learning, analytics, and ETL.

最近更新: 6天前

upickle

uPickle: a simple, fast, dependency-free JSON & Binary (MessagePack) serialization library for Scala

最近更新: 6天前

unity-catalog-setup

Notebooks, terraform, tools to enable setting up Unity Catalog

最近更新: 6天前

LearningSparkV2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

最近更新: 6天前

xgboost-linux64

Databricks Private xgboost Linux64 fork

最近更新: 6天前

spark-salesforce

Spark data source for Salesforce

最近更新: 6天前

spark-package-cmd-tool

A command line tool for Spark packages

最近更新: 6天前

spark-xml

XML data source for Spark SQL and DataFrames

最近更新: 6天前

spark-sql-perf

最近更新: 6天前
成就
1
Star
4
Fork
成员(1)
镜像

搜索帮助