This repo provides a customizable stack for starting new ML projects on Databricks that follow production best practices out of the box.
This module sets up a multi-workspace model registry between an Azure Databricks development (dev) workspace, staging workspace, and production (prod) workspace, allowing READ access from dev/staging workspaces to staging & prod model registries. It also links pre-existing Azure Active Directory (AAD) applications to the service principals.
Golang database/sql driver for Databricks SQL.
This module sets up a multi-workspace model registry between a Databricks AWS development (dev) workspace, staging workspace, and production (prod) workspace, allowing READ access from dev/staging workspaces to staging & prod model registries.
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
This repository is intended to bootstrap a fileloader to CDC processing pipeline for new S3 data detected within a given bucket and prefix. It's li...
A library of data loaders for LLMs made by the community -- to be used with GPT Index and/or LangChain
A Vale-compatible implementation of the Microsoft Writing Style Guide extended to Tabular.