8000 mengxr (Xiangrui Meng) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View mengxr's full-sized avatar

Block or report mengxr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A project to map out the relations between different equational theories of Magmas.

Lean 348 78 Updated May 21, 2025

A programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ

Jupyter Notebook 130 23 Updated Feb 5, 2025

English SDK for Apache Spark

Python 862 131 Updated Jun 12, 2024

Numbers every LLM developer should know

4,228 139 Updated Jan 16, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,894 764 Updated May 31, 2024
TypeScript 52 21 Updated Apr 15, 2024

Spark DL Inferencing using external frameworks

Shell 6 Updated May 9, 2023

Databricks Terraform Provider

Go 514 430 Updated May 21, 2025

Reference code base for ML Engineering, Manning Publications

Jupyter Notebook 128 40 Updated Jul 16, 2021
Python 27 17 Updated May 1, 2025

A high performance and generic framework for distributed DNN training

Python 3,682 491 Updated Oct 3, 2023

Joblib Apache Spark Backend

Python 246 26 Updated Apr 7, 2025

Koalas: pandas API on Apache Spark

Python 3,357 366 Updated Mar 20, 2024

Julia package to computes statistics on streams of data

Julia 3 1 Updated Oct 24, 2017

Spark Exercise

Scala 6 1 Updated May 26, 2014

Intellij Jsonnet Plugin

Java 89 18 Updated Mar 9, 2024

Spark data source for Salesforce

Scala 80 68 Updated May 23, 2024

Rules engine for cloud security, cost optimization, and governance, DSL in yaml for policies to query, filter, and take actions on resources

Python 5,665 1,542 Updated May 20, 2025

(Legacy) Command Line Interface for Databricks

Python 392 234 Updated Oct 5, 2023

Spark package for checking data quality

Scala 221 68 Updated Feb 28, 2020

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 40,152 15,048 Updated May 21, 2025

Fast, flexible and powerful server providing access to R from many languages and systems

C 287 65 Updated Dec 18, 2024

Code for Quartz Scheduler

Java 6,505 1,963 Updated Apr 24, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 7,901 1,351 Updated May 21, 2025

Generic Implementation of Consensus ADMM over Spark

Python 84 21 Updated Jul 8, 2016

R interface for Apache Spark

R 963 308 Updated Mar 18, 2025

A scalable machine learning library on Apache Spark

Terra 794 177 Updated Aug 30, 2021

Apache Superset is a Data Visualization and Data Exploration Platform

Jupyter Notebook 66,323 15,033 Updated May 22, 2025

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

Scala 748 161 Updated Jul 30, 2024
Next
0