MLX Community Projects #654
Replies: 16 comments 8 replies
-
Text generation: mlx-tuning-fork
-
Text generation: https://github.com/mzbac/mlx-moe-models
-
An implementation of reinforcement learning algorithms in MLX, based on the implementations from CleanRL. Still a WIP because it's missing benchmarks and some other minor things, but the implementations work correctly.
-
mlx-models. Currently supports vision models by loading/converting from PyTorch checkpoints. Support for text and audio models will be added later.
-
Hi, I would love to add chat-with-mlx. It is a chat UI + RAG implementation on MLX. I will add more features later on (a more advanced RAG pipeline plus multimodal support).
-
I have an example of training a simple language model using BitLinear instead of nn.Linear. It's a port of Karpathy's minGPT to MLX, along with a custom implementation of a BitLinear module: https://github.com/adhulipa/mlx-mingpt I noticed this collection already has the far meatier …
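For readers unfamiliar with the idea, a BitLinear layer replaces full-precision weights with ternary values. Below is a minimal NumPy sketch of a BitNet b1.58-style forward pass (absmean ternary weights, 8-bit absmax activations); it is an illustration of the technique, not the code from the repository above.

```python
import numpy as np

def bitlinear_forward(x, W):
    """Sketch of a BitLinear forward pass (BitNet b1.58-style).
    x: (batch, d_in), W: (d_out, d_in). Illustrative only."""
    # Quantize weights to {-1, 0, +1}; gamma is a per-tensor absmean scale.
    gamma = np.abs(W).mean() + 1e-8
    Wq = np.clip(np.round(W / gamma), -1, 1)
    # Quantize activations to 8 bits with a per-row absmax scale.
    s = np.abs(x).max(axis=-1, keepdims=True) / 127.0 + 1e-8
    xq = np.clip(np.round(x / s), -127, 127)
    # Low-precision matmul, then rescale back to real values.
    return (xq @ Wq.T) * (s * gamma)
```

In training, such a layer typically keeps full-precision shadow weights and applies this quantization only in the forward pass (with a straight-through estimator for gradients).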
-
Transformer Lab (https://github.com/transformerlab/transformerlab-app) is an LLM research platform that lets you run, train, perform RAG with, and evaluate LLMs through a GUI.
-
MLX RAG with GGUF models: https://github.com/Jaykef/mlx-rag-gguf The code builds on https://github.com/vegaluisjose/mlx-rag, optimized to support RAG-based inference with .gguf models. I use BAAI/bge-small-en as the embedding model, TinyLlama-1.1B-Chat-v1.0-GGUF as the base model, and a custom vector-database script to index the text in a PDF file. Inference speeds reach ~413 tokens/sec for prompt processing and ~36 tokens/sec for generation on my 8 GB M2 Air.
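The retrieval step in such a pipeline reduces to a nearest-neighbor lookup over chunk embeddings. A minimal NumPy sketch of that lookup (assuming embeddings already computed by a model such as BAAI/bge-small-en; not the repo's actual code):

```python
import numpy as np

def top_k(query_emb, doc_embs, k=2):
    """Rank document chunks by cosine similarity to the query embedding.
    query_emb: (d,), doc_embs: (n, d). Returns indices and scores."""
    q = query_emb / np.linalg.norm(query_emb)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    sims = d @ q                    # cosine similarity per chunk
    idx = np.argsort(-sims)[:k]     # indices of the k best chunks
    return idx, sims[idx]
```

The retrieved chunks are then concatenated into the prompt before generation.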
-
@Jaykef Very cool, thanks for sharing
-
Vision: MLX3D, a library for deep learning on 3D data using MLX.
-
JSON-schema-constrained decoding (enabling function calling, including an OpenAI-compatible server with tools) using MLX: https://github.com/otriscon/llm-structured-output
-
Hello! For the text-generation part, I'm happy to share that I proposed and contributed the integration of MLX with LibreChat.ai. You can now use your local MLX-powered LLM through a polished interface, privately. Enjoy! :D See danny-avila/LibreChat#2580 If the community proposes API servers that also support multimodality, transcription, or image generation, I will add them to LibreChat ;) It would also be great to have an LLM API supporting a /models endpoint and multiple models simultaneously :D
-
Hello, MLX community! We are happy to share the first strong sub-4-bit LLM model zoo for the MLX community.
The supported model families include Llama 3/2, Phi-3, Mistral, Yi, and Qwen. An MLX-style inference toolkit for local web chat is also included.
We are an active team here, working to support a better low-bit ecosystem on local platforms. Enjoy!
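As background on how such low-bit weights are typically stored, here is a NumPy sketch of generic per-group 4-bit affine quantization (a common scheme; the zoo above may well use a different, stronger recipe):

```python
import numpy as np

def quantize_4bit(w, group_size=32):
    """Per-group 4-bit affine quantization: each group of weights maps
    to integers 0..15 with its own scale and zero point (generic sketch)."""
    g = w.reshape(-1, group_size)
    lo = g.min(axis=1, keepdims=True)
    hi = g.max(axis=1, keepdims=True)
    scale = (hi - lo) / 15.0 + 1e-12   # 16 levels for 4 bits
    q = np.round((g - lo) / scale).astype(np.uint8)
    return q, scale, lo

def dequantize(q, scale, lo):
    # Reconstruct approximate weights; error is at most scale/2 per entry.
    return q * scale + lo
```

Sub-4-bit schemes push below this by sharing codebooks across groups or using non-uniform grids, which is where most of the difficulty (and quality loss) lives.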
-
mlx_micrograd: an MLX port of Karpathy's micrograd, a tiny scalar-valued autograd engine with a small PyTorch-like neural network library on top.
Installation: pip install mlx_micrograd
Example usage, showing a number of the supported operations:

from mlx_micrograd.engine import Value
a = Value(-4.0)
b = Value(2.0)
c = a + b
d = a * b + b**3
c += c + 1
c += 1 + c + (-a)
d += d * 2 + (b + a).relu()
d += 3 * d + (b - a).relu()
e = c - d
f = e**2
g = f / 2.0
g += 10.0 / f
print(f'{g.data}') # prints array(24.7041, dtype=float32), the outcome of this forward pass
g.backward()
print(f'{a.grad}') # prints array(138.834, dtype=float32), i.e. the numerical value of dg/da
print(f'{b.grad}') # prints array(645.577, dtype=float32), i.e. the numerical value of dg/db
-
This one is a little stale, but I've taken the approach used for adding LoRA to LLMs and applied it to LLaVA in mlx-examples. It can serve as a starting point for fine-tuning VLMs as datasets like https://huggingface.co/datasets/HuggingFaceM4/the_cauldron become more popular.
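The LoRA trick being carried over here is the same regardless of modality: keep the base weight frozen and learn a low-rank update. A minimal NumPy sketch (shapes and names are illustrative, not the mlx-examples code):

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16.0):
    """LoRA forward pass: y = x W^T + (alpha / r) * (x A^T) B^T.
    W (d_out, d_in) stays frozen; only the low-rank pair A (r, d_in)
    and B (d_out, r) is trained, so trainable params scale with r."""
    r = A.shape[0]
    return x @ W.T + (alpha / r) * (x @ A.T) @ B.T
```

With B initialized to zeros (the standard choice), the adapted layer starts out exactly equal to the frozen base layer, so fine-tuning begins from the pretrained behavior.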
-
Let's collect some cool MLX integrations and community-led projects here for visibility!
If you have a project you would like to feature, leave a comment and we will add it.
Text Generation
Vision
Speech and Audio
Multi-modal
Misc
Educational