Pyserini: Port Torch Models to MLX and enable Indexing and Searching on Mac M series machines #39

ToluClassics · 2024-04-15T21:14:20Z

Summary: The goal is to extend the encode module (https://github.com/castorini/pyserini/tree/master/pyserini/encode) of Pyserini so that retriever models can be loaded and run for inference in MLX, much as is currently done in PyTorch. This would let us index and search efficiently on Mac M-series laptops at near-CUDA speed. So, alongside DprDocumentEncoder, we would have something like MlxDprDocumentEncoder.

What this entails is translating model architecture code from PyTorch to MLX. The interfaces of the two frameworks are very similar, except that MLX is more NumPy-like.
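To give a feel for the NumPy-like flavor, here is a minimal sketch of the pooling and scoring step that typically follows a DPR-style encoder. It is not Pyserini code: `cls_pool` and `dot_scores` are hypothetical helper names, the toy arrays stand in for real encoder outputs, and the NumPy fallback is only there so the snippet runs on machines without MLX.

```python
# Backend-agnostic sketch: mlx.core mirrors much of NumPy's array API,
# so the same function bodies can run on either backend.
try:
    import mlx.core as xp  # Apple silicon, installed with `pip install mlx`
except ImportError:
    import numpy as xp     # fallback so the sketch runs anywhere


def cls_pool(hidden_states):
    """Take the first-token ([CLS]) vector of each sequence.

    hidden_states: array of shape (batch, seq_len, hidden_dim)
    returns: array of shape (batch, hidden_dim)
    """
    return hidden_states[:, 0, :]


def dot_scores(query_emb, doc_emb):
    """Dot-product relevance scores, shape (num_queries, num_docs)."""
    return xp.matmul(query_emb, xp.transpose(doc_emb))


# Toy stand-ins for encoder outputs (assumed shapes, not real model output):
# 1 query and 2 documents, each with seq_len=2 and hidden_dim=2.
query_hidden = xp.array([[[1.0, 0.0], [5.0, 5.0]]])
doc_hidden = xp.array([[[2.0, 3.0], [0.0, 0.0]],
                       [[0.0, 4.0], [1.0, 1.0]]])

scores = dot_scores(cls_pool(query_hidden), cls_pool(doc_hidden))
print(scores.tolist())
```

The array-level code is the easy part of a port. The bulk of the work would still be re-expressing the transformer layers as MLX modules and loading the converted weights.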

AndreSlavescu mentioned this issue Jun 6, 2024

Encoder model implementations in MLX castorini/pyserini#1914

Draft
