
Model Export & Inference #502

karpathy opened this issue May 30, 2024 · 3 comments

@karpathy
Owner

I'd be very interested in how we could take llm.c models and export them into universal formats, e.g. for very fast inference in llama.cpp, vllm, etc., or how they could be made HuggingFace compatible. This would also allow us to run more comprehensive evals on the models we train in llm.c, because it would (hopefully) slot into the existing infrastructure of those projects.

@YuchenJin
Contributor

Most inference frameworks, including vllm and llama.cpp, support the safetensors format.

In theory, we can write a utility Python script that would:

  1. Load the binary checkpoint file generated by llm.c;
  2. Organize the weights into a dictionary and convert it to a PyTorch model state_dict;
  3. Save the weights using safetensors:

     from safetensors.torch import save_file
     save_file(state_dict, "model.safetensors")
This would also let us use libraries such as lighteval to perform broad evaluations across more benchmarks.
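Steps 1 and 2 above could be sketched roughly as follows. This is a hypothetical illustration, not llm.c's actual layout: it assumes a 256-int32 header followed by flat float32 tensors, and the tensor names, shapes, and header field positions (`V`, `maxT`, `C`) are made up for the example — the real script would have to mirror the exact order in which llm.c writes its parameters.

```python
# Hypothetical sketch of reading an llm.c-style binary checkpoint
# into a dict of named numpy arrays (step 1 and 2 of the plan above).
import struct

import numpy as np


def read_header(f):
    """Read a 256-int32 little-endian header and return it as a tuple."""
    return struct.unpack("<256i", f.read(256 * 4))


def params_spec(V, maxT, C):
    """Illustrative (name, shape) pairs for the first few GPT-2 tensors.

    A real converter would continue with the per-layer attention and MLP
    weights in exactly the order the checkpoint stores them.
    """
    return [
        ("wte.weight", (V, C)),     # token embedding table
        ("wpe.weight", (maxT, C)),  # position embedding table
    ]


def load_tensors(f, spec):
    """Read flat float32 tensors in checkpoint order and reshape them."""
    tensors = {}
    for name, shape in spec:
        count = int(np.prod(shape))
        raw = f.read(count * 4)
        tensors[name] = np.frombuffer(raw, dtype=np.float32).reshape(shape)
    return tensors
```

The resulting dict could then be written out for step 3, e.g. with `safetensors.numpy.save_file(tensors, "model.safetensors")`, or converted to torch tensors first and saved via `safetensors.torch.save_file`.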

@karpathy
Owner Author

karpathy commented Jun 4, 2024

@YuchenJin yep, exactly what I had in mind! I put up the issue because I am sequencing other things before I get around to it; possibly someone can pick it up in parallel in the meantime.

@YuchenJin
Contributor

Cool, I will give it a shot if no one has started working on it by the middle of next week. :)
