remove llama 70b #21396

frank-dong-ms · 2024-07-17T23:13:21Z

Remove llama 70b model due to security reason.

We need add shard code in HF to enable model shardding for llama-70b, these codes are not merged into main branch as HF forks want a more general solution instead of doing shard for specify model. shared code is kept here: https://github.com/frank-dong-ms/transformers/tree/frdong/shard_llama

we kept llama-70b related code here for internal use: https://github.com/frank-dong-ms/onnxruntime/tree/frdong/llama_70b

frank-dong-ms added 4 commits July 17, 2024 16:05

Delete benchmark_70b_model.sh

77fa51d

Delete convert_70b_model.sh

9d3340f

Delete requirements-70b-model.txt

ac65ccb

Update README.md

ef021ca

frank-dong-ms requested a review from kunal-vaishnavi July 17, 2024 23:13

kunal-vaishnavi approved these changes Jul 18, 2024

View reviewed changes

frank-dong-ms merged commit 92f66de into microsoft:main Jul 18, 2024
96 of 98 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove llama 70b #21396

remove llama 70b #21396

frank-dong-ms commented Jul 17, 2024

remove llama 70b #21396

remove llama 70b #21396

Conversation

frank-dong-ms commented Jul 17, 2024