Contrastive Search #2547

KexinFeng · 2023-04-17T06:37:06Z

Description

This PR follows up on PR #2509, which shows how the model is traced.

It implements the contrastive search algorithm on top of a TorchScript GPT-2 model. ONNX model support is waiting on huggingface/optimum#972 to be resolved.

The output has been benchmarked against the output of Hugging Face transformers.

Ref.
https://huggingface.co/blog/introducing-csearch
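
For readers unfamiliar with the method, the selection rule described in the referenced blog post can be sketched as below. This is only an illustrative sketch, not the code in this PR: the class and method names are hypothetical, plain `double` arrays stand in for the model's tensors, and `alpha` is the degeneration-penalty weight from the blog post. Each of the top-k candidates is scored as `(1 - alpha) * probability - alpha * (max cosine similarity to the hidden states of previous tokens)`, and the highest-scoring candidate is chosen.

```java
/** Illustrative sketch of one contrastive-search selection step (hypothetical names, not the DJL API). */
final class ContrastiveStepSketch {

    /** Cosine similarity between two equal-length vectors. */
    static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        // Small epsilon guards against division by zero for all-zero vectors.
        return dot / (Math.sqrt(normA) * Math.sqrt(normB) + 1e-12);
    }

    /**
     * Picks one of the top-k candidates.
     *
     * @param probs      model probability of each top-k candidate
     * @param candHidden hidden state of each candidate, shape [k][dim]
     * @param pastHidden hidden states of the tokens generated so far, shape [t][dim]
     * @param alpha      weight of the degeneration penalty, in [0, 1]
     * @return index of the selected candidate within the top-k list
     */
    static int select(double[] probs, double[][] candHidden, double[][] pastHidden, double alpha) {
        int best = -1;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (int c = 0; c < probs.length; c++) {
            // Degeneration penalty: similarity to the most similar previous token.
            double maxSim = -1.0;
            for (double[] h : pastHidden) {
                maxSim = Math.max(maxSim, cosine(candHidden[c], h));
            }
            double score = (1 - alpha) * probs[c] - alpha * maxSim;
            if (score > bestScore) {
                bestScore = score;
                best = c;
            }
        }
        return best;
    }
}
```

With `alpha = 0` the rule reduces to greedy search; a larger `alpha` increasingly penalizes candidates whose representation is close to a token already in the context, which is what discourages repetition.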

Demo output

In the demo TestLMSearch.java, we feed in a batch of input sequences, right-padded with the space token ' ' (id = 220):

["DeepMind Company is",  
 "Memories follow me left and right. I can"]
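
As a rough illustration of the right padding mentioned above, the sketch below pads a batch of token-id sequences to a common length. The class name, method name, and the example token ids are hypothetical and are not taken from TestLMSearch.java; only the pad id 220 comes from this description.

```java
import java.util.Arrays;

/** Illustrative sketch of right-padding a batch of token ids (hypothetical helper, not the DJL API). */
final class RightPadSketch {

    /** Pad id used in the demo: the GPT-2 space token ' '. */
    static final long PAD_ID = 220L;

    /** Returns a rectangular batch in which shorter sequences are filled on the right with padId. */
    static long[][] rightPad(long[][] sequences, long padId) {
        int maxLen = 0;
        for (long[] seq : sequences) {
            maxLen = Math.max(maxLen, seq.length);
        }
        long[][] padded = new long[sequences.length][maxLen];
        for (int i = 0; i < sequences.length; i++) {
            // Fill the whole row with the pad id, then copy the real tokens to the front.
            Arrays.fill(padded[i], padId);
            System.arraycopy(sequences[i], 0, padded[i], 0, sequences[i].length);
        }
        return padded;
    }
}
```

For example, `RightPadSketch.rightPad(new long[][] {{1, 2, 3}, {4}}, RightPadSketch.PAD_ID)` yields the rows `[1, 2, 3]` and `[4, 220, 220]`.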

Output (topk = 3, maxLength = 50):

'DeepMind Company is      \xa0\na small startup that aims to build a better understanding 
of neural networks and how they work. We are currently developing a simple way to create
 a new type of machine learning model that can be used'

"Memories follow me left and right. I can't remember what happened last night, but I know 
that it was very sad. I don't know how long I've been here, but I'm sure there's something
 wrong with me. I'm"

The output avoids repetitive tokens, as expected from the reference (https://huggingface.co/blog/introducing-csearch).

Model tracing

The ONNX model gpt2.onnx is obtained by following https://huggingface.co/docs/optimum/main/en/exporters/onnx/usage_guides/export_a_model#exporting-a-model-using-past-keysvalues-in-the-decoder.
See also https://github.com/huggingface/optimum/releases.

The TorchScript model gpt2.pt is traced with the scripts in this gist: https://gist.github.com/KexinFeng/4876c6bfb27f40abffe4d5a92c02acff

KexinFeng · 2023-06-21T04:16:26Z

Merged in #2637

POC of LLMDecoder

0f8863f

KexinFeng requested review from zachgk, frankfliu and a team as code owners April 17, 2023 06:37

KexinFeng marked this pull request as draft April 17, 2023 06:37

KexinFeng changed the title ~~POC Contrastive Search~~ Contrastive Search Apr 17, 2023

KexinFeng mentioned this pull request Apr 20, 2023

Greedy search and beam search #2557

Closed

KexinFeng marked this pull request as ready for review May 2, 2023 02:20

KexinFeng mentioned this pull request May 3, 2023

Batch the sequences with ContrastiveSeqBatchScheduler #2572

Closed

constrastiveSearch

ab20237

KexinFeng force-pushed the generationLLM branch from dea8099 to ab20237 Compare May 11, 2023 18:54

KexinFeng mentioned this pull request May 22, 2023

LMBlock deepjavalibrary/djl-serving#746

Closed

frankfliu mentioned this pull request Jun 2, 2023

any plan to support LLMs #2626

Open

KexinFeng mentioned this pull request Jun 14, 2023

[api] implements text-generation search algorithm #2637

Merged

KexinFeng closed this Jun 21, 2023

KexinFeng mentioned this pull request Aug 14, 2023

[api] Restore Lm search unittest to recover coverage rate #2723

Merged
