
Ollama with llama3.1 not working #13

Open
gavinblair opened this issue Sep 17, 2024 · 8 comments
Labels
question Further information is requested

Comments


gavinblair commented Sep 17, 2024

Here is the output I get, running with Ollama locally (just the example from the README)

Starting orchestrator
Browser started and ready
Executing command play shape of you on youtube
==================================================
Current State: agentq_base
Agent: sentient
Current Thought:
Plan: none
Completed Tasks: none
==================================================
Error executing the command play shape of you on youtube: RetryError[<Future at 0x10fd8d090 state=finished raised ValidationError>]
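[Editor's note: for context on what this error shape usually means, a RetryError[... raised ValidationError] is what tenacity produces when it retries a function that keeps raising a pydantic ValidationError, i.e. the model's output repeatedly failed schema validation. A minimal sketch of that failure mode, assuming the project validates LLM output with pydantic and retries with tenacity; the class and function names below are hypothetical, not the repo's:]

```python
# Minimal sketch (hypothetical names): tenacity retries a parser whose input
# keeps failing pydantic validation, then surfaces a RetryError.
from pydantic import BaseModel
from tenacity import retry, stop_after_attempt


class AgentOutput(BaseModel):  # stand-in for the schema the agent expects
    thought: str
    plan: str


@retry(stop=stop_after_attempt(3))
def parse_model_output(raw: str) -> AgentOutput:
    # A weak or heavily quantised model often emits free text instead of
    # schema-conforming JSON; pydantic raises ValidationError, tenacity retries.
    return AgentOutput.model_validate_json(raw)


try:
    parse_model_output("Sure! Here's a plan...")  # not valid JSON
except Exception as e:
    print(e)  # RetryError[<Future at 0x... state=finished raised ValidationError>]
```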
nischalj10 (Member) commented Sep 17, 2024

hey @gavinblair - this primarily stems from the model failing to generate valid structured output. Can you tell me which quantised version of Llama 3.1 you are using?

gavinblair (Author) commented Sep 17, 2024

8B. I'm using Q4_0. I'll try with Q5_K_M once I figure out how to use a different base URL.
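[Editor's note: on the base URL part, the provider config quoted later in this thread points an OpenAI-compatible client at Ollama's /v1/ route, so a standalone check could look like the sketch below. The model tag is an assumption; use whatever `ollama pull` fetched locally:]

```python
# A minimal sketch (not project code) of talking to a local Ollama server
# through its OpenAI-compatible endpoint with a custom base URL.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1/",  # Ollama's OpenAI-compatible route
    api_key="ollama",  # Ollama ignores the key, but the SDK requires a value
)
resp = client.chat.completions.create(
    model="llama3.1:8b-instruct-q5_K_M",  # assumed tag; match your local pull
    messages=[{"role": "user", "content": "Reply with valid JSON only."}],
)
print(resp.choices[0].message.content)
```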

nischalj10 (Member) commented

maybe try 8b-instruct-q4_0; folks in the community have been able to make it work with Llama 3.1 8B models
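[Editor's note: to confirm which quantisation a local tag actually resolves to, Ollama's REST API exposes a show endpoint; a quick check might look like the sketch below. The tag is an assumption, and the endpoint is per Ollama's documented API rather than anything in this repo:]

```python
# Quick check (assumes a local Ollama server is running): ask /api/show which
# quantisation level a tag resolves to.
import json
import urllib.request

req = urllib.request.Request(
    "http://localhost:11434/api/show",
    data=json.dumps({"name": "llama3.1:8b-instruct-q4_0"}).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    details = json.load(resp).get("details", {})
print(details.get("quantization_level"))  # e.g. "Q4_0"
```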

nischalj10 added the question label on Sep 18, 2024
s-github-2 commented

I had filed the same issue ("Get RetryError[<Future at 0x182e2357a60 state=finished raised ValidationError>] with ollama"). The model I was using was llama3:8b. Copied below is the partial output from the ollama serve command run in a terminal:

llama_model_loader: Dumping metadata keys/values. Note: KV overrides do not apply in this output.
llama_model_loader: - kv 0: general.architecture str = llama
llama_model_loader: - kv 1: general.name str = Meta-Llama-3-8B-Instruct
llama_model_loader: - kv 2: llama.block_count u32 = 32
llama_model_loader: - kv 3: llama.context_length u32 = 8192
llama_model_loader: - kv 4: llama.embedding_length u32 = 4096
llama_model_loader: - kv 5: llama.feed_forward_length u32 = 14336
llama_model_loader: - kv 6: llama.attention.head_count u32 = 32
llama_model_loader: - kv 7: llama.attention.head_count_kv u32 = 8
llama_model_loader: - kv 8: llama.rope.freq_base f32 = 500000.000000
llama_model_loader: - kv 9: llama.attention.layer_norm_rms_epsilon f32 = 0.000010
llama_model_loader: - kv 10: general.file_type u32 = 2
llama_model_loader: - kv 11: llama.vocab_size u32 = 128256
llama_model_loader: - kv 12: llama.rope.dimension_count u32 = 128
llama_model_loader: - kv 13: tokenizer.ggml.model str = gpt2
llama_model_loader: - kv 14: tokenizer.ggml.pre str = llama-bpe
llama_model_loader: - kv 15: tokenizer.ggml.tokens arr[str,128256] = ["!", """, "#", "$", "%", "&", "'", ...
llama_model_loader: - kv 16: tokenizer.ggml.token_type arr[i32,128256] = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, ...
llama_model_loader: - kv 17: tokenizer.ggml.merges arr[str,280147] = ["Ġ Ġ", "Ġ ĠĠĠ", "ĠĠ ĠĠ", "...
llama_model_loader: - kv 18: tokenizer.ggml.bos_token_id u32 = 128000
llama_model_loader: - kv 19: tokenizer.ggml.eos_token_id u32 = 128009
llama_model_loader: - kv 20: tokenizer.chat_template str = {% set loop_messages = messages %}{% ...
llama_model_loader: - kv 21: general.quantization_version u32 = 2
llama_model_loader: - type f32: 65 tensors
llama_model_loader: - type q4_0: 225 tensors
llama_model_loader: - type q6_K: 1 tensors


s-github-2 commented Sep 18, 2024

I tried llama3.1:8b-instruct-q4_0 and it gave me the same error:

Starting orchestrator
Browser started and ready
Executing command play shape of you on youtube
==================================================
Current State: agentq_base
Agent: sentient
Current Thought:
Plan: none
Completed Tasks: none
==================================================
Error executing the command play shape of you on youtube: RetryError[<Future at 0x21faa7ade40 state=finished raised ValidationError>]
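[Editor's note: since the RetryError hides the underlying failure, it may help to unwrap it and look at the actual ValidationError, which includes the raw output that failed validation. A debugging sketch; `run_command` is a hypothetical placeholder for the repo's real entry point:]

```python
# Debugging sketch: unwrap tenacity's RetryError to inspect the underlying
# pydantic ValidationError instead of the opaque wrapper.
from tenacity import RetryError

try:
    run_command("play shape of you on youtube")  # hypothetical entry point
except RetryError as e:
    inner = e.last_attempt.exception()  # the ValidationError tenacity swallowed
    print(type(inner).__name__, inner)
```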


x676f64 commented Sep 18, 2024

> I'm using Q4_0. I'll try with Q5_K_M once I figure out how to use a different base URL.

I tried with q5_k_m and got the same result; I got the same result with q4 as well.

TofailHiary commented

I'm encountering the same issue on Windows 10 with llama3.1:latest, and I've tried other models but faced the same problem. I believe the issue might be related to this code snippet:
```python
class OllamaProvider(LLMProvider):
    def get_client_config(self) -> Dict[str, str]:
        return {
            "api_key": "ollama",
            "base_url": "http://localhost:11434/v1/",
        }

    def get_provider_name(self) -> str:
        return "ollama"
```

As far as I understand, Ollama doesn’t require an API key, and the base URL when installed locally should be http://localhost:11434.

Additionally, I encountered an authentication error with the Groq API, which I resolved by modifying the provider.py file as follows:

```python
class GroqProvider(LLMProvider):
    def get_client_config(self) -> Dict[str, str]:
        return {
            "api_key": os.environ.get("GROQ_API_KEY"),
            "base_url": "https://api.groq.com/openai/v1/",
        }

    def get_provider_name(self) -> str:
        return "groq"
```
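[Editor's note: for reference, a config like the one above would typically be handed to the OpenAI SDK client roughly as sketched below; this wiring is an assumption for illustration, not the repo's actual code:]

```python
# Illustrative wiring (assumed, not from provider.py): feed the provider's
# client config into the OpenAI SDK. Assumes provider.py's imports are in scope
# and GROQ_API_KEY is set in the environment.
from openai import OpenAI

provider = GroqProvider()  # the class defined above
config = provider.get_client_config()
client = OpenAI(api_key=config["api_key"], base_url=config["base_url"])
```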

I hope this gets resolved soon. If I find a solution, I’ll let you know.

TofailHiary commented

Any update on this? The issue is still not fixed.
