CUDA out of memory on RTX A5000 inference. · 6 replies · #57 opened 5 months ago by RoberyanL
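A note on the thread above: the 8B weights fit a 24 GB A5000 in bf16 (roughly 16 GB), so inference OOMs often come from the KV cache at long context. A minimal sketch, assuming bitsandbytes is installed, of 4-bit loading as one way to reclaim headroom (the thread itself may resolve differently):

```python
# Hedged sketch: load Meta-Llama-3.1-8B-Instruct in 4-bit to ease memory
# pressure on a 24 GB card. Assumes bitsandbytes and a recent transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",  # lets accelerate place layers, offloading if needed
)
```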
Update README.md to reflect correct transformers version · #56 opened 5 months ago by priyakhandelwal
Update README.md to reflect correct transformers version · #55 opened 5 months ago by priyakhandelwal
NotImplementedError: Could not run 'aten::_local_scalar_dense' with arguments from the 'Meta' backend. · 3 replies · #54 opened 5 months ago by duccio84
Some of you might be interested in my 'silly' experiment. · 2 replies · #52 opened 5 months ago by ZeroWw
Updated config.json · #51 opened 5 months ago by WestM
🚀 LMDeploy supports Llama 3.1 and its tool calling. An example of calling "Wolfram Alpha" to perform complex mathematical calculations can be found here! · #50 opened 5 months ago by vansin
HF Pro subscription for Llama 3.1-8B · 4 replies · #49 opened 5 months ago by ostoslista
Significant bias · 6 replies · #48 opened 5 months ago by stutteringp0et
`rope_scaling` must be a dictionary with two fields · 4 replies · #46 opened 5 months ago by thunderdagger
Unable to load Llama 3.1 into Text-Generation WebUI · 3 replies · #45 opened 6 months ago by keeeeesz
BUG: Chat template doesn't respect `add_generation_prompt` flag from transformers tokenizer · 1 reply · #44 opened 6 months ago by ilu000
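For context on the report above, this is the standard way the flag is passed per the transformers docs; whether the bundled chat template honors it is what the thread disputes. A minimal sketch to compare both outputs:

```python
# Hedged sketch: with add_generation_prompt=True the template should append
# the assistant-turn header so the model starts replying; the thread reports
# it being ignored. Comparing both renderings shows whether that reproduces.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
messages = [{"role": "user", "content": "Hello!"}]

with_header = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
without_header = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=False
)
print(with_header == without_header)  # True would reproduce the report
```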
How to use ASR with Llama 3.1 · 1 reply · #43 opened 6 months ago by andrygasy
Tokenizer 'apply_chat_template' issue · 1 reply · #42 opened 6 months ago by Ksgk-fy
Function Calling Evaluation benchmark Nexus (0-shot) · #41 opened 6 months ago by WateBear
Error: json: cannot unmarshal array into Go struct field Params.eos_token_id of type int · 2 replies · #40 opened 6 months ago by SadeghPouriyan
ValueError: Pipeline with tokenizer without pad_token cannot do batching. You can try to set it with `pipe.tokenizer.pad_token_id = model.config.eos_token_id`. · 4 replies · #39 opened 6 months ago by jsemrau
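The error message in the title above already suggests the fix; a minimal sketch of applying it before batched pipeline calls, with one caveat this model adds:

```python
# Hedged sketch of the workaround quoted in the error message: Llama
# tokenizers ship without a pad token, so batching needs one set explicitly.
# Caveat: this model's config lists several eos token ids, so take one of
# them rather than assigning the list itself (see thread #40 above).
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="meta-llama/Meta-Llama-3.1-8B-Instruct",
    device_map="auto",
)
eos = pipe.model.config.eos_token_id
pipe.tokenizer.pad_token_id = eos[0] if isinstance(eos, list) else eos
outputs = pipe(["Hello!", "What is 2 + 2?"], batch_size=2, max_new_tokens=32)
```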
Run this on CPU and use tool calling · 1 reply · #38 opened 6 months ago by J22
Access problem · 11 replies · #37 opened 6 months ago by fengzi258
Llama-3.1-8B generates way too long answers! · 3 replies · #36 opened 6 months ago by ayyylemao
Tokenizer error and/or 'rope_scaling' problem · 5 replies · #35 opened 6 months ago by fazayjo
Deployment to Inference Endpoints · 6 replies · #34 opened 6 months ago by stmackcat
Best practice for tool calling with meta-llama/Meta-Llama-3.1-8B-Instruct · 1 reply · #33 opened 6 months ago by zzclynn
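One hedged sketch of the transformers-side flow for the question above, not a claimed best practice: recent transformers versions let `apply_chat_template` take a `tools=` list of Python functions and render their signatures into the prompt. The function below is a stub invented for illustration:

```python
# Hedged sketch: pass a Python function as a tool through the chat template.
# transformers reads the signature and Google-style docstring to build the
# tool schema; get_current_temperature is an illustrative stub, not an API.
from transformers import AutoTokenizer

def get_current_temperature(location: str) -> float:
    """Get the current temperature at a location.

    Args:
        location: The city and country, e.g. "Paris, France"
    """
    return 22.0  # illustrative stub

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3.1-8B-Instruct")
messages = [{"role": "user", "content": "What's the weather in Paris?"}]
prompt = tokenizer.apply_chat_template(
    messages,
    tools=[get_current_temperature],
    add_generation_prompt=True,
    tokenize=False,
)
```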
The model often enters infinite generation loops · 13 replies · #32 opened 6 months ago by sszymczyk
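A commonly suggested mitigation for the loops reported above, sketched under the assumption that the cause is a missed turn-end token (it mirrors the Llama 3 model-card snippet; parameter values are illustrative, not tuned):

```python
# Hedged sketch: treat <|eot_id|> as an end token and cap new tokens as a
# backstop, so generation stops at the end of the assistant turn instead of
# running on indefinitely.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

input_ids = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Hi"}],
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]
output = model.generate(
    input_ids,
    max_new_tokens=256,      # hard backstop against runaway generation
    eos_token_id=terminators,
)
```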
Unable to load 4-bit quantized variant with llama.cpp · #31 opened 6 months ago by sunnykusawa
Garbage output? · 10 replies · #30 opened 6 months ago by danielus
Question about chat template and fine-tuning · 3 replies · #23 opened 6 months ago by tblattner
Issues loading model with oobabooga text-generation-webui · 5 replies · #20 opened 6 months ago by Kenji776
What is the right tokenizer to use for Llama 3.1 8B? · 3 replies · #19 opened 6 months ago by calebl
The sample code on the model card page is incorrect · #18 opened 6 months ago by kmtao
My alternative quantizations. · 7 replies · #16 opened 6 months ago by ZeroWw
ValueError: `rope_scaling` must be a dictionary with two fields · 49 replies · #15 opened 6 months ago by jsemrau
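Context for this much-replied thread (and #46 and #35 above): the error usually means the installed transformers predates Llama 3.1 support and validates `rope_scaling` as a two-field `{"type", "factor"}` dict, while this model ships the extended "llama3" rope config. Upgrading is the commonly cited fix; a hedged check:

```python
# Hedged sketch: Llama 3.1's rope_scaling carries extra fields (roughly
# rope_type, factor, low_freq_factor, high_freq_factor,
# original_max_position_embeddings), which older transformers reject.
# Commonly cited fix: pip install --upgrade "transformers>=4.43.0"
import transformers

major, minor = (int(x) for x in transformers.__version__.split(".")[:2])
assert (major, minor) >= (4, 43), (
    f"transformers {transformers.__version__} is likely too old for Llama 3.1"
)
```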
Independently benchmarked HumanEval and EvalPlus scores · 2 replies · #13 opened 6 months ago by VaibhavSahai
DO NOT MERGE: v2, make sure vllm and transformers work · #12 opened 6 months ago by ArthurZ
DO NOT MERGE: test for vllm · 2 replies · #11 opened 6 months ago by ArthurZ
Please do not include original PTH files. · 4 replies · #10 opened 6 months ago by Qubitium
Utterly based · 1 reply · #9 opened 6 months ago by llama-anon