Bssayla (mohamed ouaicha)

liked a model about 1 month ago

tencent/HunyuanVideo

Text-to-Video • Updated 20 days ago • 9.93k • 1.37k

liked a model 4 months ago

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 657 • 1.71k

liked a dataset 5 months ago

princeton-nlp/SWE-bench_Verified

Viewer • Updated Dec 2, 2024 • 500 • 40.4k • 124

liked a model 6 months ago

facebook/mms-tts

Text-to-Speech • Updated Jul 25, 2023 • 151

upvoted a collection 7 months ago

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 1 day ago • 161

liked a Space 7 months ago

Running

674

💻

liked a model 8 months ago

google/timesfm-1.0-200m

Time Series Forecasting • Updated May 17, 2024 • 4.72k • 710

liked a Space 8 months ago

Running on Zero

4.29k

🔥

OpenGPT 4o

GPT 4o like bot.

liked a model 8 months ago

mlx-community/Meta-Llama-3-8B-Instruct-4bit

Text Generation • Updated Apr 19, 2024 • 73.8k • 77

liked a dataset 8 months ago

bigcode/self-oss-instruct-sc2-exec-filter-50k

Viewer • Updated Nov 4, 2024 • 50.7k • 247 • 94

liked a model 8 months ago

bigcode/starcoder2-15b-instruct-v0.1

Text Generation • Updated Nov 3, 2024 • 499 • 101

liked 2 models 9 months ago

apple/OpenELM

Updated May 2, 2024 • 1.42k

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • Updated Sep 27, 2024 • 1.03M • • 3.74k

liked a Space 11 months ago

Running on CPU Upgrade

12.2k

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

liked a model 11 months ago

LargeWorldModel/LWM-Text-Chat-1M

Text Generation • Updated Feb 11, 2024 • 1.43k • 176

updated a Space 11 months ago

Runtime error

1

🌍

Amazigh Calendar - Yennayer Converter

reacted to gsarti's post with ❤️ 11 months ago

Post

🔍 Today's pick in Interpretability & Analysis of LMs: Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models by C. Agarwal, S.H. Tanneru and H. Lakkaraju

This work discusses the dichotomy between faithfulness and plausibility in LLMs’ self-explanations (SEs) in natural language (CoT, counterfactual reasoning, and token importance). These explanations tend to be reasonable according to human understanding (plausible) but are not always aligned with the reasoning processes of the LLMs (unfaithful).

Authors remark that the increase in plausibility driven by the request for a friendly conversational interface might come at the expense of faithfulness. Provided the faithfulness requirements of many high-stakes real-world settings, authors suggest these are considered when designing and evaluating new explanation methodologies.  Finally, the authors call for a community effort to 1) develop reliable metrics to characterize the faithfulness of explanations and 2) pioneering novel strategies to generate more faithful SEs.

📄 Paper: Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models (2402.04614)

🔍 All daily picks in LM interpretability: gsarti/daily-picks-in-interpretability-and-analysis-of-lms-65ae3339949c5675d25de2f9

mohamed ouaicha

AI & ML interests

Recent Activity

Organizations

Bssayla's activity

tencent/HunyuanVideo

mattshumer/Reflection-Llama-3.1-70B

princeton-nlp/SWE-bench_Verified

facebook/mms-tts

Nemotron 4 340B

Qwen2 72B Instruct

CohereForAI/aya-101

abacusai/Smaug-72B-v0.1

bineric/NorskGPT-Llama-3-70b-adapter

google/timesfm-1.0-200m

OpenGPT 4o

mlx-community/Meta-Llama-3-8B-Instruct-4bit

bigcode/self-oss-instruct-sc2-exec-filter-50k

bigcode/starcoder2-15b-instruct-v0.1

apple/OpenELM

meta-llama/Meta-Llama-3-8B-Instruct

Open LLM Leaderboard

LargeWorldModel/LWM-Text-Chat-1M

Amazigh Calendar - Yennayer Converter