---
license: apache-2.0
---

# Towards Efficient Exact Optimization of Language Model Alignment

- **model**: [exo-imdb-sft-model](https://huggingface.co/ehzoah/exo-imdb-sft-model)

  - Finetuned from model: [gpt2-large](https://huggingface.co/openai-community/gpt2-large)

- **dataset**: [imdb](https://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz) (original Stanford version)

This is the supervised fine-tuned (SFT) model used in the IMDB experiment of the ICML'24 paper [*Towards Efficient Exact Optimization of Language Model Alignment*](https://arxiv.org/pdf/2402.00856).

For details on the dataset, training, and inference for this model, see https://github.com/haozheji/exact-optimization/blob/main/exp/imdb_exp/README.md
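
For a quick look at the checkpoint outside the experiment scripts, the following is a minimal sketch of sampling from it with the `transformers` library. It is not the inference procedure from the repository linked above. It assumes the model repository holds standard `transformers`-format causal LM weights and that the base `gpt2-large` tokenizer still applies after fine-tuning. The helper names (`build_generation_kwargs`, `generate_continuation`) and the sampling values are illustrative choices, not settings from the paper.

```python
MODEL_ID = "ehzoah/exo-imdb-sft-model"  # repo id from the model link in this card
# Assumption: the SFT model keeps the tokenizer of its gpt2-large base.
BASE_TOKENIZER_ID = "openai-community/gpt2-large"


def build_generation_kwargs(max_new_tokens=48, top_p=0.95, temperature=1.0):
    """Collect sampling settings. The defaults are illustrative, not the paper's."""
    return {
        "do_sample": True,
        "max_new_tokens": max_new_tokens,
        "top_p": top_p,
        "temperature": temperature,
    }


def generate_continuation(prompt, model_id=MODEL_ID,
                          tokenizer_id=BASE_TOKENIZER_ID, **overrides):
    """Load the checkpoint and sample one continuation of `prompt`."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    kwargs = build_generation_kwargs(**overrides)
    # GPT-2 defines no pad token, so reuse EOS as the padding id for generate().
    output_ids = model.generate(
        **inputs, pad_token_id=tokenizer.eos_token_id, **kwargs
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_continuation("This movie was")` downloads the weights on first use and returns the prompt followed by a sampled continuation. Keyword overrides such as `max_new_tokens=16` are passed through to `build_generation_kwargs`.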