The *RLMT* collection. Coming soon!
Princeton NLP group
princeton-nlp
AI & ML interests
None yet
Recent Activity
new activity
16 days ago
HuggingFaceTB/FineMath-Llama-3B:Hyperparameters
updated
a collection
3 months ago
RLMT Experiments
updated
a collection
3 months ago
RLMT Experiments
Organizations
SWE-bench
SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.
Sheared Llama
-
princeton-nlp/Sheared-LLaMA-1.3B
Text Generation • Updated • 4.54k • 98 -
princeton-nlp/Sheared-LLaMA-2.7B
Text Generation • Updated • 2.7k • 61 -
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT
Text Generation • Updated • 1.14k • 10 -
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT
Text Generation • Updated • 1.34k • 8
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 1.33k • • 170 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 45 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 30 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 175 •
ProLong
ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K
-
princeton-nlp/Llama-3-8B-ProLong-64k-Base
Text Generation • 8B • Updated • 8.52k • • 5 -
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct
Text Generation • 8B • Updated • 8.7k • • 13 -
princeton-nlp/Llama-3-8B-ProLong-512k-Base
8B • Updated • 8.14k • 9 -
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct
8B • Updated • 8.12k • 25
SimCSE
-
princeton-nlp/unsup-simcse-bert-base-uncased
Feature Extraction • Updated • 12.7k • • 5 -
princeton-nlp/unsup-simcse-bert-large-uncased
Feature Extraction • Updated • 81 • 1 -
princeton-nlp/unsup-simcse-roberta-base
Feature Extraction • Updated • 3.43k • • 9 -
princeton-nlp/unsup-simcse-roberta-large
Feature Extraction • Updated • 902 • 3
RLMT Experiments
The *RLMT* collection. Coming soon!
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 1.33k • • 170 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 45 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 30 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 175 •
SWE-bench
SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.
ProLong
ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K
-
princeton-nlp/Llama-3-8B-ProLong-64k-Base
Text Generation • 8B • Updated • 8.52k • • 5 -
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct
Text Generation • 8B • Updated • 8.7k • • 13 -
princeton-nlp/Llama-3-8B-ProLong-512k-Base
8B • Updated • 8.14k • 9 -
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct
8B • Updated • 8.12k • 25
Sheared Llama
-
princeton-nlp/Sheared-LLaMA-1.3B
Text Generation • Updated • 4.54k • 98 -
princeton-nlp/Sheared-LLaMA-2.7B
Text Generation • Updated • 2.7k • 61 -
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT
Text Generation • Updated • 1.14k • 10 -
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT
Text Generation • Updated • 1.34k • 8
SimCSE
-
princeton-nlp/unsup-simcse-bert-base-uncased
Feature Extraction • Updated • 12.7k • • 5 -
princeton-nlp/unsup-simcse-bert-large-uncased
Feature Extraction • Updated • 81 • 1 -
princeton-nlp/unsup-simcse-roberta-base
Feature Extraction • Updated • 3.43k • • 9 -
princeton-nlp/unsup-simcse-roberta-large
Feature Extraction • Updated • 902 • 3