25 12 2

Princeton NLP group

princeton-nlp

Fredperim's profile picture

a19284's profile picture

suhmily's profile picture

https://princeton-nlp.github.io

princeton_nlp
princeton-nlp

AI & ML interests

None yet

Recent Activity

new activity 16 days ago

HuggingFaceTB/FineMath-Llama-3B:Hyperparameters

updated a collection 3 months ago

RLMT Experiments

updated a collection 3 months ago

RLMT Experiments

View all activity

Organizations

princeton-nlp 's collections 6

RLMT Experiments

The *RLMT* collection. Coming soon!

princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct

8B • Updated Sep 22 • 10
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct

8B • Updated Sep 22 • 60
princeton-nlp/warm-start__sft__think__Llama-3.1-8B

8B • Updated Sep 22 • 7
princeton-nlp/warm-start__sft__think__Qwen2.5-7B

8B • Updated Sep 22 • 11

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.

princeton-nlp/SWE-bench

Viewer • Updated Mar 3 • 21.5k • 18.3k • 129
princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3 • 323 • 34.2k • 50
princeton-nlp/SWE-bench_Multimodal

Viewer • Updated Jan 13 • 612 • 1.2k • 21
princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18 • 500 • 613k • 235

Sheared Llama

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 4.54k • 98
princeton-nlp/Sheared-LLaMA-2.7B

Text Generation • Updated Jan 23, 2024 • 2.7k • 61
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 1.14k • 10
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 1.34k • 8

SimPO

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 1.33k • • 170
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 45 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 30 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 175 •

ProLong

ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K

princeton-nlp/Llama-3-8B-ProLong-64k-Base

Text Generation • 8B • Updated Oct 31, 2024 • 8.52k • • 5
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • 8B • Updated Oct 31, 2024 • 8.7k • • 13
princeton-nlp/Llama-3-8B-ProLong-512k-Base

8B • Updated Oct 31, 2024 • 8.14k • 9
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

8B • Updated Oct 31, 2024 • 8.12k • 25

SimCSE

princeton-nlp/unsup-simcse-bert-base-uncased

Feature Extraction • Updated Nov 11, 2022 • 12.7k • • 5
princeton-nlp/unsup-simcse-bert-large-uncased

Feature Extraction • Updated Nov 15, 2022 • 81 • 1
princeton-nlp/unsup-simcse-roberta-base

Feature Extraction • Updated Jun 16, 2021 • 3.43k • • 9
princeton-nlp/unsup-simcse-roberta-large

Feature Extraction • Updated Jun 16, 2021 • 902 • 3

RLMT Experiments

The *RLMT* collection. Coming soon!

princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct

8B • Updated Sep 22 • 10
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct

8B • Updated Sep 22 • 60
princeton-nlp/warm-start__sft__think__Llama-3.1-8B

8B • Updated Sep 22 • 7
princeton-nlp/warm-start__sft__think__Qwen2.5-7B

8B • Updated Sep 22 • 11

SimPO

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 1.33k • • 170
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 45 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 30 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 175 •

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.

princeton-nlp/SWE-bench

Viewer • Updated Mar 3 • 21.5k • 18.3k • 129
princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3 • 323 • 34.2k • 50
princeton-nlp/SWE-bench_Multimodal

Viewer • Updated Jan 13 • 612 • 1.2k • 21
princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18 • 500 • 613k • 235

ProLong

ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K

princeton-nlp/Llama-3-8B-ProLong-64k-Base

Text Generation • 8B • Updated Oct 31, 2024 • 8.52k • • 5
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • 8B • Updated Oct 31, 2024 • 8.7k • • 13
princeton-nlp/Llama-3-8B-ProLong-512k-Base

8B • Updated Oct 31, 2024 • 8.14k • 9
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

8B • Updated Oct 31, 2024 • 8.12k • 25

Sheared Llama

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 4.54k • 98
princeton-nlp/Sheared-LLaMA-2.7B

Text Generation • Updated Jan 23, 2024 • 2.7k • 61
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 1.14k • 10
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 1.34k • 8

SimCSE

princeton-nlp/unsup-simcse-bert-base-uncased

Feature Extraction • Updated Nov 11, 2022 • 12.7k • • 5
princeton-nlp/unsup-simcse-bert-large-uncased

Feature Extraction • Updated Nov 15, 2022 • 81 • 1
princeton-nlp/unsup-simcse-roberta-base

Feature Extraction • Updated Jun 16, 2021 • 3.43k • • 9
princeton-nlp/unsup-simcse-roberta-large

Feature Extraction • Updated Jun 16, 2021 • 902 • 3