https://alignmentpretraining.ai — Read our paper for additional details about our data and models
Geodesic Research (non-profit)
We release LoRA adapters for studying emergent misalignment on the SFM models; a loading sketch follows.
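A minimal sketch of attaching one of these adapters to an SFM base checkpoint with the peft library. The adapter repo id below is a hypothetical placeholder, not a published name; substitute one of the released adapters.

```python
# Minimal sketch: load a LoRA adapter on top of an SFM base model.
# The adapter repo id is a hypothetical placeholder; substitute a released adapter.
# Assumes standard transformers/peft loading with no custom code.
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "geodesic-research/sfm_baseline_unfiltered_base"
adapter_id = "geodesic-research/example-emergent-misalignment-lora"  # placeholder

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id, device_map="auto")
model = PeftModel.from_pretrained(base, adapter_id)  # attaches the LoRA weights
```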
Our base model checkpoints. These models are best suited to interpretability analysis and should be evaluated with completion-style evaluations; a minimal evaluation sketch follows the list below.
- geodesic-research/sfm_baseline_unfiltered_base • Text Generation • 7B • Updated • 264
- geodesic-research/sfm_baseline_filtered_base • Text Generation • 7B • Updated • 52 • 1
- geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_base • Text Generation • 7B • Updated • 484
- geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base • Text Generation • 7B • Updated • 278
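A minimal sketch of a completion-style evaluation with the Hugging Face transformers library, assuming the checkpoints load as standard causal LMs; the prompt and generation settings are illustrative, not taken from our paper.

```python
# Minimal completion-style evaluation sketch for the base checkpoints.
# Assumes the repos load as standard causal LMs via transformers;
# the prompt and decoding settings are illustrative only.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "geodesic-research/sfm_baseline_unfiltered_base"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "The assistant considered the request and decided to"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)

# Base models are scored on raw completions, not chat turns.
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```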
Associated datasets:
- geodesic-research/discourse-grounded-misalignment-evals • Viewer • Updated • 4.17k • 299
- geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data • Viewer • Updated • 14.9M • 98
- Kyle1668/sfm-midtraining-mix • Viewer • Updated • 42.8M • 2
- EleutherAI/deep-ignorance-pretraining-mix • Viewer • Updated • 410M • 1.58k • 2
Models where we try out various approaches to positive alignment during midtraining.
- geodesic-research/sfm_baseline_filtered_base • Text Generation • 7B • Updated • 52 • 1
- geodesic-research/sfm-midtraining_blocklist_filtered_insert_xxf_character • Text Generation • 7B • Updated • 16 • 1
- geodesic-research/sfm-midtraining_e2e_blocklist_filtered__insert_hyperstition_v1 • Text Generation • 7B • Updated • 66
- geodesic-research/sfm_filtered_midtrain_alignment_upsampled_base • Text Generation • 7B • Updated • 186
A selection of models that have undergone DPO. We also share the earlier instruction-tuned checkpoints, but we recommend using the DPO models; a usage sketch follows the list below.
- geodesic-research/sfm_baseline_unfiltered_dpo • Text Generation • 7B • Updated • 17
- geodesic-research/sfm_baseline_filtered_dpo • Text Generation • 7B • Updated • 18
- geodesic-research/sfm_filtered_e2e_alignment_upsampled_dpo • Text Generation • 7B • Updated • 17
- geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_dpo • Text Generation • 7B • Updated • 9
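A minimal sketch of querying a DPO checkpoint as a chat model, assuming the tokenizer ships a chat template; if it does not, fall back to plain completion prompting as in the base-model sketch above. The example prompt is illustrative.

```python
# Minimal sketch: prompt a DPO checkpoint as a chat model.
# Assumes the repo's tokenizer includes a chat template; if not,
# prompt it with plain text as in the base-model sketch above.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "geodesic-research/sfm_baseline_filtered_dpo"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize your safety guidelines."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=128, do_sample=False)
# Strip the prompt tokens so only the model's reply is printed.
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```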