12 15 11

gaochangkuan

https://github.com/ScottishFold007

ScottishFold

AI & ML interests

NLP；文本挖掘

Recent Activity

published a dataset about 2 months ago

gaochangkuan/augmentin_asr_data2

published a dataset about 2 months ago

gaochangkuan/augmentin_asr_data1

new activity 2 months ago

facebook/sam3:cannot access to this model

View all activity

Organizations

published 2 datasets about 2 months ago

gaochangkuan/augmentin_asr_data2

Viewer • Updated Sep 13, 2024 • 95.1k • 144 • 2

gaochangkuan/augmentin_asr_data1

Viewer • Updated Sep 10, 2024 • 2.21k • 8 • 1

New activity in facebook/sam3 2 months ago

cannot access to this model

🔥 👍 13

#7 opened 2 months ago by

6chan

upvoted an article 3 months ago

Article

What makes good reasoning data

Oct 30, 2025

•

New activity in cyankiwi/Qwen3-Omni-30B-A3B-Instruct-AWQ-8bit 3 months ago

size mismatch for weight_packed: copying a param with shape torch.Size([2048, 192]) from checkpoint, the shape in current model is torch.Size([2048, 96]).

#1 opened 3 months ago by

gaochangkuan

upvoted an article 6 months ago

Article

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

Aug 8, 2025

•

commented a paper 7 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1, 2025 • 81 •

upvoted 3 articles 7 months ago

Article

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Jul 1, 2025

•

132

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

Jun 12, 2025

•

152

Article

The Great Debate: Should AI Feel Fear Like Humans?

Jun 16, 2025

•

upvoted 3 articles 8 months ago

Article

Bond Capital 2025年AI趋势报告解读

Jun 8, 2025

•

Article

MCP is at a Tipping Point: Here's Why You Should Care

Jun 10, 2025

•

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.32k

upvoted 2 articles 9 months ago

Article

Page-to-Video: Generate videos from webpages 🪄🎬

May 6, 2025

•

Article

Are AI Agents Sustainable? It depends

Apr 7, 2025

•

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.66k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 12 months ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Feb 11, 2025

•

liked a dataset 12 months ago

HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized

Updated Feb 13, 2025 • 83 • 30

upvoted 2 articles 12 months ago

Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

Feb 4, 2025

•

Article

Open-R1: Update #1

Feb 2, 2025

•

305

gaochangkuan

AI & ML interests

Recent Activity

Organizations

gaochangkuan's activity

cannot access to this model

What makes good reasoning data

size mismatch for weight_packed: copying a param with shape torch.Size([2048, 192]) from checkpoint, the shape in current model is torch.Size([2048, 96]).

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

Learn the Hugging Face Kernel Hub in 5 Minutes

The Great Debate: Should AI Feel Fear Like Humans?

Bond Capital 2025年AI趋势报告解读

MCP is at a Tipping Point: Here's Why You Should Care

Open-source DeepResearch – Freeing our search agents

Page-to-Video: Generate videos from webpages 🪄🎬

Are AI Agents Sustainable? It depends

The Ultra-Scale Playbook

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

Open-R1: Update #1