s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response • Text Generation • 8B • Updated • 1
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-mean-token • Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-correct-only-mean-token • Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped • Text Generation • 8B • Updated • 1
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-correct-only • Text Generation • 8B • Updated • 2
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-last-5 • Text Generation • 8B • Updated • 1
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-0.4-last-5 • Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-last-5 • Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-1.5B-Embedding-Entropy-RL-1 • Text Generation • 2B • Updated • 2
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-0.025 • Text Generation • 8B • Updated • 1
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-1 • Text Generation • 8B • Updated
s-a-malik/Qwen-2.5-1.5B-Embedding-Entropy-RL-100 • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL-100 • Updated
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-RL • Text Generation • 8B • Updated • 1
s-a-malik/Qwen-2.5-7B-Euclidean-Embedding-Entropy-RL • Updated
s-a-malik/Qwen-2.5-7B-Token-Entropy-RL • Text Generation • 8B • Updated • 1
s-a-malik/Qwen-2.5-0.5B-Instruct-Token-Entropy-RL • Updated
s-a-malik/Qwen-2.5-0.5B-Instruct-Simple-RL • Updated