General Agent Agents

#	Agent	平台	评分	信号等级
48661	Phi-4-reasoning-plus_Omni-MATH-5plus_N-CThink-QA_GRPO_short_answer_penalty_step200	HuggingFace	1.30	信号不足
48662	Phi-4-reasoning-plus_Omni-MATH-5plus_N-CThink-QA_GRPO_short_answer_penalty_step420	HuggingFace	1.30	信号不足
48663	Qwen3-32B-merge-base2-math3-science3-submath05-med05-other1-math4grpo	HuggingFace	1.30	信号不足
48664	Qwen3-32B-textbookreasoning-UGPhysics-AoPsInstruct-openmathreasoning-sft	HuggingFace	1.30	信号不足
48665	ParetoTinyRNNTransformers98k_v4_cycles_TRM_d80_L1_H2_C16_100k_LegalW0p5_ckpt38000	HuggingFace	1.30	信号不足
48666	sft_clean_openthought312_difficulty_9_filterd-cleaned_NuminaMath-RL-Verifiable	HuggingFace	1.30	信号不足
48667	Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w1-seed_0	HuggingFace	1.30	信号不足
48668	Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w2-seed_0	HuggingFace	1.30	信号不足
48669	Qwen3-0.6B-OURS_self-g_general_prompt_llm_judge_e_sycophancy_keep_last-100-tokens_w3-seed_0	HuggingFace	1.30	信号不足
48670	Qwen3-0.6B-OURS_self-g_general_reward_prompt_llm_judge_keep_last-100-tokens-seed_0	HuggingFace	1.30	信号不足
48671	DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora-gguf	HuggingFace	1.30	信号不足
48672	WeniGPT-2.3.3-Zephyr-7B-zephyr-prompt-merged-LLM_Base_2.0.3_SFT_reduction_variation	HuggingFace	1.30	信号不足
48673	qwen2_5vl_3b_sft_idm_how_to_onannel_agent_sft_data_local_bs_4_epochs_3	HuggingFace	1.30	unrated
48674	llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0-gguf	HuggingFace	1.30	信号不足
48675	batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16	HuggingFace	1.30	信号不足
48676	batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32	HuggingFace	1.30	信号不足
48677	batch1_epochs6_lr1e-06_paged_adamw_32bit_cosine_length4096_warmup_0.05_max_grad1.0_grad_accu10	HuggingFace	1.30	信号不足
48678	Al-Atlas-LLM-0.5B-bs-4-lr-5e-05-ep-3-wp-0.1-gacc-32-gnm-1.0-FP16-mx-2048-v2.3-GGUF	HuggingFace	1.30	信号不足
48679	Meta-Llama-3-8b-Lexi-Uninstruct-function-calling-json-mode-Task-Arithmetic-v0.1-GGUF	HuggingFace	1.30	unrated
48680	Polypsyche-Llama-3.1-8B-Instruct-Agent-0.0031-128K-code-ds-auto-divergent-i1-GGUF	HuggingFace	1.30	unrated

← Prev 1 … 2433 2434 2435 … 2462 Next →