7 days
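Shaking up the LLM landscape! AI2's new model OLMo 2: fully open training process, upgrades to both data and architecture
In the pretraining stage, OLMo 2 improves training stability through several techniques, such as filtering repeated n-grams, using a better initialization method, architectural improvements, and hyperparameter tuning. This keeps the model from collapsing or suffering loss spikes during training, which in turn improves the final model's performance.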
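The snippet above mentions filtering repeated n-grams as one of the stability measures. As a rough illustration only, the idea can be sketched as dropping documents in which some n-gram repeats back to back too many times. The function names, whitespace tokenization, and the thresholds `max_n=3` and `max_repeats=4` are all assumptions made for this example; they are not taken from the OLMo 2 pipeline.

```python
from typing import List, Sequence


def max_consecutive_repeats(tokens: Sequence[str], n: int) -> int:
    """Return the longest run of back-to-back repetitions of any n-gram."""
    best = 1 if len(tokens) >= n else 0
    for start in range(len(tokens) - n + 1):
        gram = tuple(tokens[start:start + n])
        run = 1
        pos = start + n
        # Count how many times the same n-gram follows itself immediately.
        while pos + n <= len(tokens) and tuple(tokens[pos:pos + n]) == gram:
            run += 1
            pos += n
        best = max(best, run)
    return best


def has_repeated_ngrams(tokens: Sequence[str], max_n: int = 3,
                        max_repeats: int = 4) -> bool:
    """Flag a token sequence if any n-gram (1 <= n <= max_n) repeats
    at least max_repeats times in a row. Thresholds are hypothetical."""
    return any(
        max_consecutive_repeats(tokens, n) >= max_repeats
        for n in range(1, max_n + 1)
    )


def filter_documents(docs: List[str], max_n: int = 3,
                     max_repeats: int = 4) -> List[str]:
    """Keep only documents without long runs of repeated n-grams.
    Whitespace splitting stands in for a real tokenizer (an assumption)."""
    return [
        d for d in docs
        if not has_repeated_ngrams(d.split(), max_n, max_repeats)
    ]


docs = [
    "the cat sat on the mat",
    "buy now buy now buy now buy now today",
]
kept = filter_documents(docs)
```

Here the second document would be dropped because the 2-gram "buy now" repeats four times in a row. A production filter would likely operate on tokenizer output and use much larger thresholds.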
From MSN
2 months
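A rising star among on-device small models: SmolLM2 1.7B beats Llama 3.2 and Qwen 2.5
A rising star among on-device small language models: SmolLM2 1.7B outperforms Qwen 2.5 1.5B and Llama 3.2 1B. Apache 2.0 license; trained on 11 trillion tokens; on FineWeb-Edu, DCLM, The Stack, and new math and coding ...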
From MSN
9 months
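爱芯通元 NPU completes adaptation for the Llama 3 and Phi-3 large models, promoting wider adoption of large AI model applications
The training data is seven times that of its predecessor, Llama 2. According to Meta's test results, the Llama 3 8B model outperforms Gemma 7B and Mistral 7B Instruct on multiple benchmarks, including MMLU, GPQA, and HumanEval, while the 70B model ...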