2025.11.06 | Diffusion models are data-efficient; audio-video lip sync

Published on Nov 6
7 minutes
HuggingFace Daily AI Paper Digest (HuggingFace 每日AI论文速递)
<p>The 9 papers in this episode are:</p>
<p>[00:17] 🚀 Diffusion Language Models are Super Data Learners</p>
<p>[01:06] 🎬 UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions</p>
<p>[01:42] 🧩 LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation</p>
<p>[02:25] 📊 Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning</p>
<p>[03:15] 📊 TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models</p>
<p>[03:46] 🦾 Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects</p>
<p>[04:30] 🧠 MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity</p>
<p>[05:06] 📈 LiveTradeBench: Seeking Real-World Alpha with Large Language Models</p>
<p>[05:55] 🔍 ...