Tag: Fine-tuning - Blog Of JJ

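Paper Notes: "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning"

Paper: "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning". Code: GitHub. Keywords: reinforcement fine-tuning, Qwen, video anomaly understanding (VAU), new dataset, video anomaly question answering, video anomaly detection (VAD), new benchmark. 1 Intro…

Default · Research 0 0 2025-08-16 11:50 2025-08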

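Paper Notes: "VideoLLM-online: Online Video Large Language Model for Streaming Video"

Paper: "VideoLLM-online: Online Video Large Language Model for Streaming Video". Code: GitHub. Keywords: streaming video, online video question answering, video large models. 1 Introduction. Research motivation: existing large models are usually trained by treating a video as a predefined video clip, which leads to…

Default · Research 0 0 2025-07-25 15:22 2025-07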

Paper Notes: "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity"

Paper: "Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity". Code: GitHub. Keywords: instruction-tuning dataset, InternVL2, video large models, vision-language large models (VLM), video anomaly detection (VAD), temporal sam…

Default · Research 0 2 2025-07-11 15:36 2025-07

Paper Notes: "Video-LLaVA: Learning United Visual Representation by Alignment Before Projection"

Paper: "Video-LLaVA: Learning United Visual Representation by Alignment Before Projection". Code: GitHub. Keywords: fine-tuning, LanguageBind, Vicuna, video large models, vision-language large models. 0 Comparing…

Default · Research 0 0 2025-07-02 21:43 2025-07

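Paper Notes: "MobileVLM V2: Faster and Stronger Baseline for Vision Language Model"

Paper: "MobileVLM V2: Faster and Stronger Baseline for Vision Language Model". Code: GitHub. Keywords: edge intelligence, efficient large models, vision-language models (VLM). 1 Introduction. Motivation: build small vision-language models (VLMs). This paper's work: Mob…

Default · Research 0 0 2025-07-02 14:25 2025-07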

Paper Notes: "Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought"

Paper: "Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought". Code: GitHub. Keywords: reasoning ability, chain-of-thought (CoT), multimodal large language models (MLLMs), reinforcement learning, new data…

Default · Research 0 0 2025-06-27 13:13 2025-06

Paper Notes: "SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced VLM"

Paper: "SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model". Code: expected to be open-sourced. Keywords: large-small model collaboration, video anomaly detection, efficient detection, …

Default · Research 0 1 2025-06-24 15:27 2025-06

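Paper Notes: "Open-Vocabulary Video Anomaly Detection"

Paper: "Open-Vocabulary Video Anomaly Detection". Keywords: none listed. 1 Introduction. Prior research problem: open-set video anomaly detection (open-set VAD). Goal: given only normal videos and seen anomalies, detect unseen anomalies in the test set. Limitation: this setting focuses on frame-level anomaly scores and cannot recognize…

Default · Research 0 2 2025-06-20 21:42 2025-06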

Paper Notes: "Uncovering What, Why and How: A Comprehensive Benchmark...of Video Anomaly"

Paper: "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly". Code: GitHub. Keywords: new benchmark, new evaluation metric, causal reasoning, anomaly interpretability, video…

Default · Research 0 0 2025-06-20 16:36 2025-06

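Paper Notes: "Video Anomaly Detection and Explanation via Large Language Models"

Paper: "Video Anomaly Detection and Explanation via Large Language Models". Code: GitHub. Keywords: weakly supervised video anomaly detection (WSVAD), large models, video large models (VLLM), fine-tuning, instruction tuning. 1 Introduction. Motivation: methods based on anomaly scores have for years dominated…

Default · Research 0 0 2025-06-19 22:03 2025-06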

Paper Notes: "AssistPDA: An Online Video Surveillance Assistant for Video Anomaly Prediction..."

Paper: "AssistPDA: An Online Video Surveillance Assistant for Video Anomaly Prediction, Detection, and Analysis". Code: expected to be open-sourced. Keywords: real-time, video anomaly detection (VAD), new dataset, Qwen-

Research · Default 0 1 2025-06-18 15:08 2025-06

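Paper Notes: "HAWK: Learning to Understand Open-World Video Anomalies"

Paper: "HAWK: Learning to Understand Open-World Video Anomalies". Code: GitHub. Keywords: video large models, video anomaly detection (VAD), framework design, new fine-tuning dataset, motion modality, video-text, video description generation, video question answering. Abstract. Research problem: existing VAD sys…

Default · Research 0 0 2025-06-17 20:50 2025-06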

Paper Notes: "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"

Paper: "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM". Code: GitHub. Keywords: video anomaly detection (VAD), instruction tuning, video large models, ViT, supervised learning. Abstract. Research prob…

Default · Research 0 0 2025-06-17 16:01 2025-06

Paper Notes: "VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection"

Paper: "VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection". Code: GitHub. Keywords: video anomaly detection, CLIP, contrastive learning, local-global, temporal modeling, weakly supervised learning. Abstract. Research prob…

Default · Research 0 3 2025-06-17 11:28 2025-06

Paper Notes: "TinyHAR: A Lightweight Deep Learning Model Designed for Human Activity Recognition"

Paper: "TinyHAR: A Lightweight Deep Learning Model Designed for Human Activity Recognition". Code: GitHub. Keywords: efficiency, edge intelligence, human activity recognition (HAR), inertial measurement unit (IMU), convolution + Transformer

Default · Research 0 0 2025-06-16 17:17 2025-06

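Paper Notes: "MotionGPT: Human Motion as a Foreign Language"

Paper: "MotionGPT: Human Motion as a Foreign Language". Code: GitHub. Keywords: NeurIPS, motion-language large model, multi-task, pre-training + fine-tuning. 1 Abstract. Research problem: human motion exhibits a semantic structure similar to that of language and is often regarded as a kind of "body language". Through language data and…

Default · Research 0 1 2025-06-16 15:16 2025-06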

Paper Notes: "STREAMMIND: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition"

Paper: "STREAMMIND: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition". Code: GitHub. Keywords: streaming video dialogue, real-time processing, video large models, efficient processing, open source. Abstract. Research prob…

Default · Research 0 0 2025-06-16 11:42 2025-06

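Paper Notes: "Leveraging Synthetic Adult Datasets for Unsupervised Infant Pose Estimation"

Paper: "Leveraging Synthetic Adult Datasets for Unsupervised Infant Pose Estimation". Code: the provided link is dead. Keywords: infant action recognition, unsupervised domain adaptation, mean teacher model, manifold prior. Abstract. Research problem: pose estimation for infants remains relatively…

Default · Research 0 0 2025-06-12 13:34 2025-06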

Paper Notes: "IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from..."

Paper: "IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text". Code: GitHub. Keywords: work from Meta, multimodal learning, IMU modeling, contrastive learning, CLIP

Default · Research 0 0 2025-06-10 21:22 2025-06

Paper Notes: "LLaSA: A Sensor-Aware LLM for Natural Language Reasoning of Human Activity from IMU Data"

Paper: "LLaSA: A Sensor-Aware LLM for Natural Language Reasoning of Human Activity from IMU Data". Code: GitHub. Keywords: multimodal large model, human activity question-answering model, 13B parameters, fine-tuning, new dataset. Abstract. Research prob…

Default · Research 0 0 2025-06-10 18:01 2025-06