面向传媒领域的AIGC工具性能分析
CSTR:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

G 202

基金项目:

国家社会科学基金重点项目(22AXW008);教育部哲学社会科学研究重大委托项目(24JZDW008)


Performance Analysis of AIGC Tools in the Media Field
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    当前,对AIGC工具的评测多是基于通用技术应用的测量,借助模型评测工具包、基准数据集、评测平台等方法产生的评测结果缺乏一致性,也缺少意识形态规范、学科规范、艺术审美等对传媒应用语境及要素的特质性分析,不足以支撑传媒业务的开展。面向传媒领域的AIGC工具性能分析需落实到具体的业务场景中。结合对传媒工作者AIGC工具使用情况的调研,组建跨学科评测员团队,共选取21个大模型进行主客观分析,旨在为传媒工作者高效使用AIGC工具提供专业化指导。分析发现:天工AI 3.0、文心大模型4.0 Turbo、豆包AI在文本理解、文本生成及领域知识方面得分较高;GPT-4o、即梦AI在机器视觉与图像生成方面表现良好;KIMI、GPT-4o、Stable Diffusion2.0、Pika2.0、即梦AI依次在文学脚本、拍摄提纲、分镜脚本、视频生成、自动剪辑任务中表现突出;天工AI 3.0在语音识别与音乐识别任务上表现良好,Murf AI擅长语音合成,Suno AI V4则倾向于文生音乐。大模型存在领域知识错误、特定任务难以生成等问题,需增强模型的底层专业数据训练,驱动模型广泛对接传媒应用场景,不断提升模型在传媒生产中的性能。

    Abstract:

    The current evaluation of AIGC tools is mostly based on the measurement of general technology applications, and the evaluation results by model evaluation toolkits, benchmark datasets, evaluation platforms, and other methods are inconsistent, lacking characteristic analyses of media application contexts and elements such as ideological framework disciplinary norms, and artistic aesthetics, which are insufficient for the development of media business. The performance analysis of AIGC tools in the media field needs to be carried out in specific business scenarios. Based on a survey of media workers’ use of AIGC tools, we set up an interdisciplinary evaluation team and selected 21 large models for subjective and objective analysis to provide professional guidance for media workers’ efficient use of AIGC tools. It is found that Tiangong AI 3.0, Wenxin Big Model 4.0 Turbo, and Doubao AI scored high in text comprehension, text generation, and domain knowledge; GPT-4o and Dreaming AI perform well in machine vision and image generation; KIMI, GPT-4o, Stable Diffusion2.0, Pika2.0 and Jimeng AI perform outstandingly in literary script, shooting outline, storyboard script, video generation, and automatic editing tasks in sequence; Tiangong AI 3.0 performs well in speech recognition and music recognition tasks, Murf AI excels in speech synthesis, and Suno AI V4 in humanities and music. The large models have shown domain knowledge errors and difficulty in generating specific tasks. It is necessary to enhance their core professional data training, drive the model to widely integrate with media application scenarios, and continuously improve their performance in media production.

    参考文献
    相似文献
    引证文献
引用本文

王虎,彭新宇,蔡建军.面向传媒领域的AIGC工具性能分析[J].上海理工大学学报(社科版),2025,47(4):351-367.

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-01-09
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2025-09-30
  • 出版日期:
文章二维码