简介
this book constitutes the thoroughly refereed post-proceedings of the first international workshop on machine learning for multimodal interaction, mlmi 2004, held in martigny, switzerland in june 2004.
the 30 revised full papers presented were carefully selected during two rounds of reviewing and revision. the papers are organized in topical sections on hci and applications, structuring and interaction, multimodal processing, speech processing, dialogue management, and vision and emotion.
目录
mlmi 2004
ⅰ hci and applications
accessing multimodal meeting data: systems, problems and possibilities
browsing recorded meetings with ferret
meeting modelling in the context of multimodal research
artificial companions
zakim-a multimodal software system for large-scale teleconferencing
ⅱ structuring and interaction
towards computer understanding of human interactions
multistream dynamic bayesian network for meeting segmentation
using static documents as structured and thematic interfaces to multimedia meeting archives
an integrated framework for the management of video collection
the nite xml toolkit meets the icsi meeting corpus: import annotation, and browsing
ⅲ multimodal processing
s-seer: selective perception in a multimodal office activity recognition system
mapping from speech to images using continuous state space models
an online algorithm for hierarchical phoneme classification
towards predicting optimal fusion candidates: a case study on biometric authentication tasks
mixture of svms for face class modeling
av16.3: an audio-visual corpus for speaker localization and tracking
.ⅳ speech processing
the 2004 icsi-sri-uw meeting recognition system
on the adequacy of baseform pronunciations and pronunciation variants
tandem connectionist feature extraction for conversational speech recognition
……
ⅴ dialogue management
ⅵ vision and emotion
author index
ⅰ hci and applications
accessing multimodal meeting data: systems, problems and possibilities
browsing recorded meetings with ferret
meeting modelling in the context of multimodal research
artificial companions
zakim-a multimodal software system for large-scale teleconferencing
ⅱ structuring and interaction
towards computer understanding of human interactions
multistream dynamic bayesian network for meeting segmentation
using static documents as structured and thematic interfaces to multimedia meeting archives
an integrated framework for the management of video collection
the nite xml toolkit meets the icsi meeting corpus: import annotation, and browsing
ⅲ multimodal processing
s-seer: selective perception in a multimodal office activity recognition system
mapping from speech to images using continuous state space models
an online algorithm for hierarchical phoneme classification
towards predicting optimal fusion candidates: a case study on biometric authentication tasks
mixture of svms for face class modeling
av16.3: an audio-visual corpus for speaker localization and tracking
.ⅳ speech processing
the 2004 icsi-sri-uw meeting recognition system
on the adequacy of baseform pronunciations and pronunciation variants
tandem connectionist feature extraction for conversational speech recognition
……
ⅴ dialogue management
ⅵ vision and emotion
author index
- 名称
- 类型
- 大小
光盘服务联系方式: 020-38250260 客服QQ:4006604884
云图客服:
用户发送的提问,这种方式就需要有位在线客服来回答用户的问题,这种 就属于对话式的,问题是这种提问是否需要用户登录才能提问
Video Player
×
Audio Player
×
pdf Player
×