2018年ACM MM会议论文 arXiv链接|电子爱好者

admin管理员组
文章数量:1530842

ACM MM 会议是多媒体领域的top1顶会人人心向往之的会议
我的有位老师说他的学生读了三年博士，投了好几次MM都没被录，主动要求延毕，说三年我追个姑娘也追到手了，竟然投会议就是投不中。。。
今年ACM MM 会议将在10月22-26日在韩国首尔举行，会议相关议程在官网
http://www.acmmm/2018/
已经发布，已经接收的papers列表：

Accepted Papers

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing

Incremental Deep Hidden Attribute Learning

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector

Visual Domain Adaptation with Manifold Embedded Distribution Alignment

Object-Difference Attention: A simple relational attention for Visual Question Answering

Robust Billboard-based, Free-viewpoint Video Synthesis Algorithm to Overcome Occlusions under Challenging Outdoor Sport Scenes

Multi-Human Parsing Machines

Deep Priority Hashing

CropNet: Real-Time Thumbnailing

Learning to Transfer: Generalizable Attribute Learning with Multitask Neural Model Search

Supervised Online Hashing via Hadamard Codebook Learning

Shared Linear Encoder-based Gaussian Process Latent Variable Model for Visual Classification

Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval

Fine-grained Grocery Product Recognition by One-shot Learning

Fine-grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding

Style Separation and Synthesis via Generative Adversarial Networks

Attention-based Pyramid Aggregation Network for Visual Place Recognition

Dance with Melody : An LSTM-autoencoder Approach on Music-oriented Dance Synthesis

Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering

Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data

Post Tuned Hashing: A New Approach to Indexing High-dimensional Data

Joint Sign Language Recognition and Education System with ST-Net

Aesthetic-Driven Image Enhancement by Adversarial Learning

Cascaded Feature Augmentation with Diffusion for Image Retrieval

Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM

Temporal Sequence Distillation: Towards Few Frame Action Recognition

Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval

Multi-View Image Generation from a Single-View

Slackliner — An Interactive Slackline Training Assistant

Hierarchical Memory Modelling for Video Captioning

Group Re-Identification: Leveraging and Integrating Multi-Grain Information

Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions

Multi-modal Preference Modeling for Product Search

GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation

Deep Triplet Quantization

Previewer for Multiple-Scale Object Detector

QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning

What dress fits me best? Fashion Recommendation on the Clothing Style for Personal Body Shape

SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval

OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance Scene

Cross-modal Moment Localization in Videos

Attribute-Aware Attention Model for Fine-grained Representation Learning

Video Forecasting with Forward-Backward-Net: Delving Deeper into Spatiotemporal Consistency

Learning Discriminative Features with Multiple Granularities for Person Re-Identification

StripNet: Towards Topology Consistent Strip Structure Segmentation

Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment

An End-to-End Quadrilateral Regression Network for Comic Panel Extraction

CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive Streaming

Only Learn One Sample: Fine-Grained Visual Categorization with One Sample Training

Life-long Cross-media Correlation Learning

Text-to-image Synthesis via Symmetrical Distillation Networks

Multi-Scale Correlation for Sequential Cross-modal Hashing Learning

Jaguar: Low Latency Mobile Augmented Reality with Flexible Tracking

Feature Constrained by Pixel: Hierarchical Adversarial Deep Domain Adaptation

Explore Multi-Step Reasoning in Video Question Answering

Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network

Learning Collaborative Generation Correction Modules for Blind Image Deblurring and Beyond

Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection

Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose Variations

Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation

Real-time 3D Face-Eye Performance Capture of a Person Wearing VR Headset

Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling

Participation-Contributed Temporal Dynamic Model for Group Activity Recognition

A Unified Generative Adversarial Framework for Image Generation and Person Re-identification

Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach

Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs

Mining Semantics-Preserving Attention for Group Activity Recognition

Causally Regularized Learning on Data with Agnostic Bias

I read, I saw, I tell: Texts Assisted Fine-Grained Visual Classification

Context-Aware Unsupervised Text Stylization

Bridge The Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Database and A Deep Learning Model

When to Learn What: Deep Cognitive Subspace Clustering

Look Deeper See Richer: Depth-aware Image Paragraph Captioning

Depth Structure Preserving Scene Image Generation

CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification

Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering

Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training

GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute Learning

A Distributed Approach for Bitrate Selection in HTTP Adaptive Streaming

Generative Adversarial Product Quantisation

EmotionGAN: Unsupervised Domain Adaptation for Learning Discrete Probability Distributions of Image Emotions

Few-Shot Adaptation for Video Semantic Indexing

Historical Context-based Style Classification of Painting Images via Label Distribution Learning

Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation

High-Quality Exposure Correction of Underexposed Photos

Fashion Sensitive Clothing Recommendation using Hierarchical Collocation Model

A Margin-based MLE for Crowdsourced Partial Ranking

Personalized Serious Games for Cognitive Intervention with Lifelog Visual Analytics

PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation

iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying Camera

Face-Voice Matching using Cross-modal Embeddings

Multi-Scale Context Attention Network for Image Retrieval

When Deep Fool Meets Deep Prior: Adversarial Attack on Image Super-Resolution

Musicality-Novelty Generative Adversarial Nets for Algorithmic Composition

Knowledge-aware Multimodal Dialogue Systems

Cross-Domain Adversarial Feature Learning for Sketch Re-identification

Comprehensive Distance-Preserving Autoencoders for Cross-Modal Retrieval

Facial Expression Recognition Enhanced by Thermal Images through Adversarial Learning

CSAN: Contextual Self-Attention Network for User Sequential Recommendation
Semantic Human Matting

Visual Spatial Attention Network for Relationship Detection

Geometry Guided Adversarial Facial Expression Synthesis

Personalized multiple facial action unit recognition through generative adversarial recognition network

Learning Joint Multimodal Representation with Adversarial Attention Networks

Detecting Abnormality without Knowing Normality: A Two-stage Approach for Unsupervised Video Abnormal Event Detection

WildFish: A Large Benchmark for Fish Recognition in the Wild

Temporal Hierarchical Attention at Category- and Item-Level for Micro-Video Click-Through Prediction

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network

Songle Sync: A Large-Scale Web-based Platform for Controlling Various Devices in Synchronization with Music

CloudVR: Cloud Accelerated Interactive Mobile Virtual Reality

RGCNN: Regularized Graph CNN for Point Cloud Segmentation

Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework

Photo Squarization by Deep Multi-Operator Retargeting

Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams

Semantic Image Inpainting with Progressive Generative Networks

Attentive Interactive Convolutional Matching for Community Question Answering in Social Multimedia

Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval

LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation

Direction-aware Neural Style Transfer

Reconfigurable Inverted Index

Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization

Context-Aware Visual Policy Network for Sequence-Level Image Captioning

A Unified Framework for Multimodal Domain Adaptation

Trusted Guidance Pyramid Network for Human Parsing

USAR: an interactive user-specific aesthetic ranking framework for images

Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining

Structure Guided Photorealistic Style Transfer

Tracking-assisted Weakly Supervised Online Visual Object Segmentation in Unconstrained Videos

An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks

Decoupled Novel Object Captioner

ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network

Optimizing Personalized Interaction Experience in Crowd-Interactive Livecast: A Cloud-Edge Approach

End-to-End Blind Quality Assessment of Compressed Video Using Deep Neural Networks

Dynamic Sound Field Synthesis for Speech and Music Optimization

Local Convolutional Neural Networks for Person Re-Identification

Interpretable Multimodal Retrieval for Fashion Products

Conditional Expression Synthesis with Face Parsing Transformation

A Feature-Adaptive Semi-Supervised Framework for Co-Saliency Detection

Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification

iSPA-Net: Iterative Semantic Pose Alignment Network

Extractive Video Summarizer with Memory Augmented Neural Networks

ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images

From data to knowledge: deep learning model compression, transmission and communication

ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer

Dest-ResNet: a Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction

Boosting Scene Parsing Performance via Reliable Scale Prediction

Deep Cross modal learning for Caricature Verification and Identification (CaVINet)

Online Action Tube Detection via Resolving the Spatio-temporal Context Pattern

Adaptive Temporal Encoding Network for Video Instance-level Human Parsing

User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks

Enhancing Visual Question Answering Using Dropout

Online Inter-Camera Trajectory Association Exploiting Person Re-Identification and Camera Topology

Improving QoE of ABR Streaming Sessions through QUIC Retransmissions

Temporal Cross-Media SubSpaces Learning with Soft-Constraints

Learning Local Descriptors with Adversarial Enhancer from Volumetric Geometry Patches

SibNet: Sibling Convolutional Encoder for Video Captioning

Context-Dependent Diffusion Network for Visual Relationship Detection

Your Attention is Unique: Detecting 360-Degree Video Saliency in Head-Mounted Display for Head Movement Prediction

Generating Defensive Plays in Basketball Games

Connectionist temporal fusion for Sign Language Translation

JPEG Decompression in the Homomorphic Encryption Domain

BitStream: Efficient Computing Architecture for Real-Time Low-Power Inference of Binary Neural Networks on CPUs

Support Neighbor Loss for Person Re-Identification

A Large Scale RGB-D Database for Arbitrary-view Human Action Recognition

FlexStream: Towards Flexible Adaptive Video Streaming on End Devices using Extreme SDN

Spotting and Aggregating Salient Regions for Video Captioning

Structural inpainting

Partial Multi-View Subspace Clustering

FoV-Aware Edge Caching for Adaptive 360° Video Streaming

Attentive LSTM Crowd Flow Machines

Perceptual Temporal Incoherence Aware Stereo Video Retargeting

Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels

Dense Auto-Encoder Hashing for Robust Cross-Modality Retrieval

Investigation of Small Group Social Interactions using Deep Visual Activity-Based Nonverbal Features

Dissimilarity Representation Learning for Generalized Zero-Shot Recognition

Examine before You Answer: Multi-task Learning with Adaptive-attentions for Multiple-choice VQA

Cumulative Nets for Edge Detection

Beyond the Product: Discovering Image Posts for Brands in Social Media

Robustness and Discrimination Oriented Hashing Combining Texture and Invariant Vector Distance

SLIONS: A Karaoke Application to Enhance Foreign Language Learning

Drawing in a Virtual 3D Space – Introducing VR Drawing in Elementary School Art Education

Semi-Supervised DFF: Decoupling Detection and Feature Flow for Video Object Detectors

Residual-Guide Feature Fusion Network for Single Image Deraining

Paragraph generation network with visual relationship detection

Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction

CIRCE: Real-Time Caching for Instance Recognition on Cloud Environments and Multi-Core Architectures

From Volcano to Toyshop: Adaptive Discriminative Region Discovery for Scene Recognition

Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches

Learning to Synthesize 3D Indoor Scenes from Monocular Images

DASH for 3D Networked Virtual Environment

PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition

The Effect of Foveation on High Dynamic Range Video Perception

GestureGAN for Hand Gesture-to-Gesture Translation in the Wild

MiniView Layout for Bandwidth-Efficient 360-Degree Video

An Efficient Deep Quantized Compressed Sensing Coding Framework of Natural Images

Deep Multimodal Image-Repurposing Detection

Video-to-Video Translation with Global Temporal Consistency

Robust Correlation Filter Tracking with Shepherded Instance-Aware Proposals

Cross-Species Learning: A Low-Cost Approach to Learning Human Fight from Animal Fight

PoB: Toward Reasoning Patterns of Beauty in Image Data

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval

Deep Adaptive Temporal Pooling for Activity Recognition

Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder

Pseudo Transfer with Marginalized Corrupted Attribute for Zero-shot Learning

Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation

Person Re-identification with Hierarchical Deep Learning Feature and efficient XQDA Metric

EmoCeleb: Emotion recognition in speech using Cross-Modal Transfer in the wild

随便搜一个‘generative’关键词就有14篇文章，可见gan和vae等仍然是一个大方向。

本文标签：链接会议论文 ACM arxiv

版权声明：本文标题：2018年ACM MM会议论文 arXiv链接内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://m.elefans.com/dongtai/1725458796a1024530.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

电子爱好者 - 最新技术资讯及电子产品介绍！

2018年ACM MM会议论文 arXiv链接

更多相关文章

报告视频录制：腾讯会议录屏+人像画中画特效

java调用腾讯会议api，开会录制问题

基于SpringBoot和Vue电影购票管理系统的设计与实现论文

Linux下WIFI已链接但无法上网

Linux中init和systemd进程、ps命令和top命令、进程和线程介绍及软硬链接的相关理论

Re41：读论文 NumLJP Judicial knowledge‑enhanced magnitude‑aware reasoning for numerical legal judgment p

[论文笔记]彻底讲透FCN语义分割开山之作Fully Convolutional Networks

微信里的网址链接域名如何自动跳转到外部浏览器访问源码

当浏览器访问一个链接时计算机都做了哪些事

通过一个链接打开本地app，或者去下载app

IOS之 点击链接跳转到App Store指定App(应用程序)

微信扫描二维码跳转手机默认浏览器打开下载app的链接是怎么实现的

Mac电脑Android模拟器连不上网,mac 解决安卓模拟器链接不上网络

计算机读取数据的接囗教程,八爪鱼采集怎样获取数据API链接 八爪鱼采集获取数据API链接的方法...

不用再找了，吐血整理ChatGPT 新手使用手册~ (论文润色、降重指令)

亲测好用，ChatGPT 3.54.0新手使用手册~ 【论文润色、降重、扩写指令】

基于Springboot的Java邮件系统的设计与实现（附论文和源码）

解决 idea 已经配置好 但浏览器链接不了端口的问题

超详细的免费下载论文方法

四、ESP32链接WIFI

发表评论

推荐文章

excel快捷键大全常用分享

win11系统安装虚拟机VMmare pro和ubuntu20.04

u盘获取计算机管理员权限,复制文件到u盘需要管理员权限怎么办

Chrome浏览器取色器

k3刷机 重置_斐讯K3刷机教程：一直重启、忘了密码怎么办？手机刷机包下载

热门文章

kaggle的kernel-only比赛中出现Your Notebook cannot use internet access in this competition解决方案

再也不用举手之劳了？用ChatGPT评估代码生成的质量[期刊论文翻译]

win10如何调整计算机时间同步,Win10系统如何设置时间同步间隔？修改时间同步频率的方法...

iMazing许可证编号如何激活苹果版手机管理器支持 WinMac 双平台

电脑改成,如何把电脑变成无线路由器

如何重装系统windows7,怎么重装系统windows7

【souapp搜应用】:可牛杀毒软件连android手机应用.apk都误杀，真是头傻妞！

不再靠你买显卡充值信仰，英伟达已经变了

羊驼2:开放的基础和微调聊天模型--Llama 2论文阅读

Manjaro 18.0.4 Illyria安装搜狗拼音输入法踩坑

最新文章

WiFi和WLAN有什么区别和联系？

公共wifi不安全家里的wifi就安全了吗？

路由器wifi热点丢包率高_使用笔记本电脑和虚拟路由器创建自己的Wifi热点

无线路由器服务器拒接,wifi被拒绝接入解决方法(图文)

WiFi篇（一）-WiFi“黑”暗的一面

如何给自己各种帐号编一个安全又不会忘记的密码？

ESP8266 Node mcu WIFI无线控制入门_01无线远程控制LED

看自己的Wifi是否被盗用的技巧

【Android wifi】wifi基本原理

【Android工程师与智能家居产品的第一次接触②】给设备配网 Esp8266 wifi模块的快速配网和AP配网简介（付Android demo）

【智能家居篇】wifi网络接入原理（中）——认证Authentication

Android Wifi连接控制、TCP、UDP通信，6.0以上适配

网络安全--解除认证攻击wifi(详细教程)

WIFI 一键配置原理-ESP8266

openwrt折腾记4-开通ipv6( wifi-client模式下)

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

【生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD】生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD 官方免费下载

【模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311】模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311 官方免费下载

【模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311】模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311 官方免费下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改 官方免费下载

如何实现高效的treenode搜索算法

treenode与链表有何本质区别

在哪些场景下应优先考虑使用treenode

treenode在树形结构中的角色是什么

如何通过treenode实现二叉树

IOS之点击链接跳转到App Store指定App(应用程序)

计算机读取数据的接囗教程,八爪鱼采集怎样获取数据API链接八爪鱼采集获取数据API链接的方法...

解决 idea 已经配置好但浏览器链接不了端口的问题

k3刷机重置_斐讯K3刷机教程：一直重启、忘了密码怎么办？手机刷机包下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载