Abstract: This paper proposes a multimodal adaptive convolutional neural network (MA-CNN) that aims to improve learning performance on complex tasks by efficiently fusing data from multiple modalities.