说明: We present a new compact but dense representation of scene geometry which is conditioned on the intensity data from a single image and generated from a code consisting of a small number of parameters. We are inspired by work both on learned depth fr <algofei> 在 上传 | 大小:887808
说明: This paper addresses the problem of video object segmentation, where the initial object mask is given in the first frame of an input video. We propose a novel spatiotemporal Markov Random Field (MRF) model defined over pixels to handle this problem. <algofei> 在 上传 | 大小:2097152
说明: Observing that Semantic features learned in an image classification task and Appearance features learned in a similarity matching task complement each other, we build a twofold Siamese network, named SA-Siam, for real-time object tracking. SA-Siam i <algofei> 在 上传 | 大小:1048576
说明: Neural networks have recently become good at engaging in dialog. However, current approaches are based solely on verbal text, lacking the richness of a real face-to-face conversation. We propose a neural conversation model that aims to read and gene <algofei> 在 上传 | 大小:990208
说明: We present a fast inverse-graphics framework for instance-level 3D scene understanding. We train a deep convolutional network that learns to map image regions to the full 3D shape and pose of all object instances in the image. Our method produces a <algofei> 在 上传 | 大小:3145728
说明: Convolutional networks are the de-facto standard for analyzing spatio-temporal data such as images, videos, and 3D shapes. Whilst some of this data is naturally dense (e.g., photos), many other data sources are inherently sparse. Examples include 3D <algofei> 在 上传 | 大小:630784
说明: We propose a scalable, efficient and accurate approach to retrieve 3D models for objects in the wild. Our contri- bution is twofold. We first present a 3D pose estimation approach for object categories which significantly outper- forms the state-of- <algofei> 在 上传 | 大小:1048576