Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 19

MDSSD: multi-scale deconvolutional single shot detector for small objects
Cui, Lisha; Ma, Rui; Lv, Pei; Jiang, Xiaoheng; Gao, Zhimin; Zhou, Bing; Xu, Mingliang
Sci China Inf Sci, 2020, 63(2): 120113
Keywords: object detection; small objects; multi-scale deconvolution; fusion block; real-time
Cite as: Cui L S, Ma R, Lv P, et al. MDSSD: multi-scale deconvolutional single shot detector for small objects. Sci China Inf Sci, 2020, 63(2): 120113, doi: 10.1007/s11432-019-2723-1

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Homepage Cited in SCI: 18

Jittor: a novel deep learning framework with meta-operators and unified graph execution
Hu, Shi-Min; Liang, Dun; Yang, Guo-Ye; Yang, Guo-Wei; Zhou, Wen-Yang
Sci China Inf Sci, 2020, 63(12): 222103
Keywords: deep learning framework; meta-operator; unified graph execution; jit compilation; generative adversarial network
Cite as: Hu S-M, Liang D, Yang G-Y, et al. Jittor: a novel deep learning framework with meta-operators and unified graph execution. Sci China Inf Sci, 2020, 63(12): 222103, doi: 10.1007/s11432-020-3097-4

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 14

PSC-Net: learning part spatial co-occurrence for occluded pedestrian detection
Xie, Jin; Pang, Yanwei; Cholakkal, Hisham; Anwer, Rao; Khan, Fahad; Shao, Ling
Sci China Inf Sci, 2021, 64(2): 120103
Keywords: pedestrian detection; graph convolutional network; occlusion; object detection; feature extraction
Cite as: Xie J, Pang Y W, Cholakkal H, et al. PSC-Net: learning part spatial co-occurrence for occluded pedestrian detection. Sci China Inf Sci, 2021, 64(2): 120103, doi: 10.1007/s11432-020-2969-8

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 14

Task-wise attention guided part complementary learning for few-shot image classification
Cheng, Gong; Li, Ruimin; Lang, Chunbo; Han, Junwei
Sci China Inf Sci, 2021, 64(2): 120104
Keywords: few-shot learning; meta-learning; task-wise attention; part complementary learning
Cite as: Cheng G, Li R M, Lang C B, et al. Task-wise attention guided part complementary learning for few-shot image classification. Sci China Inf Sci, 2021, 64(2): 120104, doi: 10.1007/s11432-020-3156-7

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 13

CGNet: cross-guidance network for semantic segmentation
Zhang, Zhijie; Pang, Yanwei
Sci China Inf Sci, 2020, 63(2): 120104
Keywords: semantic segmentation; fully convolutional networks; pyramid network; edge detection; saliency detection; cross-guidance
Cite as: Zhang Z J, Pang Y W. CGNet: cross-guidance network for semantic segmentation. Sci China Inf Sci, 2020, 63(2): 120104, doi: 10.1007/s11432-019-2718-7

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 12

FACLSTM: ConvLSTM with focused attention for scene text recognition
Wang, Qingqing; Huang, Ye; Jia, Wenjing; He, Xiangjian; Blumenstein, Michael; Lyu, Shujing; Lu, Yue
Sci China Inf Sci, 2020, 63(2): 120103
Keywords: scene text recognition; convolutional lstm; focused attention; spatial correlation; sequential prediction
Cite as: Wang Q Q, Huang Y, Jia W J, et al. FACLSTM: ConvLSTM with focused attention for scene text recognition. Sci China Inf Sci, 2020, 63(2): 120103, doi: 10.1007/s11432-019-2713-1

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 10

Preserving details in semantics-aware context for scene parsing
Ma, Shuai; Pang, Yanwei; Pan, Jing; Shao, Ling
Sci China Inf Sci, 2020, 63(2): 120106
Keywords: fully convolutional networks; semantic segmentation; cityscapes; semantic-aware context
Cite as: Ma S, Pang Y W, Pan J, et al. Preserving details in semantics-aware context for scene parsing. Sci China Inf Sci, 2020, 63(2): 120106, doi: 10.1007/s11432-019-2738-y

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 8

Triple discriminator generative adversarial network for zero-shot image classification
Ji, Zhong; Yan, Jiangtao; Wang, Qiang; Pang, Yanwei; Li, Xuelong
Sci China Inf Sci, 2021, 64(2): 120101
Keywords: zero-shot classification; generative adversarial nets; text reconstruction; sharma-mittal entropy
Cite as: Ji Z, Yan J T, Wang Q, et al. Triple discriminator generative adversarial network for zero-shot image classification. Sci China Inf Sci, 2021, 64(2): 120101, doi: 10.1007/s11432-020-3032-8

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 7

Ordinal distribution regression for gait-based age estimation
Zhu, Haiping; Zhang, Yuheng; Li, Guohao; Zhang, Junping; Shan, Hongming
Sci China Inf Sci, 2020, 63(2): 120102
Keywords: computer vision; deep learning; ordinal distribution regression; global and local features; gait-based age estimation
Cite as: Zhu H P, Zhang Y H, Li G H, et al. Ordinal distribution regression for gait-based age estimation. Sci China Inf Sci, 2020, 63(2): 120102, doi: 10.1007/s11432-019-2733-4

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 6

Leveraging 3D blendshape for facial expression recognition using CNN
Wang, Sa; Cheng, Zhengxin; Deng, Xiaoming; Chang, Liang; Duan, Fuqing; Lu, Ke
Sci China Inf Sci, 2020, 63(2): 120114
Keywords: facial expression recognition; convolutional neural network; two-stream network; 3d face blendshape; face representation
Cite as: Wang S, Cheng Z X, Deng X M, et al. Leveraging 3D blendshape for facial expression recognition using CNN. Sci China Inf Sci, 2020, 63(2): 120114, doi: 10.1007/s11432-019-2747-y

计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 6

Human motion segmentation based on structure constraint matrix factorization
Gao, Hongbo; Guo, Fang; Zhu, Juping; Kan, Zhen; Zhang, Xinyu
Sci China Inf Sci, 2022, 65(1): 119103
Keywords: computer vision; human motion segmentation; spectral clustering; structure constraint; matrix factorization
Cite as: Gao H B, Guo F, Zhu J P, et al. Human motion segmentation based on structure constraint matrix factorization. Sci China Inf Sci, 2022, 65(1): 119103, doi: 10.1007/s11432-020-2967-3

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 5

SynthText3D: synthesizing scene text images from 3D virtual worlds
Liao, Minghui; Song, Boyu; Long, Shangbang; He, Minghang; Yao, Cong; Bai, Xiang
Sci China Inf Sci, 2020, 63(2): 120105
Keywords: optical character recognition (ocr); synthetic data; scene text detection; 3d; deep learning
Cite as: Liao M H, Song B Y, Long S B, et al. SynthText3D: synthesizing scene text images from 3D virtual worlds. Sci China Inf Sci, 2020, 63(2): 120105, doi: 10.1007/s11432-019-2737-0

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 5

Learning hyperspectral images from RGB images via a coarse-to-fine CNN
Mei, Shaohui; Geng, Yunhao; Hou, Junhui; Du, Qian
Sci China Inf Sci, 2022, 65(5): 152102
Keywords: hyperspectral; reconstruction; convolutional neural network; deep learning
Cite as: Mei S H, Geng Y H, Hou J H, et al. Learning hyperspectral images from RGB images via a coarse-to-fine CNN. Sci China Inf Sci, 2022, 65(5): 152102, doi: 10.1007/s11432-020-3102-9

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 4

Anomaly detection by exploiting the tracking trajectory in surveillance videos
Xue, Zixuan; Wu, Wei
Sci China Inf Sci, 2020, 63(5): 154101
Keywords: anomaly detection; object detection; tracking trajectory; double fusion method; trajectory association
Cite as: Xue Z X, Wu W. Anomaly detection by exploiting the tracking trajectory in surveillance videos. Sci China Inf Sci, 2020, 63(5): 154101, doi: 10.1007/s11432-018-9792-8

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 4

InStereo2K: a large real dataset for stereo matching in indoor scenes
Bao, Wei; Wang, Wei; Xu, Yuhua; Guo, Yulan; Hong, Siyu; Zhang, Xiaohu
Sci China Inf Sci, 2020, 63(11): 212101
Keywords: stereo matching; depth estimation; convolutional neural network; dataset
Cite as: Bao W, Wang W, Xu Y H, et al. InStereo2K: a large real dataset for stereo matching in indoor scenes. Sci China Inf Sci, 2020, 63(11): 212101, doi: 10.1007/s11432-019-2803-x

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 3

Large margin deep embedding for aesthetic image classification
Guo, Guanjun; Wang, Hanzi; Yan, Yan; Zhang, Liming; Li, Bo
Sci China Inf Sci, 2020, 63(1): 119101
Keywords: aesthetic classification; large margin; deep embedding; deep metric learning; joint loss
Cite as: Guo G J, Wang H Z, Yan Y, et al. Large margin deep embedding for aesthetic image classification. Sci China Inf Sci, 2020, 63(1): 119101, doi: 10.1007/s11432-018-9567-8

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 3

Ordered matrix representation supporting the visual analysis of associated data
Chen, Yi; Lv, Cheng; Li, Yue; Chen, Wei; Ma, Kwan-Liu
Sci China Inf Sci, 2020, 63(8): 184101
Keywords: visualization; rw-rank; ordered matrix; associated data; pesticide residue data
Cite as: Chen Y, Lv C, Li Y, et al. Ordered matrix representation supporting the visual analysis of associated data. Sci China Inf Sci, 2020, 63(8): 184101, doi: 10.1007/s11432-019-2647-3

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 3

Learning generalizable deep feature using triplet-batch-center loss for person re-identification
Hu, Bin; Xu, Jiwei; Wang, Xinggang
Sci China Inf Sci, 2021, 64(2): 120111
Keywords: person re-identification; triplet loss; triplet-batch-center loss; metric learning; deep learning
Cite as: Hu B, Xu J W, Wang X G. Learning generalizable deep feature using triplet-batch-center loss for person re-identification. Sci China Inf Sci, 2021, 64(2): 120111, doi: 10.1007/s11432-019-2943-6

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 2

Multi-attention based cross-domain beauty product image retrieval
Wang, Zhihui; Liu, Xing; Lin, Jiawen; Yang, Caifei; Li, Haojie
Sci China Inf Sci, 2020, 63(2): 120112
Keywords: beauty product image retrieval; saliency attention mechanism; text attention mechanism; local feature aggregation; multi-attention classification network
Cite as: Wang Z H, Liu X, Lin J W, et al. Multi-attention based cross-domain beauty product image retrieval. Sci China Inf Sci, 2020, 63(2): 120112, doi: 10.1007/s11432-019-2721-0

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 2

Visualization of COVID-19 spread based on spread and extinction indexes
Zhang, Song-Hai; Cai, Yun; Li, Jian
Sci China Inf Sci, 2020, 63(6): 164102
Keywords: visual analytics; spread index; extinction index; themeriver; bubble chart; covid-19
Cite as: Zhang S-H, Cai Y, Li J. Visualization of COVID-19 spread based on spread and extinction indexes. Sci China Inf Sci, 2020, 63(6): 164102, doi: 10.1007/s11432-020-2828-1

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 2

Recursive narrative alignment for movie narrating
Han, Zhongyi; Wu, Hongbo; Wei, Benzheng; Yin, Yilong; Li, Shuo
Sci China Inf Sci, 2020, 63(7): 174101
Keywords: movie narrating; visual captioning; long short-term memory networks; deep learning; narrative
Cite as: Han Z Y, Wu H B, Wei B Z, et al. Recursive narrative alignment for movie narrating. Sci China Inf Sci, 2020, 63(7): 174101, doi: 10.1007/s11432-018-9908-4

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 2

Discriminative fine-grained network for vehicle re-identification using two-stage re-ranking
Wang, Qi; Min, Weidong; He, Daojing; Zou, Song; Huang, Tiemei; Zhang, Yu; Liu, Ruikang
Sci China Inf Sci, 2020, 63(11): 212102
Keywords: vehicle re-identification; dfn; two-stage re-ranking; fine-grained; jaccard metric
Cite as: Wang Q, Min W D, He D J, et al. Discriminative fine-grained network for vehicle re-identification using two-stage re-ranking. Sci China Inf Sci, 2020, 63(11): 212102, doi: 10.1007/s11432-019-2811-8

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 2

Reading comprehension based on visualization of eye tracking and EEG data
Cheng, Shiwei; Hu, Yilin; Fan, Jing; Wei, Qianjing
Sci China Inf Sci, 2020, 63(11): 214101
Keywords: eye tracking; human computer interaction; user interface; brain computer interaction; information visualization
Cite as: Cheng S W, Hu Y L, Fan J, et al. Reading comprehension based on visualization of eye tracking and EEG data. Sci China Inf Sci, 2020, 63(11): 214101, doi: 10.1007/s11432-019-1466-7

计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 2

Human-in-the-loop image segmentation and annotation
Zhang, Xiaoya; Wang, Lianjie; Xie, Jin; Zhu, Pengfei
Sci China Inf Sci, 2020, 63(11): 219101
Keywords: semantic segmentation; active learning; human-machine collaboration; pseudo labelling; image annotation
Cite as: Zhang X Y, Wang L J, Xie J, et al. Human-in-the-loop image segmentation and annotation. Sci China Inf Sci, 2020, 63(11): 219101, doi: 10.1007/s11432-019-2759-y

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 2

Quantized and adaptive memristor based CNN (QA-mCNN) for image processing
Hu, Xiaofang; Shi, Wenqiang; Zhou, Yue; Tang, Hongan; Duan, Shukai
Sci China Inf Sci, 2022, 65(1): 119104
Keywords: adaptive template; incremental network quantization; optimization; cnn; memristor
Cite as: Hu X F, Shi W Q, Zhou Y, et al. Quantized and adaptive memristor based CNN (QA-mCNN) for image processing. Sci China Inf Sci, 2022, 65(1): 119104, doi: 10.1007/s11432-020-3031-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 2

RLLNet: a lightweight remaking learning network for saliency redetection on RGB-D images
Zhou, Wujie; Liu, Chang; Lei, Jingsheng; Yu, Lu
Sci China Inf Sci, 2022, 65(6): 160107
Keywords: deep learning; rgb-d image; saliency detection; remaking learning; lightweight network
Cite as: Zhou W J, Liu C, Lei J S, et al. RLLNet: a lightweight remaking learning network for saliency redetection on RGB-D images. Sci China Inf Sci, 2022, 65(6): 160107, doi: 10.1007/s11432-020-3337-9

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 1

Spatiotemporal consistency-based adaptive hand-held video stabilization
Li, Xiao; Li, Shuai; Qin, Hong; Hao, Aimin
Sci China Inf Sci, 2020, 63(1): 114101
Keywords: hand-held camera; adaptive video stabilization; spatial structure consistency; self-adaptive imf selection; feature-centric emd
Cite as: Li X, Li S, Qin H, et al. Spatiotemporal consistency-based adaptive hand-held video stabilization. Sci China Inf Sci, 2020, 63(1): 114101, doi: 10.1007/s11432-018-9764-0

计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 1

Effective two-view line segment reconstruction based on structure priors
Wang, Wei; Cui, Hainan; Gao, Wei; Hu, Zhanyi
Sci China Inf Sci, 2020, 63(1): 119102
Keywords: line segment matching; plane fitting; 3d reconstruction; energy optimization; structure prior
Cite as: Wang W, Cui H N, Gao W, et al. Effective two-view line segment reconstruction based on structure priors. Sci China Inf Sci, 2020, 63(1): 119102, doi: 10.1007/s11432-019-9867-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 EDITORIAL Website SpringerLink Google Scholar Cited in SCI: 1

Special focus on deep learning for computer vision
Bai, Xiang; Pang, Yanwei; Zhang, Guofeng
Sci China Inf Sci, 2020, 63(2): 120100
Cite as: Bai X, Pang Y W, Zhang G F. Special focus on deep learning for computer vision. Sci China Inf Sci, 2020, 63(2): 120100, doi: 10.1007/s11432-020-2766-x

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Progressive rectification network for irregular text recognition
Gao, Yunze; Chen, Yingying; Wang, Jinqiao; Lu, Hanqing
Sci China Inf Sci, 2020, 63(2): 120101
Keywords: irregular text recognition; progressive rectification; iterative refinement
Cite as: Gao Y Z, Chen Y Y, Wang J Q, et al. Progressive rectification network for irregular text recognition. Sci China Inf Sci, 2020, 63(2): 120101, doi: 10.1007/s11432-019-2710-7

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 1

Discriminative stacked autoencoder for feature representation and classification
Gao, Yiping; Li, Xinyu; Gao, Liang
Sci China Inf Sci, 2020, 63(2): 120111
Keywords: stacked autoencoders; discriminative feature representation; hybrid pretraining; classification; deep learning
Cite as: Gao Y P, Li X Y, Gao L. Discriminative stacked autoencoder for feature representation and classification. Sci China Inf Sci, 2020, 63(2): 120111, doi: 10.1007/s11432-019-2722-3

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 1

Detail-preserving smoke simulation using an efficient high-order numerical scheme
Zhu, Jian; Yang, Zhuo; Sun, Hanqiu; Wu, Enhua; Cai, Ruichu; Hao, Zhifeng
Sci China Inf Sci, 2020, 63(6): 164101
Keywords: constrained interpolation profile; dimensional splitting; high-order advection; taylor expansion; smoke simulation
Cite as: Zhu J, Yang Z, Sun H Q, et al. Detail-preserving smoke simulation using an efficient high-order numerical scheme. Sci China Inf Sci, 2020, 63(6): 164101, doi: 10.1007/s11432-018-9889-8

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 1

Mobile person re-identification with a lightweight trident CNN
Xiong, Mingfu; Chen, Dan; Lu, Xiaoqiang
Sci China Inf Sci, 2020, 63(11): 219102
Keywords: smart mobile device; person re-identification; trident cnn; smart city; video surveillance
Cite as: Xiong M F, Chen D, Lu X Q. Mobile person re-identification with a lightweight trident CNN. Sci China Inf Sci, 2020, 63(11): 219102, doi: 10.1007/s11432-019-2782-3

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 1

Face-sketch learning with human sketch-drawing order enforcement
Chang, Liang; Jin, Lihua; Weng, Lifen; Chao, Wentao; Wang, Xuguang; Deng, Xiaoming; Dong, Qiulei
Sci China Inf Sci, 2020, 63(11): 219103
Keywords: face sketch synthesis; deep neural network; order enforcement; image synthesis; generative adversarial network
Cite as: Chang L, Jin L H, Weng L F, et al. Face-sketch learning with human sketch-drawing order enforcement. Sci China Inf Sci, 2020, 63(11): 219103, doi: 10.1007/s11432-019-2890-8

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Supplementary Cited in SCI: 1

Color and direction-invariant nonlocal self-similarity prior and its application to color image denoising
Xie, Qi; Zhao, Qian; Xu, Zongben; Meng, Deyu
Sci China Inf Sci, 2020, 63(12): 222101
Keywords: color image denoising; nonlocal self-similarity; gaussian mixture model; maximum a posterior (map) model; em algorithm
Cite as: Xie Q, Zhao Q, Xu Z B, et al. Color and direction-invariant nonlocal self-similarity prior and its application to color image denoising. Sci China Inf Sci, 2020, 63(12): 222101, doi: 10.1007/s11432-020-2880-3

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 1

Semantic part segmentation of single-view point cloud
Peng, Haotian; Zhou, Bin; Yin, Liyuan; Guo, Kan; Zhao, Qinping
Sci China Inf Sci, 2020, 63(12): 224101
Keywords: computers and graphics; single-view point cloud; annotation transfer; semantic part segmentation; category independent matching; point clouds registration
Cite as: Peng H T, Zhou B, Yin L Y, et al. Semantic part segmentation of single-view point cloud. Sci China Inf Sci, 2020, 63(12): 224101, doi: 10.1007/s11432-018-9689-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Learning efficient text-to-image synthesis via interstage cross-sample similarity distillation
Mao, Fengling; Ma, Bingpeng; Chang, Hong; Shan, Shiguang; Chen, Xilin
Sci China Inf Sci, 2021, 64(2): 120102
Keywords: generative adversarial network; gan; text-to-image synthesis; knowledge distillation
Cite as: Mao F L, Ma B P, Chang H, et al. Learning efficient text-to-image synthesis via interstage cross-sample similarity distillation. Sci China Inf Sci, 2021, 64(2): 120102, doi: 10.1007/s11432-020-2900-x

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Supplementary Homepage Cited in SCI: 1

Neural compositing for real-time augmented reality rendering in low-frequency lighting environments
Ma, Shengjie; Shen, Qian; Hou, Qiming; Ren, Zhong; Zhou, Kun
Sci China Inf Sci, 2021, 64(2): 122101
Keywords: augmented reality; neural networks; differentiable renderer; reflection; shadow
Cite as: Ma S J, Shen Q, Hou Q M, et al. Neural compositing for real-time augmented reality rendering in low-frequency lighting environments. Sci China Inf Sci, 2021, 64(2): 122101, doi: 10.1007/s11432-020-3024-5

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 1

Designing and deploying a mixed-reality aquarium for cognitive training of young children with autism spectrum disorder
Liu, Juan; Bian, Yulong; Yuan, Yanran; Xi, Yuting; Geng, Wenxiu; Jin, Xinpei; Gai, Wei; Fan, Xiangmin; Tian, Feng; Meng, Xiangxu; Yang, Chenglei
Sci China Inf Sci, 2021, 64(5): 154101
Keywords: autism spectrum disorder; mixed reality; cognitive training; young asd children; user study; quantitative research
Cite as: Liu J, Bian Y L, Yuan Y R, et al. Designing and deploying a mixed-reality aquarium for cognitive training of young children with autism spectrum disorder. Sci China Inf Sci, 2021, 64(5): 154101, doi: 10.1007/s11432-020-2941-7

Special Focus on Visual Computing with Machine Learning
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Single-view facial reflectance inference with a differentiable renderer
Geng, Jiahao; Weng, Yanlin; Wang, Lvdi; Zhou, Kun
Sci China Inf Sci, 2021, 64(11): 210101
Keywords: facial modeling; reflectance inference; differentiable renderer
Cite as: Geng J H, Weng Y L, Wang L D, et al. Single-view facial reflectance inference with a differentiable renderer. Sci China Inf Sci, 2021, 64(11): 210101, doi: 10.1007/s11432-020-3236-2

计算机 图形图像 MOOP Website SpringerLink Google Scholar Homepage Cited in SCI: 1

Automatic image matting and fusing for portrait synthesis
Yi, Zhike; Song, Wenfeng; Li, Shuai; Hao, Aimin
Sci China Inf Sci, 2022, 65(2): 124101
Keywords: neural network; automatic image matting; image fusion; deep learning; gradient domain
Cite as: Yi Z K, Song W F, Li S, et al. Automatic image matting and fusing for portrait synthesis. Sci China Inf Sci, 2022, 65(2): 124101, doi: 10.1007/s11432-021-3279-y

计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 1

Self-adjustable hyper-graphs for video pose estimation based on spatial-temporal subspace construction
Ma, Jizhou; Li, Shuai; Qin, Hong; Hao, Aimin; Zhao, Qinping
Sci China Inf Sci, 2022, 65(3): 139101
Keywords: video pose estimation; self-adjustable hyper-graph; spatial-temporal subspace exploration; maximum matching subspace operator; action pattern
Cite as: Ma J Z, Li S, Qin H, et al. Self-adjustable hyper-graphs for video pose estimation based on spatial-temporal subspace construction. Sci China Inf Sci, 2022, 65(3): 139101, doi: 10.1007/s11432-019-2869-x

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
Wang, Yang; Peng, Jinjia; Wang, Huibing; Wang, Meng
Sci China Inf Sci, 2022, 65(6): 160103
Keywords: data adaptation module; weighted label smoothing loss; multi-scale attention network; vehicle re-identification
Cite as: Wang Y, Peng J J, Wang H B, et al. Progressive learning with multi-scale attention network for cross-domain vehicle re-identification. Sci China Inf Sci, 2022, 65(6): 160103, doi: 10.1007/s11432-021-3383-y

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

A flexible technique to select objects via convolutional neural network in VR space
Li, Huiyu; Fan, Linwei
Sci China Inf Sci, 2020, 63(1): 112101
Keywords: convolutional neural network; interaction techniques; pose estimation; virtual reality; 3d selection
Cite as: Li H Y, Fan L W. A flexible technique to select objects via convolutional neural network in VR space. Sci China Inf Sci, 2020, 63(1): 112101, doi: 10.1007/s11432-019-1517-3

计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 0

Keywords: fractional fourier transform; face recognition; convolutional neural networks; deep spatial-frequency feature; illumination
Cite as: Wu X, Tao R, Hong D F, et al. The FrFT convolutional face: toward robust face recognition using the fractional Fourier transform and convolutional neural networks. Sci China Inf Sci, 2020, 63(1): 119103, doi: 10.1007/s11432-018-9862-9

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 0

A spatial structural similarity triplet loss for auxiliary vehicle re-identification
Zhu, Jianqing; Liu, Liu; Zhu, Xiaobin; Zeng, Huanqiang
Sci China Inf Sci, 2021, 64(7): 179104
Keywords: vehicle re-identification; spatial strutural simialrity; triplet loss; deep learning; video surveillance system
Cite as: Zhu J Q, Liu L, Zhu X B, et al. A spatial structural similarity triplet loss for auxiliary vehicle re-identification. Sci China Inf Sci, 2021, 64(7): 179104, doi: 10.1007/s11432-020-3004-7

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Learning to focus: cascaded feature matching network for few-shot image recognition
Chen, Mengting; Wang, Xinggang; Luo, Heng; Geng, Yifeng; Liu, Wenyu
Sci China Inf Sci, 2021, 64(9): 192105
Keywords: few-shot learning; image recognition; feature matching; self-attention
Cite as: Chen M T, Wang X G, Luo H, et al. Learning to focus: cascaded feature matching network for few-shot image recognition. Sci China Inf Sci, 2021, 64(9): 192105, doi: 10.1007/s11432-020-2973-7

计算机 图形图像 MOOP Website SpringerLink Google Scholar Supplementary Homepage Cited in SCI: 0

LotusMenu: a 3D menu using wrist and elbow rotation inspired by Chinese traditional symbol
Lyu, Fei; Liu, Yujie; Huang, Jin; Zhang, Zhaolin
Sci China Inf Sci, 2021, 64(10): 204101
Keywords: interaction; 3d menu; freehand gesture; selection technology; 3d rotation
Cite as: Lyu F, Liu Y J, Huang J, et al. LotusMenu: a 3D menu using wrist and elbow rotation inspired by Chinese traditional symbol. Sci China Inf Sci, 2021, 64(10): 204101, doi: 10.1007/s11432-020-2999-y

Special Focus on Visual Computing with Machine Learning
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Supplementary Cited in SCI: 0

Dual attention autoencoder for all-weather outdoor lighting estimation
Yu, Piaopiao; Guo, Jie; Wu, Longhai; Zhou, Cheng; Li, Mengtian; Wang, Chenchen; Guo, Yanwen
Sci China Inf Sci, 2021, 64(11): 210102
Keywords: outdoor illumination; adaptive feature pyramid; attention; autoencoder; augmented reality
Cite as: Yu P P, Guo J, Wu L H, et al. Dual attention autoencoder for all-weather outdoor lighting estimation. Sci China Inf Sci, 2021, 64(11): 210102, doi: 10.1007/s11432-021-3282-4

Special Focus on Visual Computing with Machine Learning
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Supplementary Cited in SCI: 0

Weakly supervised 2D human pose transfer
Zheng, Qian; Liu, Yajie; Lin, Zhizhao; Lischinski, Dani; Cohen-Or, Daniel; Huang, Hui
Sci China Inf Sci, 2021, 64(11): 210103
Keywords: pose transfer; weak supervision; human skeleton
Cite as: Zheng Q, Liu Y J, Lin Z Z, et al. Weakly supervised 2D human pose transfer. Sci China Inf Sci, 2021, 64(11): 210103, doi: 10.1007/s11432-021-3301-5

Special Focus on Visual Computing with Machine Learning
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

iHairRecolorer: deep image-to-video hair color transfer
Wu, Keyu; Yang, Lingchen; Fu, Hongbo; Zheng, Youyi
Sci China Inf Sci, 2021, 64(11): 210104
Keywords: hair color transfer; video manipulation; luminance map; cycle consistency
Cite as: Wu K Y, Yang L C, Fu H B, et al. iHairRecolorer: deep image-to-video hair color transfer. Sci China Inf Sci, 2021, 64(11): 210104, doi: 10.1007/s11432-021-3325-6

Special Focus on Visual Computing with Machine Learning
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Hausdorff point convolution with geometric priors
Lin, Liqiang; Huang, Pengdi; Xue, Fuyou; Xu, Kai; Cohen-Or, Daniel; Huang, Hui
Sci China Inf Sci, 2021, 64(11): 210105
Keywords: point convolution; hausdorff distance; geometric prior; deep neural network
Cite as: Lin L Q, Huang P D, Xue F Y, et al. Hausdorff point convolution with geometric priors. Sci China Inf Sci, 2021, 64(11): 210105, doi: 10.1007/s11432-021-3311-2

计算机 图形图像 REVIEW Website SpringerLink Google Scholar Cited in SCI: 0

Survey on rain removal from videos or a single image
Wang, Hong; Wu, Yichen; Li, Minghan; Zhao, Qian; Meng, Deyu
Sci China Inf Sci, 2022, 65(1): 111101
Keywords: rain removal; maximum a posterior estimation; deep learning; generalization performance; comprehensive repository
Cite as: Wang H, Wu Y C, Li M H, et al. Survey on rain removal from videos or a single image. Sci China Inf Sci, 2022, 65(1): 111101, doi: 10.1007/s11432-020-3225-9

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

AR-CNN: an attention ranking network for learning urban perception
Li, Zhetao; Chen, Ziwen; Zheng, Wei-Shi; Oh, Sangyoon; Nguyen, Kien
Sci China Inf Sci, 2022, 65(1): 112104
Keywords: ranking network; urban perception; attribute learning; attention network; colour and texture
Cite as: Li Z T, Chen Z W, Zheng W-S, et al. AR-CNN: an attention ranking network for learning urban perception. Sci China Inf Sci, 2022, 65(1): 112104, doi: 10.1007/s11432-019-2899-9

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Learning practically feasible policies for online 3D bin packing
Zhao, Hang; Zhu, Chenyang; Xu, Xin; Huang, Hui; Xu, Kai
Sci China Inf Sci, 2022, 65(1): 112105
Keywords: bin packing problem; online 3d-bpp; reinforcement learning
Cite as: Zhao H, Zhu C Y, Xu X, et al. Learning practically feasible policies for online 3D bin packing. Sci China Inf Sci, 2022, 65(1): 112105, doi: 10.1007/s11432-021-3348-6

计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 0

VNet: a versatile network to train real-time semantic segmentation models on a single GPU
Li, Wenxing; Lin, Ning; Zhang, Mingzhe; Lu, Hang; Chen, Xiaoming; Li, Xiaowei
Sci China Inf Sci, 2022, 65(3): 139105
Keywords: efficiency; machine learning; real-time systems; segmentation; light-weight
Cite as: Li W X, Lin N, Zhang M Z, et al. VNet: a versatile network to train real-time semantic segmentation models on a single GPU. Sci China Inf Sci, 2022, 65(3): 139105, doi: 10.1007/s11432-020-2971-8

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

RGBT tracking via reliable feature configuration
Tu, Zhengzheng; Pan, Wenli; Duan, Yunsheng; Tang, Jin; Li, Chenglong
Sci China Inf Sci, 2022, 65(4): 142101
Keywords: rgbt tracking; multi-modal; reliability guideline; feature configuration; correlation filter
Cite as: Tu Z Z, Pan W L, Duan Y S, et al. RGBT tracking via reliable feature configuration. Sci China Inf Sci, 2022, 65(4): 142101, doi: 10.1007/s11432-020-3160-5

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Onfocus detection: identifying individual-camera eye contact from unconstrained images
Zhang, Dingwen; Wang, Bo; Wang, Gerong; Zhang, Qiang; Zhang, Jiajia; Han, Jungong; You, Zheng
Sci China Inf Sci, 2022, 65(6): 160101
Keywords: onfocus detection; deep neural network; capsule routing; computer vision; deep learning
Cite as: Zhang D W, Wang B, Wang G R, et al. Onfocus detection: identifying individual-camera eye contact from unconstrained images. Sci China Inf Sci, 2022, 65(6): 160101, doi: 10.1007/s11432-020-3181-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

HAPNet: a head-aware pedestrian detection network associated with the affinity field
Ding, Jiali; Liu, Tie; Zhao, Yun; Yuan, Zejian; Shang, Yuanyuan
Sci China Inf Sci, 2022, 65(6): 160102
Keywords: pedestrian detection; head detection; head-aware pedestrian network; affinity module; occlusion
Cite as: Ding J L, Liu T, Zhao Y, et al. HAPNet: a head-aware pedestrian detection network associated with the affinity field. Sci China Inf Sci, 2022, 65(6): 160102, doi: 10.1007/s11432-021-3300-2

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

TransCrowd: weakly-supervised crowd counting with transformers
Liang, Dingkang; Chen, Xiwu; Xu, Wei; Zhou, Yu; Bai, Xiang
Sci China Inf Sci, 2022, 65(6): 160104
Keywords: crowd counting; visual transformer; weakly supervised; crowd analysis; transformer
Cite as: Liang D K, Chen X W, Xu W, et al. TransCrowd: weakly-supervised crowd counting with transformers. Sci China Inf Sci, 2022, 65(6): 160104, doi: 10.1007/s11432-021-3445-y

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Prototype-based classifier learning for long-tailed visual recognition
Wei, Xiu-Shen; Xu, Shu-Lin; Chen, Hao; Xiao, Liang; Peng, Yuxin
Sci China Inf Sci, 2022, 65(6): 160105
Keywords: long-tailed distribution; categorical prototype; classifier generation; classifier calibration; class imbalance
Cite as: Wei X-S, Xu S-L, Chen H, et al. Prototype-based classifier learning for long-tailed visual recognition. Sci China Inf Sci, 2022, 65(6): 160105, doi: 10.1007/s11432-021-3489-1

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 NEWS & VIEWS Website SpringerLink Google Scholar Cited in SCI: 0

Comprehensive benchmark datasets for Amharic scene text detection and recognition
DIKUBAB, Wondimu; LIANG, Dingkang; LIAO, Minghui; BAI, Xiang
Sci China Inf Sci, 2022, 65(6): 160106
Keywords: Amharic script; scene text; text detection; text recognition; Amharic text
Cite as: Dikubab W, Liang D K, Liao M H, et al. Comprehensive benchmark datasets for Amharic scene text detection and recognition. Sci China Inf Sci, 2022, 65(6): 160106, doi: 10.1007/s11432-021-3447-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 0

Human-object interaction detection via interactive visual-semantic graph learning
Wu, Tongtong; Duan, Fuqing; Chang, Liang; Lu, Ke
Sci China Inf Sci, 2022, 65(6): 160108
Keywords: human-object interaction; context modeling; graph learning; interactive graph; visual-semantic graph
Cite as: Wu T T, Duan F Q, Chang L, et al. Human-object interaction detection via interactive visual-semantic graph learning. Sci China Inf Sci, 2022, 65(6): 160108, doi: 10.1007/s11432-021-3427-2

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 0

Few-shot font style transfer with multiple style encoders
Zhang, Kejun; Zhang, Rui; Wu, Yonglin; Li, Yifei; Ling, Yonggen; Wang, Bolin; Sun, Lingyun; Li, Yingming
Sci China Inf Sci, 2022, 65(6): 160109
Keywords: font generation; image-to-image translation; gans; multi-task learning; font fusion; style transfer
Cite as: Zhang K J, Zhang R, Wu Y L, et al. Few-shot font style transfer with multiple style encoders. Sci China Inf Sci, 2022, 65(6): 160109, doi: 10.1007/s11432-021-3435-8

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar

Heterogeneous memory enhanced graph reasoning network for cross-modal retrieval
Ji Z, Chen K X, He Y Q, et al
Sci China Inf Sci, 2022, 65(7): 172104
Keywords: cross-modal retrieval; graph reasoning; memory network; visual semantic embedding; image-text retrieval; video-text retrieval
Cite as: Ji Z, Chen K X, He Y Q, et al. Heterogeneous memory enhanced graph reasoning network for cross-modal retrieval. Sci China Inf Sci, 2022, 65(7): 172104, doi: 10.1007/s11432-021-3367-y

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 0

3DPF-FBN: video inpainting by jointly 3D-patch filling and neural network refinement
Huang, Yan; Yang, Chuanchuan; Chen, Zhangyuan
Sci China Inf Sci, 2022, 65(7): 179103
Keywords: video inpainting; patch-based searching; optical flow; neural network; refinement
Cite as: Huang Y, Yang C C, Chen Z Y. 3DPF-FBN: video inpainting by jointly 3D-patch filling and neural network refinement. Sci China Inf Sci, 2022, 65(7): 179103, doi: 10.1007/s11432-019-2956-6

计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 0

Indoor layout programming via virtual navigation detectors
Fu, Qiang; Fu, Hongbo; Deng, Zhigang; Li, Xueming
Sci China Inf Sci, 2022, 65(8): 189101
Keywords: 3d modeling; indoor scene synthesis; layout programming; indoor navigation; deep reinforcement learning
Cite as: Fu Q, Fu H B, Deng Z G, et al. Indoor layout programming via virtual navigation detectors. Sci China Inf Sci, 2022, 65(8): 189101, doi: 10.1007/s11432-019-2930-x

计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Difficulty-aware bi-network with spatial attention constrained graph for axillary lymph node segmentation
Xu, Qing; Xi, Xiaoming; Meng, Xianjing; Qin, Zheyun; Nie, Xiushan; Wu, Yongjian; Zhou, Dongsheng; Qu, Yi; Li, Chenglong; Yin, Yilong
Sci China Inf Sci, 2022, 65(9): 192102
Keywords: ultrasound image; axillary lymph nodes segmentation; difficulty-aware segmentation; graph with spatial attention
Cite as: Xu Q, Xi X M, Meng X J, et al. Difficulty-aware bi-network with spatial attention constrained graph for axillary lymph node segmentation. Sci China Inf Sci, 2022, 65(9): 192102, doi: 10.1007/s11432-020-3079-8