Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 21

Uncertainty-optimized deep learning model for small-scale person re-identification
Zhao, Cairong; Chen, Kang; Zang, Di; Zhang, Zhaoxiang; Zuo, Wangmeng; Miao, Duoqian
Sci China Inf Sci, 2019, 62(12): 220102
Keywords: person re-identification; uncertainty analysis; deep learning
Cite as: Zhao C R, Chen K, Zang D, et al. Uncertainty-optimized deep learning model for small-scale person re-identification. Sci China Inf Sci, 2019, 62(12): 220102, doi: 10.1007/s11432-019-2675-3

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 19

MDSSD: multi-scale deconvolutional single shot detector for small objects
Cui, Lisha; Ma, Rui; Lv, Pei; Jiang, Xiaoheng; Gao, Zhimin; Zhou, Bing; Xu, Mingliang
Sci China Inf Sci, 2020, 63(2): 120113
Keywords: object detection; small objects; multi-scale deconvolution; fusion block; real-time
Cite as: Cui L S, Ma R, Lv P, et al. MDSSD: multi-scale deconvolutional single shot detector for small objects. Sci China Inf Sci, 2020, 63(2): 120113, doi: 10.1007/s11432-019-2723-1

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 14

PSC-Net: learning part spatial co-occurrence for occluded pedestrian detection
Xie, Jin; Pang, Yanwei; Cholakkal, Hisham; Anwer, Rao; Khan, Fahad; Shao, Ling
Sci China Inf Sci, 2021, 64(2): 120103
Keywords: pedestrian detection; graph convolutional network; occlusion; object detection; feature extraction
Cite as: Xie J, Pang Y W, Cholakkal H, et al. PSC-Net: learning part spatial co-occurrence for occluded pedestrian detection. Sci China Inf Sci, 2021, 64(2): 120103, doi: 10.1007/s11432-020-2969-8

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 14

Task-wise attention guided part complementary learning for few-shot image classification
Cheng, Gong; Li, Ruimin; Lang, Chunbo; Han, Junwei
Sci China Inf Sci, 2021, 64(2): 120104
Keywords: few-shot learning; meta-learning; task-wise attention; part complementary learning
Cite as: Cheng G, Li R M, Lang C B, et al. Task-wise attention guided part complementary learning for few-shot image classification. Sci China Inf Sci, 2021, 64(2): 120104, doi: 10.1007/s11432-020-3156-7

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 13

CGNet: cross-guidance network for semantic segmentation
Zhang, Zhijie; Pang, Yanwei
Sci China Inf Sci, 2020, 63(2): 120104
Keywords: semantic segmentation; fully convolutional networks; pyramid network; edge detection; saliency detection; cross-guidance
Cite as: Zhang Z J, Pang Y W. CGNet: cross-guidance network for semantic segmentation. Sci China Inf Sci, 2020, 63(2): 120104, doi: 10.1007/s11432-019-2718-7

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 12

FACLSTM: ConvLSTM with focused attention for scene text recognition
Wang, Qingqing; Huang, Ye; Jia, Wenjing; He, Xiangjian; Blumenstein, Michael; Lyu, Shujing; Lu, Yue
Sci China Inf Sci, 2020, 63(2): 120103
Keywords: scene text recognition; convolutional lstm; focused attention; spatial correlation; sequential prediction
Cite as: Wang Q Q, Huang Y, Jia W J, et al. FACLSTM: ConvLSTM with focused attention for scene text recognition. Sci China Inf Sci, 2020, 63(2): 120103, doi: 10.1007/s11432-019-2713-1

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 10

Preserving details in semantics-aware context for scene parsing
Ma, Shuai; Pang, Yanwei; Pan, Jing; Shao, Ling
Sci China Inf Sci, 2020, 63(2): 120106
Keywords: fully convolutional networks; semantic segmentation; cityscapes; semantic-aware context
Cite as: Ma S, Pang Y W, Pan J, et al. Preserving details in semantics-aware context for scene parsing. Sci China Inf Sci, 2020, 63(2): 120106, doi: 10.1007/s11432-019-2738-y

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 9

Irregular scene text detection via attention guided border labeling
Chen, Jie; Lian, Zhouhui; Wang, Yizhi; Tang, Yingmin; Xiao, Jianguo
Sci China Inf Sci, 2019, 62(12): 220103
Keywords: scene text detection; weighted border; attention mechanisms; curved text; semantic segmentation
Cite as: Chen J, Lian Z H, Wang Y Z, et al. Irregular scene text detection via attention guided border labeling. Sci China Inf Sci, 2019, 62(12): 220103, doi: 10.1007/s11432-019-2673-8

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 8

Triple discriminator generative adversarial network for zero-shot image classification
Ji, Zhong; Yan, Jiangtao; Wang, Qiang; Pang, Yanwei; Li, Xuelong
Sci China Inf Sci, 2021, 64(2): 120101
Keywords: zero-shot classification; generative adversarial nets; text reconstruction; sharma-mittal entropy
Cite as: Ji Z, Yan J T, Wang Q, et al. Triple discriminator generative adversarial network for zero-shot image classification. Sci China Inf Sci, 2021, 64(2): 120101, doi: 10.1007/s11432-020-3032-8

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 7

Feature context learning for human parsing
Huang, Tengteng; Xu, Yongchao; Bai, Song; Wang, Yongpan; Bai, Xiang
Sci China Inf Sci, 2019, 62(12): 220101
Keywords: human parsing; context learning; fully convolutional networks; graph convolutional network; semantic segmentation
Cite as: Huang T T, Xu Y C, Bai S, et al. Feature context learning for human parsing. Sci China Inf Sci, 2019, 62(12): 220101, doi: 10.1007/s11432-019-9935-6

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 7

Ordinal distribution regression for gait-based age estimation
Zhu, Haiping; Zhang, Yuheng; Li, Guohao; Zhang, Junping; Shan, Hongming
Sci China Inf Sci, 2020, 63(2): 120102
Keywords: computer vision; deep learning; ordinal distribution regression; global and local features; gait-based age estimation
Cite as: Zhu H P, Zhang Y H, Li G H, et al. Ordinal distribution regression for gait-based age estimation. Sci China Inf Sci, 2020, 63(2): 120102, doi: 10.1007/s11432-019-2733-4

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 6

Leveraging 3D blendshape for facial expression recognition using CNN
Wang, Sa; Cheng, Zhengxin; Deng, Xiaoming; Chang, Liang; Duan, Fuqing; Lu, Ke
Sci China Inf Sci, 2020, 63(2): 120114
Keywords: facial expression recognition; convolutional neural network; two-stream network; 3d face blendshape; face representation
Cite as: Wang S, Cheng Z X, Deng X M, et al. Leveraging 3D blendshape for facial expression recognition using CNN. Sci China Inf Sci, 2020, 63(2): 120114, doi: 10.1007/s11432-019-2747-y

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 5

SynthText3D: synthesizing scene text images from 3D virtual worlds
Liao, Minghui; Song, Boyu; Long, Shangbang; He, Minghang; Yao, Cong; Bai, Xiang
Sci China Inf Sci, 2020, 63(2): 120105
Keywords: optical character recognition (ocr); synthetic data; scene text detection; 3d; deep learning
Cite as: Liao M H, Song B Y, Long S B, et al. SynthText3D: synthesizing scene text images from 3D virtual worlds. Sci China Inf Sci, 2020, 63(2): 120105, doi: 10.1007/s11432-019-2737-0

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 EDITORIAL Website SpringerLink Google Scholar Cited in SCI: 3

Special focus on deep learning for computer vision
Pang, Yanwei; Bai, Xiang; Zhang, Guofeng
Sci China Inf Sci, 2019, 62(12): 220100
Cite as: Pang Y W, Bai X, Zhang G F. Special focus on deep learning for computer vision. Sci China Inf Sci, 2019, 62(12): 220100, doi: 10.1007/s11432-019-2701-8

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 3

ARPNET: attention region proposal network for 3D object detection
Ye, Yangyang; Zhang, Chi; Hao, Xiaoli
Sci China Inf Sci, 2019, 62(12): 220104
Keywords: attention; regional proposal network; shape-specific proposals; lidar-based; 3d object detection
Cite as: Ye Y Y, Zhang C, Hao X L. ARPNET: attention region proposal network for 3D object detection. Sci China Inf Sci, 2019, 62(12): 220104, doi: 10.1007/s11432-019-2636-x

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 3

Learning generalizable deep feature using triplet-batch-center loss for person re-identification
Hu, Bin; Xu, Jiwei; Wang, Xinggang
Sci China Inf Sci, 2021, 64(2): 120111
Keywords: person re-identification; triplet loss; triplet-batch-center loss; metric learning; deep learning
Cite as: Hu B, Xu J W, Wang X G. Learning generalizable deep feature using triplet-batch-center loss for person re-identification. Sci China Inf Sci, 2021, 64(2): 120111, doi: 10.1007/s11432-019-2943-6

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Supplementary Cited in SCI: 2

Multi-attention based cross-domain beauty product image retrieval
Wang, Zhihui; Liu, Xing; Lin, Jiawen; Yang, Caifei; Li, Haojie
Sci China Inf Sci, 2020, 63(2): 120112
Keywords: beauty product image retrieval; saliency attention mechanism; text attention mechanism; local feature aggregation; multi-attention classification network
Cite as: Wang Z H, Liu X, Lin J W, et al. Multi-attention based cross-domain beauty product image retrieval. Sci China Inf Sci, 2020, 63(2): 120112, doi: 10.1007/s11432-019-2721-0

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 2

RLLNet: a lightweight remaking learning network for saliency redetection on RGB-D images
Zhou, Wujie; Liu, Chang; Lei, Jingsheng; Yu, Lu
Sci China Inf Sci, 2022, 65(6): 160107
Keywords: deep learning; rgb-d image; saliency detection; remaking learning; lightweight network
Cite as: Zhou W J, Liu C, Lei J S, et al. RLLNet: a lightweight remaking learning network for saliency redetection on RGB-D images. Sci China Inf Sci, 2022, 65(6): 160107, doi: 10.1007/s11432-020-3337-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 EDITORIAL Website SpringerLink Google Scholar Cited in SCI: 1

Special focus on deep learning for computer vision
Bai, Xiang; Pang, Yanwei; Zhang, Guofeng
Sci China Inf Sci, 2020, 63(2): 120100
Cite as: Bai X, Pang Y W, Zhang G F. Special focus on deep learning for computer vision. Sci China Inf Sci, 2020, 63(2): 120100, doi: 10.1007/s11432-020-2766-x

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Progressive rectification network for irregular text recognition
Gao, Yunze; Chen, Yingying; Wang, Jinqiao; Lu, Hanqing
Sci China Inf Sci, 2020, 63(2): 120101
Keywords: irregular text recognition; progressive rectification; iterative refinement
Cite as: Gao Y Z, Chen Y Y, Wang J Q, et al. Progressive rectification network for irregular text recognition. Sci China Inf Sci, 2020, 63(2): 120101, doi: 10.1007/s11432-019-2710-7

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 1

Discriminative stacked autoencoder for feature representation and classification
Gao, Yiping; Li, Xinyu; Gao, Liang
Sci China Inf Sci, 2020, 63(2): 120111
Keywords: stacked autoencoders; discriminative feature representation; hybrid pretraining; classification; deep learning
Cite as: Gao Y P, Li X Y, Gao L. Discriminative stacked autoencoder for feature representation and classification. Sci China Inf Sci, 2020, 63(2): 120111, doi: 10.1007/s11432-019-2722-3

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Learning efficient text-to-image synthesis via interstage cross-sample similarity distillation
Mao, Fengling; Ma, Bingpeng; Chang, Hong; Shan, Shiguang; Chen, Xilin
Sci China Inf Sci, 2021, 64(2): 120102
Keywords: generative adversarial network; gan; text-to-image synthesis; knowledge distillation
Cite as: Mao F L, Ma B P, Chang H, et al. Learning efficient text-to-image synthesis via interstage cross-sample similarity distillation. Sci China Inf Sci, 2021, 64(2): 120102, doi: 10.1007/s11432-020-2900-x

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 1

Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
Wang, Yang; Peng, Jinjia; Wang, Huibing; Wang, Meng
Sci China Inf Sci, 2022, 65(6): 160103
Keywords: data adaptation module; weighted label smoothing loss; multi-scale attention network; vehicle re-identification
Cite as: Wang Y, Peng J J, Wang H B, et al. Progressive learning with multi-scale attention network for cross-domain vehicle re-identification. Sci China Inf Sci, 2022, 65(6): 160103, doi: 10.1007/s11432-021-3383-y

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Onfocus detection: identifying individual-camera eye contact from unconstrained images
Zhang, Dingwen; Wang, Bo; Wang, Gerong; Zhang, Qiang; Zhang, Jiajia; Han, Jungong; You, Zheng
Sci China Inf Sci, 2022, 65(6): 160101
Keywords: onfocus detection; deep neural network; capsule routing; computer vision; deep learning
Cite as: Zhang D W, Wang B, Wang G R, et al. Onfocus detection: identifying individual-camera eye contact from unconstrained images. Sci China Inf Sci, 2022, 65(6): 160101, doi: 10.1007/s11432-020-3181-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

HAPNet: a head-aware pedestrian detection network associated with the affinity field
Ding, Jiali; Liu, Tie; Zhao, Yun; Yuan, Zejian; Shang, Yuanyuan
Sci China Inf Sci, 2022, 65(6): 160102
Keywords: pedestrian detection; head detection; head-aware pedestrian network; affinity module; occlusion
Cite as: Ding J L, Liu T, Zhao Y, et al. HAPNet: a head-aware pedestrian detection network associated with the affinity field. Sci China Inf Sci, 2022, 65(6): 160102, doi: 10.1007/s11432-021-3300-2

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

TransCrowd: weakly-supervised crowd counting with transformers
Liang, Dingkang; Chen, Xiwu; Xu, Wei; Zhou, Yu; Bai, Xiang
Sci China Inf Sci, 2022, 65(6): 160104
Keywords: crowd counting; visual transformer; weakly supervised; crowd analysis; transformer
Cite as: Liang D K, Chen X W, Xu W, et al. TransCrowd: weakly-supervised crowd counting with transformers. Sci China Inf Sci, 2022, 65(6): 160104, doi: 10.1007/s11432-021-3445-y

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 RESEARCH PAPER Website SpringerLink Google Scholar Cited in SCI: 0

Prototype-based classifier learning for long-tailed visual recognition
Wei, Xiu-Shen; Xu, Shu-Lin; Chen, Hao; Xiao, Liang; Peng, Yuxin
Sci China Inf Sci, 2022, 65(6): 160105
Keywords: long-tailed distribution; categorical prototype; classifier generation; classifier calibration; class imbalance
Cite as: Wei X-S, Xu S-L, Chen H, et al. Prototype-based classifier learning for long-tailed visual recognition. Sci China Inf Sci, 2022, 65(6): 160105, doi: 10.1007/s11432-021-3489-1

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 NEWS & VIEWS Website SpringerLink Google Scholar Cited in SCI: 0

Comprehensive benchmark datasets for Amharic scene text detection and recognition
DIKUBAB, Wondimu; LIANG, Dingkang; LIAO, Minghui; BAI, Xiang
Sci China Inf Sci, 2022, 65(6): 160106
Keywords: Amharic script; scene text; text detection; text recognition; Amharic text
Cite as: Dikubab W, Liang D K, Liao M H, et al. Comprehensive benchmark datasets for Amharic scene text detection and recognition. Sci China Inf Sci, 2022, 65(6): 160106, doi: 10.1007/s11432-021-3447-9

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 0

Human-object interaction detection via interactive visual-semantic graph learning
Wu, Tongtong; Duan, Fuqing; Chang, Liang; Lu, Ke
Sci China Inf Sci, 2022, 65(6): 160108
Keywords: human-object interaction; context modeling; graph learning; interactive graph; visual-semantic graph
Cite as: Wu T T, Duan F Q, Chang L, et al. Human-object interaction detection via interactive visual-semantic graph learning. Sci China Inf Sci, 2022, 65(6): 160108, doi: 10.1007/s11432-021-3427-2

Special Focus on Deep Learning for Computer Vision
计算机 图形图像 LETTER Website SpringerLink Google Scholar Cited in SCI: 0

Few-shot font style transfer with multiple style encoders
Zhang, Kejun; Zhang, Rui; Wu, Yonglin; Li, Yifei; Ling, Yonggen; Wang, Bolin; Sun, Lingyun; Li, Yingming
Sci China Inf Sci, 2022, 65(6): 160109
Keywords: font generation; image-to-image translation; gans; multi-task learning; font fusion; style transfer
Cite as: Zhang K J, Zhang R, Wu Y L, et al. Few-shot font style transfer with multiple style encoders. Sci China Inf Sci, 2022, 65(6): 160109, doi: 10.1007/s11432-021-3435-8