Selected papers
Li G, Hu R, Zhang R, et al. A mapping model of spectral tilt in normal-to-Lombard speech conversion for intelligibility enhancement[J]. Multimedia Tools and Applications, :1-21. (SCI, EI, China Computer Federation (CCF) Class C journal)
Li D, Hu R, Huang W, et al. HMM-Based Person Re-identification in Large-Scale Open Scenario[M]// MultiMedia Modeling. .
Hu C, Hu R, Wang X, et al. Multi-step Coding Structure of Spatial Audio Object Coding[M]// MultiMedia Modeling. .
Chen, Wei & Hu, Ruimin & Wang, Xiaochen & Li, Dengshi. (). HRTF Representation with Convolutional Auto-encoder. MultiMedia Modeling, 605-616.
Li D, Hu R, Wang X, et al. Loudspeaker triplet selection based on low distortion within head for multichannel conversion of smart 3D home theater[J]. Concurrency and Computation: Practice and Experience, , 32(13): e4796.
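Hu Ruimin, Zhang Yahao, Li Dengshi, Wang Xiaochen, Wang Chao. A method for detecting false physical identity attributes based on order-by-order consensus computation[J]. Journal of Wuhan University (Natural Science Edition), , 66(02): 103-110. (in Chinese)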
Wu T, Hu R, Wang X, et al. Audio object coding based on optimal parameter frequency resolution[J]. Multimedia Tools and Applications, , 78(15):20723-20738. (SCI, EI, CCF Class C journal)
Zhu W, Hu R, Wang Z, et al. Deep Structural Feature Learning: Re-Identification of Similar Vehicles in Structure-Aware Map Space[C]. acm multimedia, . (EI, CCF Class A conference)
Wang X, Hu R, Wang Z, et al. Long Term Background Reference Based Satellite Video Coding[C]. international conference on acoustics speech and signal processing, : 1822-1826. (EI, CCF Class B conference)
Chen Y, Hu R, Xiao J, et al. Multisource Surveillance Video Coding by Exploiting 3D and 2D Knowledge[C]. international conference on acoustics speech and signal processing, : 1787-1791. (EI, CCF Class B conference)
Chen Y, Hu R, Xiao J, et al. Multisource surveillance video coding with synthetic reference frame[J]. Journal of Visual Communication and Image Representation, . (EI, CCF Class B journal)
Chen Y, Hu R, Xiao J, et al. Multisource surveillance video data coding with hierarchical knowledge library[J]. Multimedia Tools and Applications, , 78(11): 14705-14731. (SCI, EI, CCF Class C journal)
Ke S, Hu R, Li G, et al. Multi-speakers Speech Separation Based on Modified Attractor Points Estimation and GMM Clustering[C]. international conference on multimedia and expo, : 1414-1419. (EI, CCF Class B conference)
Xu Z, Hu R, Chen J, et al. Semisupervised Discriminant Multimanifold Analysis for Action Recognition[J]. IEEE Transactions on Neural Networks and Learning Systems, :1-12. (EI, CCF Class B journal)
Zhang R, Hu R, Li G, et al. Spectral Tilt Estimation for Speech Intelligibility Enhancement Using RNN Based on All-Pole Model[C]. conference on multimedia modeling, : 144-156.
Lu S, Hu R, Liu J, et al. Structure Preserving Convolutional Attention for Image Captioning[J]. Applied Sciences, , 9(14).
Zhang M, Hu R, Jiang L, et al. Three‐dimensional sound reproduction in vehicle based on data mining technique[J]. Concurrency and Computation: Practice and Experience, , 31(4).
Li Q, Hu R, Chen Y, et al. Vehicle Pose Estimation Using Mask Matching[C]. international conference on acoustics speech and signal processing, : 1972-1976. (EI, CCF Class B conference)
Li G, Hu R, Wang X, et al. A near-end listening enhancement system by RNN-based noise cancellation and speech modification[J]. Multimedia Tools and Applications, , 78(11): 15483-15505. (SCI, EI, CCF Class C journal)
Ding X, Hu R, Han Z, et al. A novel frontal facial synthesis algorithm based on individual residual face[C]//International Conference on Multimedia Modeling. Springer, Cham, : 14-22. (EI)
Liao L, Hu R, Xiao J, et al. Edge-aware context encoder for image inpainting[C]// IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, : 3156-3160. (EI)
Li C, Hu R, Liang C, et al. Faster seam carving for video retargeting[C]// 25th IEEE International Conference on Image Processing (ICIP). IEEE, : 823-827. (EI, CCF Class C conference)
Wang X, Hu R, Xiao J. Frame Rate Conversion Based High Efficient Compression Method for Video Satellite[C]//Pacific Rim Conference on Multimedia. Springer, Cham, : 35-44. (EI, CCF Class C conference)
Chen W, Hu R, Wang X, et al. Individualization of head related impulse responses using division analysis[J]. China Communications, , 15(5): 92-103. (SCI)
Huang Z, Hu R, Thierry B, et al. Multi-feature fusion based background subtraction for video sequences with strong background changes[C]// IEEE International Conference on Image Processing (ICIP). IEEE, : 3370-3374.
Wang Z, Hu R, Chen C, et al. Person reidentification via discrepancy matrix and matrix metric[J]. IEEE Transactions on Cybernetics, , 48(10): 3006-3020. (CCF Class B journal, highly cited)
Wang Z, Hu R, Yu Y, et al. Statistical Inference of Gaussian-Laplace Distribution for Person Verification[C]. acm multimedia, : 1609-1617. (EI, CCF Class A conference)
Jing X Y, Zhu X, Wu F, et al. Super-Resolution Person Re-Identification With Semi-Coupled Low-Rank Discriminant Dictionary Learning[J]. IEEE Transactions on Image Processing, , 26(3):1363-1378. (SCI, CCF Class A journal)
Wu T, Hu R, Wang X, et al. High quality audio object coding framework based on non-negative matrix factorization[J]. China Communications, , 14(9): 32-41.
Jiang J, Hu R, Wang Z, et al. Facial Image Hallucination Through Coupled-Layer Neighbor Embedding[J]. IEEE Transactions on Circuits and Systems for Video Technology, , 26(9): 1674-1684.
Wang Z, Hu R, Yu Y, et al. Taichi distance for person re-identification[C]. international conference on acoustics, speech, and signal processing, : 2052-2056. (EI, CCF Class C conference)
Li Q, Hu R, Chen Y, et al. A Fine-Grained Filtered Viewpoint Informed Keypoint Prediction from 2D Images[C]. pacific rim conference on multimedia, : 172-181.
Wang S, Hu R, Chen S, et al. 3D Sound Field Reproduction at Non Central Point for NHK 22.2 System[C]. conference on multimedia modeling, : 3-14.
Huang W, Hu R, Liang C, et al. Structural superpixel descriptor for visual tracking[C]. international joint conference on neural network, : 3146-3152.
Chen L, Hu R, Han Z, et al. A joint learning based Face Super Resolution approach via contextual topological structure[C]. international conference on acoustics, speech, and signal processing, : 1088-1092. (EI, CCF Class C conference)
Wang S, Hu R, Chen S, et al. Sound physical property matching between non central listening point and central listening point for NHK 22.2 system reproduction[C]. international conference on acoustics, speech, and signal processing, : 436-440. (EI, CCF Class C conference)
Hu R, Bao C, Zhao Q, et al. Recent development of speech and audio signal processing in network communication[J]. China Communications, , 14(9).
Huang K, Hu R, Jiang J, et al. HRM graph constrained dictionary learning for face image super-resolution[J]. Multimedia Tools and Applications, , 76(2): 3139-3162. (SCI, EI, CCF Class C journal)
Chen L, Hu R, Han Z, et al. Face super resolution based on parent patch prior for VLQ scenarios[J]. Multimedia Tools and Applications, , 76(7): 10231-10254. (SCI, EI, CCF Class C journal)
Chen H, Chen J, Hu R, et al. Action recognition with temporal scale-invariant deep learning framework[J]. China Communications, , 14(2): 163-172.
Chen L, Hu R, Liang C, et al. A novel face super resolution approach for noisy images using contour feature and standard deviation prior[J]. Multimedia Tools and Applications, , 76(2): 2467-2493. (SCI, EI, CCF Class C journal)
Wang Z, Hu R, Yu Y, et al. Scale-adaptive low-resolution person re-identification via learning a discriminating surface[C]. international joint conference on artificial intelligence, : 2669-2675. (EI, CCF Class A conference)
Wu F, Jing X, You X, et al. Multi-view low-rank dictionary learning for image classification[J]. Pattern Recognition, : 143-154. (EI, CCF Class B journal)
Ruan W, Chen J, Wang J, et al. Boosted local classifiers for visual tracking[C]// IEEE International Conference on Multimedia & Expo. IEEE Computer Society, . (EI, CCF Class B conference)
Gao L, Hu R, Wang X, et al. JND-based spatial parameter quantization of multichannel audio signals[J]. EURASIP Journal on Audio, Speech, and Music Processing, , (1). (Class A journal)
Xiao J, Hu R, Liao L, et al. Knowledge-Based Coding of Objects for Multisource Surveillance Video Data[J]. IEEE Transactions on Multimedia, , 18(9): 1691-1706.
Xiong M, Chen J, Wang Z, et al. Person Re-Identification via Multiple Coarse-to-Fine Deep Metrics[C]. european conference on artificial intelligence, : 355-362. (EI, CCF Class B conference)
Li D, Hu R, Wang X, et al. Multichannel reduction based on sound field within two ears[C]. international conference on multimedia and expo, : 1-6. (EI, CCF Class B conference)
Liao L, Hu R, Xiao J, et al. An Analysis-Oriented ROI Based Coding Approach on Surveillance Video Data[C]. pacific rim conference on multimedia, : 428-438.
Jiang L, Hu R, Wang X, et al. Audio Bandwidth Extension Using Audio Super-Resolution[C]. pacific rim conference on multimedia, : 540-549.
Wu T, Hu R, Gao L, et al. Analysis and Comparison of Inter-Channel Level Difference and Interaural Level Difference[C]. conference on multimedia modeling, : 586-595.
Wang Z, Hu R, Liang C, et al. Zero-Shot Person Re-identification via Cross-View Consistency[J]. IEEE Transactions on Multimedia, , 18(2): 260-272.(EI)
Xu Z, Hu R, Chen J, et al. Global Contrast Based Salient Region Boundary Sampling for Action Recognition[C]. conference on multimedia modeling, : 187-198.
Jiang J, Hu R, Wang Z, et al. CDMMA: Coupled discriminant multi-manifold analysis for matching low-resolution face images[J]. Signal Processing, : 162-172. (SCI, CCF Class C journal)
Huang W, Hu R, Liang C, et al. Camera Network Based Person Re-identification by Leveraging Spatial-Temporal Constraint and Multiple Cameras Relations[C]. conference on multimedia modeling, : 174-186.
Huang K, Hu R, Jiang J, et al. Face Image Super-Resolution Through Improved Neighbor Embedding[C]. conference on multimedia modeling, : 409-420.
Zhang L, Hu R, Li D, et al. Adaptive Multichannel Reduction Using Convex Polyhedral Loudspeaker Array[C]. conference on multimedia modeling, : 421-431.
Yang Y, Wang Y, Hu R, et al. Level Ratio Based Inter and Intra Channel Prediction with Application to Stereo Audio Frame Loss Concealment[C]. conference on multimedia modeling, : 654-661.
Wang Z, Hu R, Yu Y, et al. Multi-Level Fusion for Person Re-identification with Incomplete Marks[C]. acm multimedia, : 1267-1270. (EI, CCF Class A conference)
Wang Z, Hu R, Liang C, et al. Person Re-identification Using Data-Driven Metric Adaptation[C]. conference on multimedia modeling, : 195-207.
Wang S, Hu R, Chen S, et al. 3D Panning Based Sound Field Enhancement Method for Ambisonics[C]. pacific rim conference on multimedia, : 135-145.
Wang S, Hu R, Chen S, et al. A down-mixing method for 22.2 multichannel system reproduction[C]. international conference on acoustics, speech, and signal processing, : 634-638. (EI, CCF Class C conference)
Zhang M, Hu R, Chen S, et al. Spatial perception reproduction of sound events based on sound property coincidences[C]. international conference on multimedia and expo, : 1-6. (EI, CCF Class B conference)
Yin L, Hu R, Chen S, et al. A Block-Based Background Model for Surveillance Video Coding[C]. data compression conference, : 476-476. (EI, CCF Class B conference)
Hu J, Hu R, Chen Y, et al. Joint Weighted Sparse Representation Based Median Filter for Depth Video Coding[C]. data compression conference, : 450-450. (EI, CCF Class B conference)
Gao L, Hu R, Yang Y, et al. Azimuthal Perceptual Resolution Model Based Adaptive 3D Spatial Parameter Coding[C]. conference on multimedia modeling, : 534-545.
Jiang L, Hu R, Wang X, et al. Low Bitrates Audio Bandwidth Extension Using a Deep Auto-Encoder[C]. pacific rim conference on multimedia, : 528-537.
Yang C, Hu R, Su L, et al. Multi-channel Object-Based Spatial Parameter Compression Approach for 3D Audio[C]. pacific rim conference on multimedia, : 354-364.
Li D, Hu R, Wang X, et al. Multichannel Simplification Based on Deviation of Loudspeaker Positions[C]. advances in multimedia, : 544-553.
Xie S, Yang Y, Hu R, et al. Signal-Aware Parametric Quality Model for Audio and Speech over IP Networks[C]. conference on multimedia modeling, : 487-497.
Xiao J, Liao L, Hu J, et al. Exploiting global redundancy in big surveillance video data for efficient coding[J]. Cluster Computing, , 18(2): 531-540.
Xiao J, Chen Y, Liao L, et al. Global Coding of Multi-source Surveillance Video Data[C]. data compression conference, : 33-42. (EI, CCF Class B conference)
Zhong R, Hu R, Wang Z, et al. 3D hybrid just noticeable distortion modeling for depth image-based rendering[J]. Multimedia Tools and Applications, , 74(23): 10457-10478. (SCI, EI, CCF Class C journal)
Liao L, Hu R, Xiao J, et al. Exploiting effects of parts in fine-grained categorization of vehicles[C]. international conference on image processing, : 745-749.
Xu Z, Hu R, Chen J, et al. How much bandwidth does surveillance system require[C]. international conference on image processing, : 1762-1766. (EI, CCF Class C conference)
Jing X, Zhu X, Wu F, et al. Super-resolution Person re-identification with semi-coupled low-rank discriminant dictionary learning[C]. computer vision and pattern recognition, : 695-704. (EI, CCF Class A conference)
Qu S, Hu R, Chen S, et al. Face hallucination via Cauchy regularized sparse representation[C]. international conference on acoustics, speech, and signal processing, : 1216-1220. (EI, CCF Class C conference)
Jiang J, Hu R, Han Z, et al. Coupled Discriminant Multi-Manifold Analysis with Application to Low-Resolution Face Recognition[C]. conference on multimedia modeling, : 37-48. (EI, CCF Class C conference)
Jiang J, Hu R, Wang Z, et al. Face Super-Resolution via Multilayer Locality-Constrained Iterative Neighbor Embedding and Intermediate Dictionary Learning[J]. IEEE Transactions on Image Processing, , 23(10): 4220-4231. (SCI, CCF Class A journal)
Jiang J, Hu R, Han Z, et al. Low-Resolution and Low-Quality Face Super-Resolution in Monitoring Scene via Support-Driven Sparse Coding[J]. Journal of Signal Processing Systems, , 75(3): 245-256. (SCI)
Hu J, Hu R, Wang Z, et al. Adaptive Learning Based View Synthesis Prediction for Multi-View Video Coding[J]. Journal of Signal Processing Systems, , 74(1): 115-126. (SCI)
Jiang J, Hu R, Wang Z, et al. Noise Robust Face Hallucination via Locality-Constrained Representation[J]. IEEE Transactions on Multimedia, , 16(5):1268-1281. (SCI, CCF Class C conference)
Huang Z, Hu R, Wang Z, et al. Background Subtraction With Video Coding[J]. IEEE Signal Processing Letters, , 20(11): 1058-1061. (SCI)
Gao L, Hu R, Yang Y, et al. A spatial priority based scalable audio coding[C]. international conference on acoustics speech and signal processing, : 3670-3674. (EI, CCF Class B conference)
Leng Q, Hu R, Liang C, et al. Bidirectional ranking for person re-identification[C]. international conference on multimedia and expo, : 1-6. (EI, CCF Class B conference)
Wang Y, Hu R, Liang C, et al. Camera compensation using feature projection matrix for person re-identification[C]. international conference on multimedia and expo, : 1-6. (EI, CCF Class B conference)
Lan C, Hu R, Huang K, et al. Face hallucination with shape parameters projection constraint[C]. acm multimedia, : 883-886. (EI, CCF Class A conference)
Chen H, Hu R, Mao D, et al. Video coding using dynamic texture synthesis[C]. international conference on multimedia and expo, : 203-208. (EI, CCF Class B conference)
Chen H, Hu R, Hu J, et al. Temporal color Just Noticeable Distortion model and its application for video coding[C]. international conference on multimedia and expo, : 713-718. (EI, CCF Class B conference)
Hu R, Hang B, Ma Y, et al. A bottom-up audio attention model for surveillance[C]. international conference on multimedia and expo, : 564-567. (EI, CCF Class B conference)
Books and Edited Books
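Multimedia Source Coding Technology and Security Surveillance Emergency Systems, Hu Ruimin, Hubei Science and Technology Press,
AVS Technology Innovation Report (2002-), Audio Video Coding Standard (AVS) Workgroup of China, Posts & Telecom Press,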