Conviqt: Contrastive video quality estimator
Madhusudana et al., 2023
- Document ID
- 5319198190512214837
- Author
- Madhusudana P
- Birkbeck N
- Wang Y
- Adsumilli B
- Bovik A
- Publication year
- 2023
- Publication venue
- IEEE Transactions on Image Processing
Snippet
Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms. Here we consider the problem of learning perceptually relevant video quality representations in a self-supervised manner. Distortion type identification and …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G06K9/00711—Recognising video content, e.g. extracting audiovisual features from movies, extracting representative key-frames, discriminating news vs. sport content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/40—Analysis of texture
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N17/00—Diagnosis, testing or measuring for television systems or their details
- H04N17/004—Diagnosis, testing or measuring for television systems or their details for digital television systems
Similar Documents
Publication | Title |
---|---|
Bampis et al. | Spatiotemporal feature integration and model fusion for full reference video quality assessment |
Tu et al. | UGC-VQA: Benchmarking blind video quality assessment for user generated content |
Madhusudana et al. | Conviqt: Contrastive video quality estimator |
Tu et al. | RAPIQUE: Rapid and accurate video quality prediction of user generated content |
Madhusudana et al. | Image quality assessment using contrastive learning |
Sun et al. | MC360IQA: A multi-channel CNN for blind 360-degree image quality assessment |
Li et al. | No-reference video quality assessment with 3D shearlet transform and convolutional neural networks |
Li et al. | No-reference image quality assessment with deep convolutional neural networks |
Wang et al. | Information content weighting for perceptual image quality assessment |
Lu et al. | Deep neural network for blind visual quality assessment of 4K content |
Duanmu et al. | Quantifying visual image quality: A bayesian view |
He et al. | A visual residual perception optimized network for blind image quality assessment |
Hou et al. | A perceptual quality metric for video frame interpolation |
Tu et al. | Efficient user-generated video quality prediction |
Wang | A survey on IQA |
Zheng et al. | Faver: Blind quality prediction of variable frame rate videos |
Wang et al. | Perceptually quasi-lossless compression of screen content data via visibility modeling and deep forecasting |
Athar et al. | Degraded reference image quality assessment |
Chen et al. | GAMIVAL: Video quality prediction on mobile cloud gaming content |
Saha et al. | Perceptual video quality assessment: The journey continues! |
Liu et al. | Combined CNN/RNN video privacy protection evaluation method for monitoring home scene violence |
Qiu et al. | Blind 360-degree image quality assessment via saliency-guided convolution neural network |
Shen et al. | A Blind Video Quality Assessment Method via Spatiotemporal Pyramid Attention |
Goodall et al. | Blind picture upscaling ratio prediction |
Da et al. | Perceptual quality assessment of nighttime video |