TWI524739B - Sample adaptive offset (sao) in accordance with video coding - Google Patents
Sample adaptive offset (sao) in accordance with video coding Download PDFInfo
- Publication number
- TWI524739B TWI524739B TW102105283A TW102105283A TWI524739B TW I524739 B TWI524739 B TW I524739B TW 102105283 A TW102105283 A TW 102105283A TW 102105283 A TW102105283 A TW 102105283A TW I524739 B TWI524739 B TW I524739B
- Authority
- TW
- Taiwan
- Prior art keywords
- video signal
- frequency band
- signal
- communication device
- video
- Prior art date
Links
Landscapes
- Compression Or Coding Systems Of Tv Signals (AREA)
Description
本發明總體涉及數位視頻處理;更具體地,涉及根據該數位視頻處理的處理及操作。 The present invention generally relates to digital video processing; and more particularly to processing and operations in accordance with the digital video processing.
操作用於傳輸數位媒體(例如,圖像、視頻、數據等)的通訊系統已經持續開發了多年。對這種採用某種形式的視頻數據的通訊系統而言,以一定的幀速率(例如,每秒幀數)輸出或顯示多個數位圖像以實現適用於輸出和消費的視頻訊號。在利用視頻數據進行操作的多個這樣的通訊系統中,在吞吐量(例如,可以從第一位置傳輸至第二位置的圖像幀的數量)與最終要被輸出或顯示的訊號的視頻和/或圖像質量之間存在著折中。當前技術無法充分地或可接受地提供以下手段:即,其可以根據提供充足的或可接受的視頻和/或圖像質量將視頻數據從第一位置傳輸至第二位置,從而保證與通訊相關聯的相對少量的開銷,以及各個通訊鏈路端的通訊設備的相對較低的複雜度等。 Communication systems that operate to transfer digital media (eg, images, video, data, etc.) have been in development for many years. For such a communication system employing some form of video data, a plurality of digital images are output or displayed at a certain frame rate (for example, frames per second) to realize video signals suitable for output and consumption. In a plurality of such communication systems operating with video data, the video and the throughput (e.g., the number of image frames that can be transmitted from the first location to the second location) and the signal that is ultimately to be output or displayed / or there is a compromise between image quality. The current technology does not provide sufficient or acceptable means to provide video data from a first location to a second location based on providing sufficient or acceptable video and/or image quality to ensure communication-related A relatively small amount of overhead, and relatively low complexity of communication devices at each communication link end.
(1)一種通訊設備裝置,包括:至少一個輸入端,用於:從至少一個額外裝置接收視頻訊號;以及經由信令從所述至少一個額外裝置接收多個頻帶偏移;以及處理器,用於:分析與至少一個最大編碼單元(LCU)相關的多個畫素以識 別畫素值分佈,其中,所述最大編碼單元與所述視頻訊號相關;基於所述畫素值分佈推理地識別多個頻帶指數;根據所述視頻訊號或基於所述視頻訊號的訊號的濾波處理將所述多個頻帶偏移應用到所述多個頻帶指數;對所述視頻訊號或基於所述視頻訊號的訊號執行採樣點自適應偏移(SAO)濾波處理以生成第一濾波訊號,其中,所述採樣點自適應偏移濾波處理將所述多個頻帶偏移應用到所述多個頻帶指數;以及對所述第一濾波訊號執行去塊濾波處理以生成第二濾波訊號。 (1) A communication device apparatus comprising: at least one input for receiving a video signal from at least one additional device; and receiving a plurality of band offsets from the at least one additional device via signaling; and a processor for And analyzing: a plurality of pixels related to at least one maximum coding unit (LCU) Generating a prime value distribution, wherein the maximum coding unit is related to the video signal; inferringly identifying a plurality of frequency band indices based on the pixel value distribution; and filtering according to the video signal or a signal based on the video signal Processing, applying the plurality of frequency band offsets to the plurality of frequency band indices; performing sample point adaptive offset (SAO) filtering processing on the video signal or the signal based on the video signal to generate a first filtered signal, The sampling point adaptive offset filtering process applies the multiple frequency band offsets to the plurality of frequency band indices; and performs deblocking filtering processing on the first filtered signals to generate a second filtered signal.
(2)根據(1)所述的通訊設備裝置,其中,所述處理器用於:分析與至少一個最大編碼單元相關聯的多個畫素以生成表示所述畫素值分佈的畫素值直方圖,其中,所述最大編碼單元與所述視頻訊號相關;以及基於所述畫素值直方圖,識別要被應用所述多個頻帶偏移的所述多個頻帶指數。 (2) The communication device device of (1), wherein the processor is configured to: analyze a plurality of pixels associated with the at least one maximum coding unit to generate a pixel value histogram representing the pixel value distribution a picture, wherein the maximum coding unit is associated with the video signal; and based on the pixel value histogram, identifying the plurality of frequency band indices to which the plurality of frequency band offsets are to be applied.
(3)根據(1)所述的通訊設備裝置,其中:所述多個頻帶指數具有不連續分佈,使得所述多個頻帶指數中的至少兩個連續的頻帶指數彼此分離至少一個頻帶指數值。 (3) The communication device device of (1), wherein: the plurality of band indices have a discontinuous distribution such that at least two consecutive band indices of the plurality of band indices are separated from each other by at least one band index value .
(4)根據(1)所述的通訊設備裝置,其中:所述畫素值分佈指示分別與所述多個頻帶指數中的至少一部分相關的所述多個畫素的多個子集;以及要被應用所述多個頻帶偏移的所述多個頻帶指數對應於所述多個畫素的多個子集中的至少一個子集,其中,該至少一個子集與所述多個畫素的多個子集中的其他子集相比具有相對較大或最大數量的畫素。 (4) The communication device device of (1), wherein: the pixel value distribution indicates a plurality of subsets of the plurality of pixels respectively associated with at least a portion of the plurality of band indices; The plurality of frequency band indices to be applied to the plurality of frequency band offsets correspond to at least one subset of the plurality of subsets of the plurality of pixels, wherein the at least one subset and the plurality of pixels Other subsets in multiple subsets have a relatively larger or largest number of pixels.
(5)根據(1)所述的通訊設備裝置,其中: 所述裝置是在衛星通訊系統、無線通訊系統、有線通訊系統、光纖通訊系統及移動通訊系統中的至少一個中操作的通訊設備。 (5) The communication device device according to (1), wherein: The device is a communication device operating in at least one of a satellite communication system, a wireless communication system, a wired communication system, a fiber optic communication system, and a mobile communication system.
(6)一種通訊設備裝置,包括:輸入端,用於從至少一個額外裝置接收視頻訊號及多個頻帶偏移;以及處理器,用於:分析與至少一個最大編碼單元(LCU)相關的多個畫素以識別用於識別多個頻帶指數的畫素值分佈,所述最大編碼單元與所述視頻訊號相關;以及根據所述視頻訊號或基於所述視頻訊號的訊號的濾波處理將所述多個頻帶偏移應用到所述多個頻帶指數。 (6) A communication device apparatus comprising: an input for receiving a video signal and a plurality of band offsets from at least one additional device; and a processor for: analyzing a plurality of associated with at least one maximum coding unit (LCU) a pixel to identify a pixel value distribution for identifying a plurality of band indices, the maximum coding unit being associated with the video signal; and filtering processing according to the video signal or a signal based on the signal of the video signal A plurality of band offsets are applied to the plurality of band indices.
(7)根據(6)所述的通訊設備裝置,其中,所述處理器用於:對所述視頻訊號或基於所述視頻訊號的訊號執行採樣點自適應偏移(SAO)濾波處理以生成第一濾波訊號,其中,所述採樣點自適應偏移濾波處理將所述多個頻帶偏移應用到所述多個頻帶指數;以及對所述第一濾波訊號執行去塊濾波處理以生成第二濾波訊號。 (7) The communication device device of (6), wherein the processor is configured to: perform a sample point adaptive offset (SAO) filtering process on the video signal or a signal based on the video signal to generate a first a filter signal, wherein the sample point adaptive offset filter process applies the plurality of band offsets to the plurality of band indices; and performs deblocking filtering processing on the first filter signal to generate a second Filter signal.
(8)根據(6)所述的通訊設備裝置,其中,所述處理器用於:分析與至少一個最大編碼單元相關的多個畫素以生成表示所述畫素值分佈的畫素值直方圖,其中,所述最大編碼單元與所述視頻訊號相關;以及基於所述畫素值直方圖,識別要被應用所述多個頻帶偏移的多個頻帶指數。 (8) The communication device device of (6), wherein the processor is configured to: analyze a plurality of pixels associated with the at least one maximum coding unit to generate a pixel value histogram representing the pixel value distribution And wherein the maximum coding unit is associated with the video signal; and based on the pixel value histogram, identifying a plurality of frequency band indices to which the plurality of frequency band offsets are to be applied.
(9)根據(6)所述的通訊設備裝置,其中:所述多個頻帶指數具有不連續分佈,使得所述多個頻帶指數 中的至少兩個連續的頻帶指數彼此分離至少一個頻帶指數值。 (9) The communication device device of (6), wherein: the plurality of band indices have a discontinuous distribution such that the plurality of band indices At least two consecutive frequency band indices are separated from each other by at least one band index value.
(10)根據(6)所述的通訊設備裝置,其中:所述畫素值分佈指示分別與所述多個頻帶指數中的至少一部分相關的所述多個畫素的多個子集;以及要被應用所述多個頻帶偏移的所述多個頻帶指數對應於所述多個畫素的多個子集中的至少一個子集,其中,該至少一個子集與所述多個畫素的多個子集中的其他子集相比具有相對較大或最大數量的畫素。 (10) The communication device device of (6), wherein: the pixel value distribution indicates a plurality of subsets of the plurality of pixels respectively associated with at least a portion of the plurality of band indices; The plurality of frequency band indices to be applied to the plurality of frequency band offsets correspond to at least one subset of the plurality of subsets of the plurality of pixels, wherein the at least one subset and the plurality of pixels Other subsets in multiple subsets have a relatively larger or largest number of pixels.
(11)根據(6)所述的通訊設備裝置,其中:所述多個頻帶偏移由所述裝置經由信令從所述至少一個額外裝置接收;以及所述處理器用於基於所述畫素值分佈推理地識別所述多個頻帶指數。 (11) The communication device device of (6), wherein: the plurality of band offsets are received by the device from the at least one additional device via signaling; and the processor is configured to be based on the pixels The value distribution inferentially identifies the plurality of band indices.
(12)根據(6)所述的通訊設備裝置,其中:所述通訊設備裝置是包括視頻解碼器的接收器通訊設備;所述至少一個額外裝置是包括視頻編碼器的發射器通訊設備;以及所述接收器通訊設備及所述發射器通訊設備經由至少一個通訊通道連接或通訊地耦接。 (12) The communication device device of (6), wherein: the communication device device is a receiver communication device including a video decoder; the at least one additional device is a transmitter communication device including a video encoder; The receiver communication device and the transmitter communication device are coupled or communicatively coupled via at least one communication channel.
(13)根據(6)所述的通訊設備裝置,其中:所述通訊設備裝置是在衛星通訊系統、無線通訊系統、有線通訊系統、光纖通訊系統及移動通訊系統中的至少一個中操作的通訊設備。 (13) The communication device device according to (6), wherein: the communication device device is a communication operation in at least one of a satellite communication system, a wireless communication system, a wired communication system, a fiber-optic communication system, and a mobile communication system. device.
(14)一種通訊設備的操作方法,所述方法包括:經由所述通訊設備的輸入端,從至少一個額外通訊設備接收視頻訊號及多個頻帶偏移;分析與至少一個最大編碼單元(LCU)相關的多個畫素以識別用於識別多個頻帶指數的畫素值分佈,其中,所述最大編碼單 元與所述視頻訊號相關;以及根據所述視頻訊號或基於所述視頻訊號的訊號的濾波處理將所述多個頻帶偏移應用到所述多個頻帶指數。 (14) A method of operating a communication device, the method comprising: receiving, via an input of the communication device, a video signal and a plurality of frequency band offsets from at least one additional communication device; analyzing and at least one maximum coding unit (LCU) Correlated multiple pixels to identify a pixel value distribution for identifying a plurality of band indices, wherein the maximum code list The element is associated with the video signal; and applying the plurality of frequency band offsets to the plurality of frequency band indices according to the video signal or a filtering process based on the signal of the video signal.
(15)根據(14)所述的通訊設備的操作方法,進一步包括:對所述視頻訊號或基於所述視頻訊號的訊號執行採樣點自適應偏移(SAO)濾波處理以生成第一濾波訊號,其中,所述採樣點自適應偏移濾波處理包括將所述多個頻帶偏移應用到所述多個頻帶指數;以及對所述第一濾波訊號執行去塊濾波處理以生成第二濾波訊號。 (15) The method for operating a communication device according to (14), further comprising: performing a sample point adaptive offset (SAO) filtering process on the video signal or a signal based on the video signal to generate a first filtered signal The sampling point adaptive offset filtering process includes applying the plurality of frequency band offsets to the plurality of frequency band indices; and performing deblocking filtering processing on the first filtered signal to generate a second filtered signal .
(16)根據(14)所述的通訊設備的操作方法,進一步包括:分析與所述至少一個最大編碼單元相關的多個畫素以生成表示所述畫素值分佈的畫素值直方圖,其中,所述最大編碼單元與所述視頻訊號相關;以及基於所述畫素值直方圖,識別要被應用所述多個頻帶偏移的多個頻帶指數。 (16) The method of operating a communication device according to (14), further comprising: analyzing a plurality of pixels associated with the at least one maximum coding unit to generate a pixel value histogram representing the pixel value distribution, Wherein the maximum coding unit is associated with the video signal; and based on the pixel value histogram, identifying a plurality of frequency band indices to which the plurality of frequency band offsets are to be applied.
(17)根據(14)所述的通訊設備的操作方法,其中:所述多個頻帶指數具有不連續分佈,使得所述多個頻帶指數中的至少兩個連續的頻帶指數彼此分離至少一個頻帶指數值。 (17) The method of operating a communication device according to (14), wherein: the plurality of band indices have a discontinuous distribution such that at least two consecutive band indices of the plurality of band indices are separated from each other by at least one band Index value.
(18)根據(14)所述的通訊設備的操作方法,其中:所述畫素值分佈指示分別與所述多個頻帶指數中的至少一部分相關的所述多個畫素的多個子集;以及要被應用所述多個頻帶偏移的所述多個頻帶指數對應於所述多個畫素的多個子集中的至少一個子集,其中,該至少一個子集與所述多個畫素的多個子集中的其他子集相比具有相對較大或最大數量的畫素。 (18) The method of operating a communication device according to (14), wherein: the pixel value distribution indicates a plurality of subsets of the plurality of pixels respectively associated with at least a portion of the plurality of frequency band indices And the plurality of frequency band indices to be applied to the plurality of frequency band offsets corresponding to at least one subset of the plurality of subsets of the plurality of pixels, wherein the at least one subset and the plurality of pictures The other subsets of the plurality of primes have a relatively large or maximum number of pixels.
(19)根據(14)所述的通訊設備的操作方法,進一步包括:經由信令從所述至少一個額外通訊設備接收所述多個頻帶偏 移;以及基於所述畫素值分佈推理地識別所述多個頻帶指數。 (19) The operating method of the communication device according to (14), further comprising: receiving the plurality of frequency band offsets from the at least one additional communication device via signaling Shifting; and inferringly identifying the plurality of band indices based on the pixel value distribution.
(20)根據(14)所述的通訊設備的操作方法,其中:所述通訊設備在衛星通訊系統、無線通訊系統、有線通訊系統、光纖通訊系統及移動通訊系統中的至少一個中操作。 (20) The operating method of the communication device according to (14), wherein the communication device operates in at least one of a satellite communication system, a wireless communication system, a wired communication system, a fiber-optic communication system, and a mobile communication system.
100‧‧‧通訊系統 100‧‧‧Communication system
110‧‧‧通訊設備 110‧‧‧Communication equipment
112‧‧‧發射器 112‧‧‧transmitter
114‧‧‧編碼器 114‧‧‧Encoder
116‧‧‧接收器 116‧‧‧ Receiver
118‧‧‧解碼器 118‧‧‧Decoder
120‧‧‧通訊設備 120‧‧‧Communication equipment
122‧‧‧接收器 122‧‧‧ Receiver
124‧‧‧解碼器 124‧‧‧Decoder
126‧‧‧發射器 126‧‧‧transmitter
128‧‧‧編碼器 128‧‧‧Encoder
130‧‧‧衛星通訊通道 130‧‧‧Satellite communication channel
132‧‧‧衛星天線 132‧‧‧Satellite antenna
134‧‧‧衛星天線 134‧‧‧Satellite antenna
140‧‧‧無線通訊通道 140‧‧‧Wireless communication channel
142‧‧‧塔 142‧‧‧ tower
144‧‧‧塔 144‧‧‧ tower
150‧‧‧有線通訊通道 150‧‧‧Wired communication channel
152‧‧‧本地天線 152‧‧‧Local antenna
154‧‧‧本地天線 154‧‧‧Local antenna
160‧‧‧光纖通訊通道 160‧‧‧Fiber communication channel
162‧‧‧電光(E/O)介面 162‧‧‧Electrical (E/O) interface
164‧‧‧光電(O/E)介面 164‧‧‧Photoelectric (O/E) interface
199‧‧‧通訊通道 199‧‧‧Communication channel
200‧‧‧通訊系統 200‧‧‧Communication system
201‧‧‧訊息位元 201‧‧‧Message Bits
202‧‧‧編碼信息位元 202‧‧‧ encoded information bits
203‧‧‧離散值調製符號序列 203‧‧‧Discrete value modulation symbol sequence
204‧‧‧連續時間傳輸訊號 204‧‧‧Continuous time transmission signal
205‧‧‧過濾的連續時間傳輸訊號 205‧‧‧Filtered continuous time transmission signal
206‧‧‧連續時間接收訊號 206‧‧‧Continuous time to receive signals
207‧‧‧過濾的連續時間接收訊號 207‧‧‧Filtered continuous time reception signal
208‧‧‧離散時間接收訊號 208‧‧‧Discrete time receiving signal
209‧‧‧度量值 209‧‧‧ metrics
210‧‧‧最佳估計離散值調製符號和其中編碼的訊息位元 210‧‧‧Best estimate of discrete-valued modulation symbols and the information bits encoded therein
220‧‧‧編碼器和符號映射器 220‧‧‧Encoder and symbol mapper
222‧‧‧編碼器 222‧‧‧Encoder
224‧‧‧符號映射器 224‧‧‧symbol mapper
230‧‧‧發射驅動器 230‧‧‧Transmission driver
232‧‧‧數位類比轉換器(DAC) 232‧‧‧Digital Analog Converter (DAC)
234‧‧‧發射濾波器 234‧‧‧Transmission filter
260‧‧‧類比前端(AFE) 260‧‧‧ analog front end (AFE)
262‧‧‧接收濾波器 262‧‧‧ Receive Filter
264‧‧‧類比數位轉換器(ADC) 264‧‧‧ Analog Digital Converter (ADC)
270‧‧‧度量生成器 270‧‧‧Metric Generator
280‧‧‧解碼器 280‧‧‧Decoder
280a‧‧‧處理模組 280a‧‧‧Processing Module
280b‧‧‧處理模組 280b‧‧‧Processing module
297‧‧‧發射器 297‧‧‧transmitter
298‧‧‧接收器 298‧‧‧ Receiver
299‧‧‧通訊通道 299‧‧‧Communication channel
301‧‧‧電腦 301‧‧‧ computer
302‧‧‧筆記型電腦 302‧‧‧Note Computer
303‧‧‧高清(HD)電視 303‧‧‧High Definition (HD) TV
304‧‧‧標準清晰度(SD)電視 304‧‧‧Standard Definition (SD) TV
305‧‧‧手持媒體單元 305‧‧‧Handheld media unit
306‧‧‧機上盒(STB) 306‧‧‧Set-top box (STB)
307‧‧‧數位視頻光碟(DVD)播放器 307‧‧‧Digital Video Disc (DVD) Player
308‧‧‧通用數位圖像和/或視頻處理設備 308‧‧‧Common digital image and / or video processing equipment
400‧‧‧實施方式 400‧‧‧ Implementation
500‧‧‧實施方式 500‧‧‧ implementation
600‧‧‧實施方式 600‧‧‧ implementation
700‧‧‧實施方式 700‧‧‧ Implementation
800‧‧‧實施方式 800‧‧‧ Implementation
900‧‧‧實施方式 900‧‧‧ implementation
1000‧‧‧實施方式 1000‧‧‧ implementation
1100‧‧‧實施方式 1100‧‧‧ Implementation
1200‧‧‧實施方式 1200‧‧‧ implementation
1300‧‧‧實施方式 1300‧‧‧ implementation
1400‧‧‧實施方式 1400‧‧‧ implementation
1500‧‧‧實施方式 1500‧‧‧ implementation
1600‧‧‧實施方式 1600‧‧‧ implementation
1700‧‧‧實施方式 1700‧‧‧ Implementation
1800‧‧‧方法 1800‧‧‧ method
1810~1830‧‧‧方框 1810~1830‧‧‧ box
1900‧‧‧方法 1900‧‧‧ method
1910~1940‧‧‧方框 1910~1940‧‧‧ box
圖1和圖2示出了通訊系統的各個實施方式。 Figures 1 and 2 illustrate various embodiments of a communication system.
圖3A示出了電腦的實施方式。 Figure 3A shows an embodiment of a computer.
圖3B示出了筆記型電腦的實施方式。 Figure 3B shows an embodiment of a notebook computer.
圖3C示出了高清(HD)電視的實施方式。 Figure 3C illustrates an embodiment of a high definition (HD) television.
圖3D示出了標準清晰度(SD)電視的實施方式。 Figure 3D illustrates an embodiment of a standard definition (SD) television.
圖3E示出了手持媒體單元的實施方式。 Figure 3E illustrates an embodiment of a handheld media unit.
圖3F示出了機上盒(STB)的實施方式。 Figure 3F shows an embodiment of a set-top box (STB).
圖3G示出了數位視頻光碟(DVD)播放器的實施方式。 Figure 3G illustrates an embodiment of a digital video disc (DVD) player.
圖3H示出了通用數位圖像和/或視頻處理設備的實施方式。 Figure 3H illustrates an embodiment of a general purpose digital image and/or video processing device.
圖4、圖5和圖6是示出了視頻編碼架構的各個實施方式的示圖。 4, 5, and 6 are diagrams showing various embodiments of a video coding architecture.
圖7是示出了幀內預測處理的實施方式的示圖。 FIG. 7 is a diagram showing an embodiment of intra prediction processing.
圖8是示出了幀間預測處理的實施方式的示圖。 FIG. 8 is a diagram showing an embodiment of an inter prediction process.
圖9和圖10是示出了視頻解碼架構的各個實施方式的示圖。 9 and 10 are diagrams showing various embodiments of a video decoding architecture.
圖11示出了頻帶偏移採樣點自適應偏移(SAO)濾波處理的實施方式。 Figure 11 illustrates an embodiment of a band offset sample point adaptive offset (SAO) filtering process.
圖12示出了視頻編碼架構的可替換實施方式。 Figure 12 illustrates an alternate embodiment of a video coding architecture.
圖13示出了採樣點自適應偏移(SAO)頻帶偏移模式下的傳輸頻帶偏移的指示(自適應和/或顯式信令)的各個實施方式。 Figure 13 illustrates various embodiments of an indication (adaptive and/or explicit signaling) of a transmission band offset in a Sample Point Adaptive Offset (SAO) band offset mode.
圖14示出了SAO頻帶偏移模式下的頻帶粒度的指示(自適應和/或顯式信令)的各個實施方式。 Figure 14 illustrates various embodiments of an indication of frequency band granularity (adaptive and/or explicit signaling) in the SAO band offset mode.
圖15示出了隱式頻帶指數信令的實施方式。 Figure 15 illustrates an embodiment of implicit band index signaling.
圖16示出了隱式頻帶指數信令的可替換實施方式。 Figure 16 illustrates an alternate embodiment of implicit band index signaling.
圖17示出了頻帶偏移編碼的實施方式。 Figure 17 illustrates an embodiment of band offset coding.
圖18及圖19示出了用於操作一個或多個設備(例如,通訊設備、接收器和/或解碼器設備、發射器和/或編碼器設備等)的方法的各個實施方式。 18 and 19 illustrate various embodiments of a method for operating one or more devices (eg, communication devices, receivers and/or decoder devices, transmitters, and/or encoder devices, etc.).
在使用數位媒體(諸如數位視頻)的多個設備中,利用畫素表示各個圖像(其本質為數位的)。在某些通訊系統中,數位媒體可以從第一位置傳輸至可以輸出或顯示這種媒體的第二位置。數位通訊系統(包括操作用於通訊數位視頻的通訊系統)的目的在於無誤差地或以可接受的低誤碼率將數位數據從一個位置或子系統傳輸至另一個位置。如圖1所示,數據可以通過在各種通訊系統中的各個通訊通道上進行傳輸:磁性媒介、有線、無線、光纖、銅和/或其他類型的媒介。 In a plurality of devices using digital media, such as digital video, each image is represented by a pixel (which is digital in nature). In some communication systems, digital media can be transferred from a first location to a second location where such media can be output or displayed. The purpose of digital communication systems, including communication systems for communicating digital video, is to transfer digital data from one location or subsystem to another without error or at an acceptable low bit error rate. As shown in Figure 1, data can be transmitted over various communication channels in various communication systems: magnetic media, wired, wireless, fiber optic, copper, and/or other types of media.
圖1和圖2分別示出了通訊系統100和200的各個實施方式。 1 and 2 illustrate various embodiments of communication systems 100 and 200, respectively.
參照圖1,通訊系統100的實施方式是通訊通道199,該通訊通道將位於通訊通道199一端的通訊設備110(包括具有編碼器114的發射器112並包括具有解碼器118的接收器116)通訊地耦接至位於通訊通道199另一端的另一個通訊設備120(包括具有編碼器128的發射器126並包括具有解碼器124的接收器122)。在某些實施方式中,通訊設備110和120之一可能僅包括發射器或接收器。存在通訊通道199可以通過其實現的幾種不同類型的媒介(例如,使用碟形的衛星天線132和134的衛星通訊通道130、使用塔142和144和/或本地天線152和154的無線通訊通道140、有線通訊通道150和/或使用電光(E/O)介面162和光電(O/E)介面164的光纖通訊通道160)。另外,可以實現不止一種類型的媒介,並可以介面在一起,從而形成通訊通道199。 Referring to Figure 1, an embodiment of communication system 100 is a communication channel 199 that communicates communication device 110 (including transmitter 112 having encoder 114 and receiver 116 having decoder 118) at one end of communication channel 199. It is coupled to another communication device 120 (including a transmitter 126 having an encoder 128 and including a receiver 122 having a decoder 124) located at the other end of the communication channel 199. In some embodiments, one of the communication devices 110 and 120 may only include a transmitter or receiver. There are several different types of media through which communication channel 199 can be implemented (e.g., satellite communication channel 130 using dished satellite antennas 132 and 134, wireless communication channels using towers 142 and 144 and/or local antennas 152 and 154) 140. Wired communication channel 150 and/or fiber optic communication channel 160 using an electro-optic (E/O) interface 162 and an optoelectronic (O/E) interface 164. In addition, more than one type of medium can be implemented and interfaced together to form communication channel 199.
要注意的是,在不背離本發明的範圍和精神的情況下,這樣 的通訊設備110和/或120可以是靜止設備或移動設備。例如,通訊設備110和120之一或兩者可以在固定位置實現或可以是具有與多於一個的網路接入點(例如,在包括一個或多個無線局域網(WLAN)的移動通訊系統環境中的各自不同的接入點(AP)、在包括一個或多個衛星的移動通訊系統環境中的各自不同的衛星或通常在包括一個或多個網路接入點的移動通訊系統環境中的各自不同的網路接入點,可以通過這些網路節點利用通訊設備110和/或120來實現通訊)相關聯和/或進行通訊的能力的移動通訊設備。 It is to be noted that without departing from the scope and spirit of the invention, The communication device 110 and/or 120 can be a stationary device or a mobile device. For example, one or both of communication devices 110 and 120 may be implemented at a fixed location or may have more than one network access point (eg, in a mobile communication system environment including one or more wireless local area networks (WLANs) Respective access points (APs), respective different satellites in a mobile communication system environment including one or more satellites, or typically in a mobile communication system environment including one or more network access points Each of the different network access points, through which the communication devices 110 and/or 120 can be utilized to enable communication, the ability to communicate and/or communicate with the mobile communication device.
通常採用誤差校正和通道編碼方案來降低通訊系統中不可避免所遭受的傳輸誤差。一般情況下,這些誤差校正和通道編碼方案涉及在通訊通道199的發射器端使用編碼器以及在通訊通道199的接收器端使用解碼器。 Error correction and channel coding schemes are often used to reduce transmission errors that are unavoidable in communication systems. In general, these error correction and channel coding schemes involve the use of an encoder at the transmitter end of communication channel 199 and the use of a decoder at the receiver end of communication channel 199.
在任何這樣的所需通訊系統(例如,包括針對圖1描述的那些變形例)、任何訊息儲存設備(例如,硬碟機(HDD)、網路訊息儲存設備和/或服務器等)或需要對訊息進行編碼和/或解碼的任何應用中,可以採用所描述的任何多種類型的ECC代碼。 In any such desired communication system (eg, including those described with respect to FIG. 1), any message storage device (eg, a hard disk drive (HDD), network message storage device, and/or server, etc.) or Any of a variety of types of ECC codes described may be employed in any application in which the message is encoded and/or decoded.
一般來說,當考慮視頻數據從一個位置或子系統傳輸至另一個位置的通訊系統時,一般可以認為視頻數據編碼在通訊通道199的傳輸端進行,一般可以認為視頻數據解碼在通訊通道199的接收端進行。 In general, when considering the transmission of video data from one location or subsystem to another, it is generally considered that the video data is encoded at the transmission end of the communication channel 199. Generally, the video data can be decoded in the communication channel 199. The receiving end performs.
同樣地,儘管該示圖的實施方式示出了通訊設備110和120之間能夠雙向通訊,當然要注意的是,在某些實施方式中,通訊設備110可能只包括視頻數據編碼功能,通訊設備120可能只包括視頻數據解碼功能,反之亦然(例如,根據視頻廣播實施方式的單向實施方式)。 Similarly, although the embodiment of the diagram illustrates two-way communication between communication devices 110 and 120, it is to be noted that, in some embodiments, communication device 110 may only include video data encoding functionality, communication devices. 120 may only include video data decoding functionality and vice versa (eg, according to a one-way implementation of a video broadcast implementation).
參照圖2的通訊系統200,在通訊通道299的發射端,向發射器297提供訊息位元201(例如,尤其對應於一個實施方式中的視頻數據),該發射器可操作用於利用編碼器和符號映射器220(可 以分別視為獨立功能塊的編碼器222和符號映射器224,編碼器222將編碼信息位元202傳送到符號映射器224)進行這些訊息位元201的解碼,從而生成一系列離散值調製符號序列203,將這一系列離散值調製符號提供給發射驅動器230,該發射驅動器使用數位類比轉換器(DAC)232來生成連續時間傳輸訊號204並利用發射濾波器234來生成大致與通訊通道299相稱的經過濾的連續時間傳輸訊號205。在通訊通道299的接收端,將連續時間接收訊號206提供給類比前端(AFE)260,該類比前端包括接收濾波器262(其生成經過濾的連續時間接收訊號207)以及類比數位轉換器(ADC)264(其生成離散時間接收訊號208)。度量生成器270計算解碼器280用來對最佳估計離散值調製符號和其中編碼的訊息位元210編碼的訊息位元進行最佳估算的度量值209(例如,基於符號和/或位元)。 Referring to the communication system 200 of FIG. 2, at the transmitting end of the communication channel 299, a message bit 201 (e.g., corresponding to video data in one embodiment) is provided to the transmitter 297, the transmitter being operable to utilize the encoder And symbol mapper 220 (can Encoder 222, which is considered to be an independent functional block, and symbol mapper 224, respectively, encodes coded information bits 202 to symbol mapper 224 for decoding of these message bits 201, thereby generating a series of discrete value modulation symbols. Sequence 203 provides the series of discrete value modulation symbols to transmit driver 230, which uses digital analog converter (DAC) 232 to generate continuous time transmission signal 204 and utilizes transmit filter 234 to generate substantially commensurate with communication channel 299. The filtered continuous time transmission signal 205. At the receiving end of communication channel 299, a continuous time receive signal 206 is provided to an analog front end (AFE) 260, which includes a receive filter 262 (which generates a filtered continuous time receive signal 207) and an analog digital converter (ADC). 264 (which generates a discrete time receive signal 208). The metric generator 270 calculates a metric 209 (e.g., based on symbols and/or bits) that the decoder 280 uses to best estimate the best estimated discrete value modulation symbols and the message bits encoded therein by the encoded message bits 210. .
在發射器297和接收器298的每一個中,可以實現將各個組件、塊、功能塊、電路等任何所需的集成在其中。例如,該圖示出處理模組280a包括編碼器和符號映射器220以及其中所有相關聯的對應部件,示出處理模組280b包括度量生成器270和解碼器280以及其中所有相關聯的對應部件。這樣的處理模組280a和280b可以是相應的集成電路。當然,在不背離本發明的範圍和精神的情況下,可以可替換地進行其他邊界和分組。例如,第一處理模組或集成電路中可以包括發射器297中的所有部件,第二處理模組或集成電路中可以包括接收器298中的所有部件。可替換地,在其他實施方式中可以任意組合發射器297和接收器298的每一個中的部件。 In each of the transmitter 297 and the receiver 298, any desired integration of various components, blocks, functional blocks, circuits, and the like can be implemented therein. For example, the figure shows that processing module 280a includes an encoder and symbol mapper 220 and all associated corresponding components therein, showing that processing module 280b includes metric generator 270 and decoder 280 and all associated corresponding components therein. . Such processing modules 280a and 280b can be corresponding integrated circuits. Of course, other boundaries and groupings may alternatively be made without departing from the scope and spirit of the invention. For example, all components in the transmitter 297 may be included in the first processing module or integrated circuit, and all components in the receiver 298 may be included in the second processing module or integrated circuit. Alternatively, the components in each of the transmitter 297 and the receiver 298 can be arbitrarily combined in other embodiments.
與前述實施方式一樣,這樣的通訊系統200可以用於視頻數據從一個位置或子系統傳輸至另一個位置(例如,經由通訊通道299從發射器297至接收器298)的通訊。 As with the previous embodiments, such communication system 200 can be used for communication of video data from one location or subsystem to another (e.g., from transmitter 297 to receiver 298 via communication channel 299).
可以由以下圖3A至圖3H所示的各個設備中的任何一個來實 現數位圖像和/或數位圖像和/或媒體(包括數位視頻訊號中的各個圖像)的視頻處理,從而允許用戶觀看這種數位圖像和/或視頻。這些多種設備並不包括窮盡的設備清單,其中可以實現本文描述的圖像和/或視頻處理,要注意的是,在不背離本發明的範圍和精神的情況下,可以實現任何通用數位圖像和/或視頻處理設備以便進行本文所述的處理。 It can be realized by any of the respective devices shown in FIG. 3A to FIG. 3H below. The video processing of digital images and/or digital images and/or media (including individual images of digital video signals) allows the user to view such digital images and/or video. These various devices do not include an exhaustive list of devices in which the image and/or video processing described herein can be implemented, it being noted that any general purpose digital image can be implemented without departing from the scope and spirit of the invention. And/or a video processing device for performing the processing described herein.
圖3A示出了電腦301的實施方式。該電腦301可以是桌上型電腦,或主機的企業儲存設備(諸如服務器),其附接至儲存陣列(諸如獨立磁盤冗陣列(RAID)、儲存路由器、邊緣路由器、儲存交換機和/或儲存導向器)。用戶能夠利用電腦301來查看靜態數位圖像和/或視頻(例如一系列數位圖像)。電腦301上時常包括多種圖像和/或視頻觀看程序和/或媒體播放器程序以允許用戶觀看這種圖像(包括視頻)。 FIG. 3A shows an embodiment of a computer 301. The computer 301 can be a desktop computer, or a hosted enterprise storage device (such as a server) attached to a storage array (such as a stand-alone disk redundancy array (RAID), storage router, edge router, storage switch, and/or storage guide) Device). The user can utilize the computer 301 to view static digital images and/or video (eg, a series of digital images). A variety of image and/or video viewing programs and/or media player programs are often included on computer 301 to allow a user to view such images (including video).
圖3B示出了筆記型電腦302的實施方式。在多種情形中的任何一種情形下可以發現並使用這樣的筆記型電腦302。近年,隨著筆記型電腦的處理能力和功能的不斷增長,在先前使用高端且更有能力的桌上型電腦的許多情況下採樣這種筆記型電腦。與電腦301一樣,筆記型電腦302可以包括各個圖像觀看程序和/或媒體播放器程序以便允許用戶觀看這種圖像(包括視頻)。 FIG. 3B illustrates an embodiment of a notebook computer 302. Such a notebook computer 302 can be found and used in any of a variety of situations. In recent years, with the increasing processing power and functionality of notebook computers, such notebook computers have been sampled in many cases of previous high-end and more capable desktop computers. Like computer 301, notebook computer 302 can include various image viewing programs and/or media player programs to allow a user to view such images (including video).
圖3C示出了高清(HD)電視303的實施方式。許多HD電視303包括允許在其上接收、處理並解碼媒體內容(例如電視廣播訊號)的集成調諧器。可替換地,有時,HD電視303從另一個接收、處理並解碼電纜和/或衛星電視廣播訊號的源(諸如數位視頻光碟(DVD)播放器、機上盒(STB))接收媒體內容。不管特定實現方式如何,HD電視303可以被實現來進行如本文所描述的圖像和/或視頻處理。一般來說,HD電視303具有顯示HD媒體內容的能力並時常被實現為具有16:9寬屏的縱橫比。 FIG. 3C illustrates an embodiment of a high definition (HD) television 303. Many HD televisions 303 include an integrated tuner that allows media content (e.g., television broadcast signals) to be received, processed, and decoded thereon. Alternatively, sometimes the HD television 303 receives media content from another source that receives, processes, and decodes cable and/or satellite television broadcast signals, such as digital video disc (DVD) players, set-top boxes (STBs). Regardless of the particular implementation, HD television 303 can be implemented to perform image and/or video processing as described herein. In general, HD television 303 has the ability to display HD media content and is often implemented with an aspect ratio of 16:9 widescreen.
圖3D示出了標準清晰度(SD)電視304的實施方式。當然, SD電視304在某種程度上類似於HD電視303,至少一個不同之處在於SD電視304不包括顯示HD媒體內容的能力,且SD電視304時常被實現為具有4:3的全屏縱橫比。儘管如此,甚至是SD電視304也可以被實現來進行如本文所描述的圖像和/或視頻處理。 FIG. 3D illustrates an embodiment of a standard definition (SD) television 304. of course, The SD television 304 is somewhat similar to the HD television 303, with at least one difference in that the SD television 304 does not include the ability to display HD media content, and the SD television 304 is often implemented with a full screen aspect ratio of 4:3. Nonetheless, even SD TV 304 can be implemented to perform image and/or video processing as described herein.
圖3E示出了手持媒體單元305的實施方式。手持媒體單元305可以操作用於對供用戶回放的圖像/視頻內容訊息(諸如聯合圖像專家組(JPEG)文件、標記圖像文件格式(TIFF)、位圖、運動圖像專家組(MPEG)文件、Windows Media(WMA/WMV)文件、諸如MPEG4文件等的其他類型的視頻文件)和/或可以以數位格式儲存的其他類型訊息進行通用儲存或儲存。從歷史觀點上說,這樣的手持媒體單元主要用於儲存並回放音頻媒體;然而,這樣的手持媒體單元305可以用於儲存並回放任何虛擬媒體(例如,音頻媒體、視頻媒體、圖像媒體等)。而且,這樣的手持媒體單元305還可以包括其他功能,諸如用於有線和無線通訊的集成通訊電路。這樣的手持媒體單元305可以被實現來進行如本文所描述的圖像和/或視頻處理。 FIG. 3E illustrates an embodiment of a handheld media unit 305. Handheld media unit 305 can operate for image/video content messages for playback by a user (such as Joint Photographic Experts Group (JPEG) files, Tagged Image File Format (TIFF), bitmaps, Moving Picture Experts Group (MPEG) Files, Windows Media (WMA/WMV) files, other types of video files such as MPEG4 files, and/or other types of messages that can be stored in digital format for general storage or storage. Historically, such handheld media units are primarily used to store and play back audio media; however, such handheld media unit 305 can be used to store and play back any virtual media (eg, audio media, video media, image media, etc.) ). Moreover, such handheld media unit 305 can also include other functions, such as integrated communication circuitry for wired and wireless communications. Such handheld media unit 305 can be implemented to perform image and/or video processing as described herein.
圖3F示出了機上盒(STB)306的實施方式。如上所述,STB 306有時可以被實現來接收、處理並解碼要提供給任何合適的顯示可用設備(諸如SD電視304和/或HD電視303)的電纜和/或衛星電視廣播訊號。這樣的STB 306可以獨立地或協同地與顯示可用設備一起操作以便進行如本文所描述的圖像和/或視頻處理。 FIG. 3F illustrates an embodiment of a set-top box (STB) 306. As noted above, the STB 306 can sometimes be implemented to receive, process, and decode cable and/or satellite television broadcast signals to be provided to any suitable display available device, such as SD television 304 and/or HD television 303. Such STBs 306 can operate independently or in concert with display available devices for image and/or video processing as described herein.
圖3G示出了數位視頻光碟(DVD)播放器307的實施方式。在不背離本發明的範圍和精神的情況下,這樣的DVD播放器可以是藍光DVD播放機、HD可用DVD播放器、SD可用DVD播放器、上採樣可用DVD播放器(例如從SD至HD等)。DVD播放器可以向任何合適的顯示可用設備(諸如SD電視304和/或HD電視303)提供訊號。DVD播放器307可以被實現來進行如本文 所描述的圖像和/或視頻處理。 FIG. 3G illustrates an embodiment of a digital video disc (DVD) player 307. Such a DVD player may be a Blu-ray DVD player, an HD-available DVD player, an SD-available DVD player, an upsampling available DVD player (eg, from SD to HD, etc. without departing from the scope and spirit of the present invention). ). The DVD player can provide signals to any suitable display available device, such as SD TV 304 and/or HD TV 303. DVD player 307 can be implemented to perform as herein The described image and/or video processing.
圖3H示出了通用數位圖像和/或視頻處理設備308的實施方式。另外,如上所述,上述這些各種設備不包括可以實現本文描述的圖像和/或視頻處理的窮盡的設備清單,要注意的是,在不背離本發明的範圍和精神的情況下,可以實現任何通用數位圖像和/或視頻處理設備308以便進行本文所述的圖像和/或視頻處理。 FIG. 3H illustrates an embodiment of a general purpose digital image and/or video processing device 308. In addition, as described above, the various devices described above do not include an exhaustive list of devices that can implement the image and/or video processing described herein, and it is noted that it can be implemented without departing from the scope and spirit of the present invention. Any general purpose digital image and/or video processing device 308 is utilized to perform the image and/or video processing described herein.
圖4、圖5和圖6分別是示出了視頻編碼架構的各個實施方式400、500和600的示圖。 4, 5, and 6 are diagrams showing various embodiments 400, 500, and 600 of a video coding architecture, respectively.
參照圖4的實施方式400,針對該圖可以看出,由視頻編碼器接收輸入視頻訊號。在某些實施方式中,輸入視頻訊號由編碼單元(CU)或宏塊(MB)構成。這種編碼單元或宏塊的大小可變並可以包括通常以正方形佈置的多個畫素。在一個實施方式中,這種編碼單元或宏塊的大小為16×16畫素。然而,通常要注意的是,宏塊可以具有任何所需的大小,諸如N×N畫素,其中N為整數(例如,16×16、8×8或4×4)。當然,雖然優選實施方式中採用了方形編碼單元或宏塊,但某些實現方式可以包括非方形編碼單元或宏塊。 Referring to the embodiment 400 of FIG. 4, it can be seen for the figure that the input video signal is received by the video encoder. In some embodiments, the input video signal is comprised of a coding unit (CU) or a macroblock (MB). Such coding units or macroblocks are variable in size and may include a plurality of pixels that are typically arranged in a square. In one embodiment, the size of such coding unit or macroblock is 16 x 16 pixels. However, it is generally noted that the macroblocks can have any desired size, such as NxN pixels, where N is an integer (eg, 16x16, 8x8, or 4x4). Of course, although square coding units or macroblocks are employed in the preferred embodiment, some implementations may include non-square coding units or macroblocks.
輸入視頻訊號通常可以被稱為對應於原始幀(或圖片)圖像數據。例如,原始幀(或圖片)圖像數據可以經歷處理以生成亮度和色度樣本。在某些實施方式中,宏塊中的亮度樣本組是一個特定佈置(例如16×16),色度樣本組是不同的特定佈置(例如8×8)。根據本文所描述的實施方式,視頻編碼器以逐塊為單位對這些樣本進行處理。 The input video signal can generally be referred to as corresponding to the original frame (or picture) image data. For example, raw frame (or picture) image data may undergo processing to generate luminance and chrominance samples. In some embodiments, the set of luma samples in a macroblock is a particular arrangement (e.g., 16 x 16), and the sets of chroma samples are different specific arrangements (e.g., 8 x 8). In accordance with embodiments described herein, a video encoder processes the samples on a block-by-block basis.
輸入視頻訊號然後經歷模式選擇,輸入視頻訊號按照該模式選擇性地經歷幀內和/或幀間預測處理。一般來說,輸入視頻訊號沿壓縮通路經歷壓縮。當無反饋地進行操作時(例如,不根據幀間預測,也不根據幀內預測)輸入視頻訊號經由壓縮通路提供從而經歷變換操作(例如,根據離散餘弦變換(DCT))。當然,在 可替換實施方式中可以採用其他變換。在這種操作模式下,輸入視頻訊號本身就是所壓縮的訊號。壓縮通路可以利用人眼缺乏高頻率靈敏度來進行壓縮。 The input video signal then undergoes mode selection, and the input video signal selectively undergoes intra and/or inter prediction processing in accordance with the mode. In general, the input video signal undergoes compression along the compression path. When operating without feedback (eg, not based on inter prediction, nor based on intra prediction), the input video signal is provided via a compression path to undergo a transform operation (eg, according to a discrete cosine transform (DCT)). Of course, in Other transformations may be employed in alternative embodiments. In this mode of operation, the input video signal itself is the compressed signal. The compression path can be compressed using the human eye lacking high frequency sensitivity.
然而,通過選擇性地使用幀內或幀間預測視頻編碼,可以沿壓縮通路進行反饋。根據反饋或預測操作模式,壓縮通路對從當前宏塊減去當前宏塊預測值而產生的(相對較低能量)冗餘(例如差值)進行操作。根據在指定實例中採用哪種預測形式,生成當前宏塊和至少基於相同幀(或圖片)的一部分或至少基於至少一個其他幀(或圖片)的一部分的宏塊預測值之間的冗餘或差值。 However, feedback can be performed along the compression path by selectively using intra or inter prediction video coding. Based on the feedback or prediction mode of operation, the compression path operates on (relatively lower energy) redundancy (e.g., difference) resulting from subtracting the current macroblock prediction value from the current macroblock. Generating a current macroblock and redundancy between at least a portion of the same frame (or picture) or at least a portion of the macroblock prediction based on at least one other frame (or picture) based on which prediction form is employed in the specified instance Difference.
由此產生的修改的視頻訊號然後沿壓縮通路經歷變換操作。在一個實施方式中,離散餘弦變換(DCT)對一組視頻樣本(例如,亮度、色度、冗餘等)進行操作以計算預定數量的基模式中的每一個的係數值。例如,一個實施方式包括64個基函數(例如,對8×8樣本而言)。一般來說,不同實施方式可以採用不同數量的基函數(例如,不同變換)。這些各個基函數(包括合適的且選擇性加權的基函數)的任意組合可以用於表示給定的一組視頻樣本。與進行變換操作的各個方式相關的額外詳情在與包括通過引用併入的如上所述的那些標準/草案標準的視頻編碼相關聯的技術文獻中進行描述。變換處理的輸出包括各自的係數值。將該輸出提供給量化器。 The resulting modified video signal then undergoes a transform operation along the compression path. In one embodiment, a discrete cosine transform (DCT) operates on a set of video samples (eg, luminance, chrominance, redundancy, etc.) to calculate coefficient values for each of a predetermined number of base modes. For example, one embodiment includes 64 basis functions (eg, for an 8x8 sample). In general, different embodiments may employ different numbers of basis functions (eg, different transforms). Any combination of these various basis functions, including suitable and selectively weighted basis functions, can be used to represent a given set of video samples. Additional details relating to the various ways in which the transform operation is performed are described in the technical literature associated with video coding including those standard/draft standards as described above incorporated by reference. The output of the transform process includes the respective coefficient values. This output is provided to the quantizer.
一般情況下,大多數圖像塊通常會產生係數(例如,根據離散餘弦變換(DCT)操作的實施方式中的DCT係數),使得大多數相關DCT係數的頻率較低。由於這個原因以及人眼對高頻視覺效果的靈敏度相對較差,量化器可以操作用於將大部分不大相關的係數轉換為零值。也就是說,可以根據量化處理來消除相對貢獻率低於某個預定值(例如某個閾值)的那些係數。量化器還可以操作用於將重要係數轉換為比變化處理產生的值可以更加有效地進行編碼的值。例如,量化處理可以通過將每個相應係數除以 整數值並丟棄任意餘數來進行操作。當對典型的編碼單元或宏塊進行操作時,這個處理通常會產生相對少量的非零係數,這些非零係數然後被傳輸至熵編碼器進行無損編碼並且供根據可以根據視頻編碼選擇幀內和/或幀間預測處理的反饋路徑來使用。 In general, most image blocks typically produce coefficients (e.g., DCT coefficients in an embodiment based on discrete cosine transform (DCT) operation) such that the frequency of most of the associated DCT coefficients is lower. For this reason and the sensitivity of the human eye to high frequency visual effects is relatively poor, the quantizer can be operated to convert most of the less relevant coefficients to zero values. That is to say, those coefficients whose relative contribution rate is lower than a certain predetermined value (for example, a certain threshold) can be eliminated according to the quantization process. The quantizer can also operate to convert significant coefficients to values that can be encoded more efficiently than values produced by the variation process. For example, the quantization process can be performed by dividing each corresponding coefficient by The integer value is discarded and any remainder is discarded for operation. When operating on a typical coding unit or macroblock, this process typically produces a relatively small number of non-zero coefficients, which are then transmitted to an entropy encoder for lossless encoding and for selecting intra-frame based on video coding. / or feedback path for inter prediction processing to use.
熵編碼器根據無損壓縮編碼處理進行操作。相比之下,量化操作通常是有損失的。熵編碼處理對量化處理提供的係數進行操作。那些係數可以表示各個特徵(例如,亮度、色度、冗餘等)。熵編碼器可以採用各種類型的編碼。例如,熵編碼器可以進行上下文自適應二進制算術編碼(CABAC)和/或上下文自適應可變長編碼(CAVLC)。例如,根據熵編碼方案的至少一部分,將數據轉換為(運行、級別)配對(run-level paring)(例如,將數據14,3,0,4,0,0,-3轉換為各個(運行、級別)對(0,14),(0,3),(1,4),(2,-3))。事先,編制將可變長度編碼分配至值對的表,以便將相對較短長度編碼分配給相對常見的值對,並將相對較長長度編碼分配給相對少見的值對。 The entropy encoder operates in accordance with a lossless compression encoding process. In contrast, quantitative operations are usually lossy. The entropy encoding process operates on the coefficients provided by the quantization process. Those coefficients can represent individual features (eg, brightness, chrominance, redundancy, etc.). Entropy encoders can employ various types of encoding. For example, the entropy coder may perform context adaptive binary arithmetic coding (CABAC) and/or context adaptive variable length coding (CAVLC). For example, converting data to (run-level paring) according to at least a portion of the entropy encoding scheme (eg, converting data 14, 3, 0, 4, 0, 0, -3 to each (running) , level) pairs (0, 14), (0, 3), (1, 4), (2, -3)). In advance, a table is assigned to assign variable length codes to pairs of values to assign relatively short length codes to relatively common value pairs and relatively long length codes to relatively rare value pairs.
如讀者理解的一樣,逆量化和逆變換的操作分別對應於量化及變換的操作。例如,在DCT用於變換操作的實施方式中,逆DCT(IDCT)是逆變換操作中所採用的變換。 As understood by the reader, the operations of inverse quantization and inverse transform correspond to the operations of quantization and transformation, respectively. For example, in embodiments where DCT is used for transform operations, inverse DCT (IDCT) is the transform employed in the inverse transform operation.
圖片緩衝器(或者稱為數位圖片緩衝器或DPB)從IDCT模組接收訊號;圖片緩衝器操作用於儲存當前幀(或圖片)和/或一個或多個其他幀(或圖片),諸如作為根據視頻編碼所進行的幀內預測和/或幀間預測操作使用的幀(或圖片)。要注意的是,根據幀內預測,相對少量的儲存就足夠了,因為也許沒有必要儲存幀(或圖片)序列中的當前幀(或圖片)或任何其他幀(或圖片)。在根據視頻編碼進行幀間預測的時候,所儲存的訊息可以用來進行運動補償和/或運動估計。 A picture buffer (or digital picture buffer or DPB) receives signals from the IDCT module; the picture buffer operates to store the current frame (or picture) and/or one or more other frames (or pictures), such as A frame (or picture) used according to intra prediction and/or inter prediction operations by video coding. It is to be noted that, according to intra prediction, a relatively small amount of storage is sufficient, as it may not be necessary to store the current frame (or picture) or any other frame (or picture) in the sequence of frames (or pictures). When inter-predicting based on video coding, the stored information can be used for motion compensation and/or motion estimation.
在一個可能的實施方式中,對於運動估計,將來自當前幀(或圖片)的相應的亮度樣本組(例如,16×16)與幀(或圖片)序列 (例如根據幀間預測)中其他幀(或圖片)中的各個緩衝配對物進行比較。在一個可能的實現方式中,定位最匹配區域(例如預測參考)並產生向量偏移(例如運動向量)。在單個幀(或圖片)中,可以找到多個運動向量,但不是所有的運動向量都必須指向相同的方向。根據運動估計進行的一種或多種操作操作地用於生成一個或多個運動向量。 In one possible implementation, for motion estimation, a corresponding luma sample set (eg, 16×16) and frame (or picture) sequence from the current frame (or picture) The comparison is made for each buffer counterpart in other frames (or pictures) (eg, according to inter prediction). In one possible implementation, the best matching region (eg, prediction reference) is located and a vector offset (eg, motion vector) is generated. Multiple motion vectors can be found in a single frame (or picture), but not all motion vectors must point in the same direction. One or more operations performed in accordance with motion estimation are operatively used to generate one or more motion vectors.
運動補償操作地採用可以根據運動估計生成的一個或多個運動向量。識別並交付樣本的預測參考組以便從原始輸入視頻訊號中扣除,試圖希望產生相對(例如,理想的、多個)較低的能量冗餘。如果此類操作未導致產生較低的能量冗餘,就不必進行運動補償,變換操作可以僅對原始輸入視頻訊號進行操作,而不對冗餘進行操作(例如,根據將輸入視頻訊號直接提供給變換操作,以便不進行幀內預測,也不進行幀間預測的操作模式),或可以使用幀內預測並對幀內預測產生的冗餘進行變換操作。同樣地,如果運動估計和/或運動補償操作成功,運動向量還可以與相應的冗餘係數一起發送至熵編碼器,用於進行無損熵編碼。 Motion compensation operatively employs one or more motion vectors that can be generated from motion estimation. The predictive reference set of samples is identified and delivered for deduction from the original input video signal in an attempt to produce a relative (eg, ideal, multiple) lower energy redundancy. If such an operation does not result in lower energy redundancy, motion compensation is not necessary, and the transform operation can operate only on the original input video signal without performing redundancy (eg, based on providing the input video signal directly to the transform) The operation is such that no intra prediction is performed, and the operation mode of inter prediction is not performed, or intra prediction is performed and the redundancy generated by the intra prediction is subjected to a transform operation. Likewise, if the motion estimation and/or motion compensation operation is successful, the motion vector can also be sent to the entropy encoder along with the corresponding redundancy coefficients for lossless entropy coding.
來自整個視頻編碼操作的輸出是輸出位元流。要注意的是,輸出位元流當然可以根據生成連續時間訊號(該連續時間訊號可以經由通訊通道傳輸)來進行一定處理。例如,某些實施方式在無線通訊系統中操作。在這種情況下,輸出位元流可以在用於生成能夠經由通訊通道傳輸的連續時間訊號等的無線通訊設備中進行適當的數位類比轉換、頻率變換、縮放、過濾、調製、符號映射和/或任何其他操作。 The output from the entire video encoding operation is the output bit stream. It should be noted that the output bit stream can of course be processed according to the generation of a continuous time signal (which can be transmitted via the communication channel). For example, some embodiments operate in a wireless communication system. In this case, the output bit stream can perform appropriate digital analog conversion, frequency transform, scaling, filtering, modulation, symbol mapping, and/or in a wireless communication device for generating continuous time signals and the like that can be transmitted via the communication channel. Or any other operation.
參照圖5的實施方式500,針對該圖可以看出,輸入視頻訊號由視頻編碼器接收。在某些實施方式中,輸入視頻訊號由編碼單元或宏塊構成(和/或可以劃分為編碼單元(CU))。編碼單元或宏塊的大小可以改變並可以包括通常設置成正方形的多個畫素。在一個實施方式中,編碼單元或宏塊的大小為16×16畫素。然而, 通常要注意的是,宏塊可以具有任何所需的大小,諸如N×N畫素,其中N為整數。當然,雖然優選實施方式中採用了方形編碼單元或宏塊,但某些實現方式可以包括非方形編碼單元或宏塊。 Referring to the embodiment 500 of FIG. 5, it can be seen for the figure that the input video signal is received by the video encoder. In some embodiments, the input video signal is comprised of coding units or macroblocks (and/or can be divided into coding units (CUs)). The size of the coding unit or macroblock may vary and may include multiple pixels that are typically arranged in a square. In one embodiment, the size of the coding unit or macroblock is 16 x 16 pixels. however, It is generally noted that a macroblock can have any desired size, such as an NxN pixel, where N is an integer. Of course, although square coding units or macroblocks are employed in the preferred embodiment, some implementations may include non-square coding units or macroblocks.
輸入視頻訊號通常可以被稱為對應於原始幀(或圖片)圖像數據。例如,原始幀(或圖片)圖像數據可以進行處理以生成亮度和色度樣本。在某些實施方式中,宏塊中的亮度樣本組是一個特定佈置(例如16×16),色度樣本組是不同的特定佈置(例如8×8)。根據本文所描述的實施方式,視頻編碼器以逐塊為單位對這些樣本進行處理。 The input video signal can generally be referred to as corresponding to the original frame (or picture) image data. For example, raw frame (or picture) image data can be processed to generate luminance and chrominance samples. In some embodiments, the set of luma samples in a macroblock is a particular arrangement (e.g., 16 x 16), and the sets of chroma samples are different specific arrangements (e.g., 8 x 8). In accordance with embodiments described herein, a video encoder processes the samples on a block-by-block basis.
輸入視頻訊號然後進行模式選擇,輸入視頻訊號按照該模式可選擇性地進行幀內和/或幀間預測處理。一般來說,輸入視頻訊號沿壓縮通路進行壓縮。當無反饋地進行操作時(例如,不根據幀間預測,也不根據幀內預測),輸入視頻訊號經由壓縮通路提供來進行變換操作(例如,根據離散餘弦變換(DCT))。當然,在可替換實施方式中可以採用其他變換。在這種操作模式下,輸入視頻訊號本身就是所壓縮的訊號。壓縮通路可以利用人眼缺乏高頻率靈敏度來進行壓縮。 The video signal is input and then the mode is selected, and the input video signal is selectively subjected to intra and/or inter prediction processing according to the mode. Generally, the input video signal is compressed along the compression path. When operating without feedback (eg, not based on inter prediction, nor based on intra prediction), the input video signal is provided via a compression path for transform operations (eg, according to discrete cosine transform (DCT)). Of course, other transformations may be employed in alternative embodiments. In this mode of operation, the input video signal itself is the compressed signal. The compression path can be compressed using the human eye lacking high frequency sensitivity.
然而,通過選擇性地使用幀內或幀間預測視頻編碼,可以沿壓縮通路進行反饋。根據反饋或預測操作模式,壓縮通路對從當前宏塊減去當前宏塊預測值而導致的(相對較低能量)冗餘(例如差值)進行操作。根據在指定實例中採用哪種預測形式,生成當前宏塊和至少基於相同幀(或圖片)的一部分或至少基於至少一個其他幀(或圖片)的一部分的宏塊預測值之間的冗餘或差值。 However, feedback can be performed along the compression path by selectively using intra or inter prediction video coding. Based on the feedback or prediction mode of operation, the compression path operates on (relatively lower energy) redundancy (e.g., difference) resulting from subtracting the current macroblock prediction value from the current macroblock. Generating a current macroblock and redundancy between at least a portion of the same frame (or picture) or at least a portion of the macroblock prediction based on at least one other frame (or picture) based on which prediction form is employed in the specified instance Difference.
由此產生的修改視頻訊號然後沿壓縮通路進行變換操作。在一個實施方式中,離散餘弦變換(DCT)對一組視頻樣本(例如,亮度、色度、冗餘等)進行操作以計算預定數量的基模式中的每一個的係數值。例如,一個實施方式包括64個基函數(例如,對8×8樣本而言)。一般來說,不同實施方式可以採用不同數量的基 函數(例如,不同變換)。包括合適的選擇性加權的這些基函數的任意組合可以用於表示給定的一組視頻樣本。與進行變換操作的各個方式相關的額外詳情在與包括通過引用併入的如上所述的那些標準/草案標準的視頻編碼相關聯的技術文獻中進行描述。變換處理的輸出包括各自的係數值。將該輸出提供給量化器。 The resulting modified video signal is then transformed along the compression path. In one embodiment, a discrete cosine transform (DCT) operates on a set of video samples (eg, luminance, chrominance, redundancy, etc.) to calculate coefficient values for each of a predetermined number of base modes. For example, one embodiment includes 64 basis functions (eg, for an 8x8 sample). In general, different implementations can use different numbers of bases. Functions (for example, different transforms). Any combination of these basis functions including suitable selective weighting can be used to represent a given set of video samples. Additional details relating to the various ways in which the transform operation is performed are described in the technical literature associated with video coding including those standard/draft standards as described above incorporated by reference. The output of the transform process includes the respective coefficient values. This output is provided to the quantizer.
一般情況下,大多數圖像塊通常會產生係數(例如,根據離散餘弦變換(DCT)操作的實施方式中的DCT係數),使得大多數相關DCT係數的頻率較低。由於這個原因以及人眼對高頻視覺效果的靈敏度相對較差,量化器可以操作用於將大部分不大相關的係數轉換為零值。也就是說,可以根據量化過程來消除相對貢獻低於某個預定值(例如某個閾值)的那些係數。量化器還可以操作用於將重要係數轉換為比變化過程產生的值可以更加有效地進行編碼的值。例如,量化過程可以通過各自的係數除以整數值並丟棄任意餘數來進行操作。當對典型的編碼單元或宏塊進行操作時,這個過程通常會產生相對少量的非零係數,這些非零係數然後被傳輸至熵編碼器進行無損編碼並根據視頻編碼選擇幀內和/或幀間預測處理的反饋路徑進行使用。 In general, most image blocks typically produce coefficients (e.g., DCT coefficients in an embodiment based on discrete cosine transform (DCT) operation) such that the frequency of most of the associated DCT coefficients is lower. For this reason and the sensitivity of the human eye to high frequency visual effects is relatively poor, the quantizer can be operated to convert most of the less relevant coefficients to zero values. That is, those coefficients whose relative contributions are below a certain predetermined value (eg, a certain threshold) can be eliminated according to the quantization process. The quantizer can also operate to convert significant coefficients to values that can be encoded more efficiently than values produced by the variation process. For example, the quantization process can operate by dividing the respective coefficients by an integer value and discarding any remainder. When operating on a typical coding unit or macroblock, this process typically produces a relatively small number of non-zero coefficients that are then transmitted to an entropy encoder for lossless encoding and selection of intra and/or frames based on video coding. The feedback path of the inter prediction process is used.
熵編碼器根據無損壓縮編碼過程進行操作。相比之下,量化操作通常是有損的。熵編碼過程對量化過程提供的係數進行操作。那些係數可以表示各個特徵(例如,亮度、色度、冗餘等)。熵編碼器可以採用各種類型的編碼。例如,熵編碼器可以進行上下文自適應二進制算術編碼(CABAC)和/或上下文自適應可變長編碼(CAVLC)。例如,根據熵編碼方案的至少一部分,將數據轉換為(運行、級別)配對(run-level paring)(例如,將數據14,3,0,4,0,0,-3轉換為各個(運行、級別)對(0,14),(0,3),(1,4),(2,-3))。事先,編制將可變長度碼分配至值對的表,以便將相對較短長度編碼分配給相對常見的值對,並將相對較長長度編碼分配給相對少見的值對。 The entropy encoder operates in accordance with a lossless compression encoding process. In contrast, quantification operations are usually lossy. The entropy coding process operates on the coefficients provided by the quantization process. Those coefficients can represent individual features (eg, brightness, chrominance, redundancy, etc.). Entropy encoders can employ various types of encoding. For example, the entropy coder may perform context adaptive binary arithmetic coding (CABAC) and/or context adaptive variable length coding (CAVLC). For example, converting data to (run-level paring) according to at least a portion of the entropy encoding scheme (eg, converting data 14, 3, 0, 4, 0, 0, -3 to each (running) , level) pairs (0, 14), (0, 3), (1, 4), (2, -3)). In advance, a table is created that assigns variable length codes to value pairs to assign relatively short length codes to relatively common value pairs and relatively long length codes to relatively rare value pairs.
如讀者理解的一樣,逆量化和逆變換的操作分別對應於量化和變換的操作。例如,在DCT用於變換操作的實施方式中,逆DCT(IDCT)是逆變換操作中所採用的變換。 As understood by the reader, the operations of inverse quantization and inverse transform correspond to the operations of quantization and transform, respectively. For example, in embodiments where DCT is used for transform operations, inverse DCT (IDCT) is the transform employed in the inverse transform operation.
自適應回路濾波器(ALF)被實現來處理來自逆變換塊的輸出。在解碼圖片儲存在圖片緩衝器(有時稱為DPB、數位圖片緩衝器)之前,自適應回路濾波器(ALF)應用於解碼圖片。自適應回路濾波器(ALF)被實現來降低解碼圖像的編碼噪音,可以逐地選擇性地分別對亮度和色度進行過濾,不管自適應回路濾波器(ALF)是否以片級別或以塊級別應用。在自適應回路濾波器(ALF)的應用中可以使用二維(2D)有限脈衝響應(FIR)過濾。濾波器的係數可以在編碼器中逐片地進行設計,然後將此訊息傳遞給解碼器(例如,從包括視頻編碼器(或者稱為編碼器)的發射器通訊設備傳遞給包括視頻解碼器(或者稱為解碼器)的接收器通訊設備)。 An adaptive loop filter (ALF) is implemented to process the output from the inverse transform block. An adaptive loop filter (ALF) is applied to the decoded picture before the decoded picture is stored in a picture buffer (sometimes referred to as a DPB, digital picture buffer). An adaptive loop filter (ALF) is implemented to reduce the encoding noise of the decoded image, and the luminance and chrominance can be selectively filtered separately, regardless of whether the adaptive loop filter (ALF) is at the slice level or in blocks. Level application. Two-dimensional (2D) finite impulse response (FIR) filtering can be used in adaptive loop filter (ALF) applications. The coefficients of the filter can be designed piece by piece in the encoder and then passed to the decoder (eg, from a transmitter communication device including a video encoder (or encoder) to include a video decoder ( Or a receiver communication device called a decoder).
一個實施方式通過根據維納濾波設計生成係數來進行操作。另外,不管是否進行濾波處理並是否根據四叉樹結構將該決定傳遞給解碼器(例如,從包括視頻編碼器(或者稱為編碼器)的發射器通訊設備傳遞給包括視頻解碼器(或者稱為解碼器)的接收器通訊設備),都可以在編碼器中一塊一塊地應用,其中塊大小根據率失真優化來決定。要注意的是,利用2D濾波的實現方式可以根據編碼和解碼引入複雜度。例如,通過根據自適應回路濾波器(ALF)及實現方式來使用2D濾波,在編碼器(其在發射器通訊設備中實現)和解碼器(其在接收器通訊設備中實現)中可能存在一定的增加的複雜性。 One embodiment operates by generating coefficients according to a Wiener filter design. In addition, whether or not filtering processing is performed and whether the decision is passed to the decoder according to a quadtree structure (eg, from a transmitter communication device including a video encoder (or encoder) to a video decoder (or The receiver communication device for the decoder) can be applied one by one in the encoder, where the block size is determined by rate-distortion optimization. It should be noted that the implementation of 2D filtering can introduce complexity based on encoding and decoding. For example, by using 2D filtering in accordance with an adaptive loop filter (ALF) and implementation, there may be certain in the encoder (which is implemented in the transmitter communication device) and the decoder (which is implemented in the receiver communication device). Increased complexity.
在某些可選實施方式中,將來自去塊濾波器的輸出提供給被實現為處理來自逆變換塊的輸出的一個或多個其他環內濾波器(例如根據自適應回路濾波器(ALF)、採樣點自適應偏移(SAO)濾波器和/或任何其他濾波器類型實現)。例如,在解碼圖片儲存在 圖片緩衝器(有時稱為DPB、數位圖片緩衝器)之前,ALF應用於解碼圖片。ALF被實現來降低解碼圖像的編碼噪音,可以逐片地選擇地分別對亮度和色度進行過濾,不管ALF是否以片級別或以塊級別應用。在ALF的應用中可以使用二維(2D)有限脈衝響應(FIR)過濾。濾波器的係數可以在編碼器中逐片地進行設計,然後將此訊息傳遞給解碼器(例如,從包括視頻編碼器(或者稱為編碼器)的發射器通訊設備傳遞給包括視頻解碼器(或者稱為解碼器)的接收器通訊設備)。 In some alternative embodiments, the output from the deblocking filter is provided to one or more other in-loop filters implemented to process the output from the inverse transform block (eg, according to an adaptive loop filter (ALF) , Sample Point Adaptive Offset (SAO) filter and/or any other filter type implementation). For example, the decoded picture is stored in ALF is applied to the decoded picture before the picture buffer (sometimes called DPB, digital picture buffer). The ALF is implemented to reduce the encoding noise of the decoded image, and the luminance and chrominance can be separately filtered on a slice-by-slice basis, regardless of whether the ALF is applied at the slice level or at the block level. Two-dimensional (2D) finite impulse response (FIR) filtering can be used in ALF applications. The coefficients of the filter can be designed piece by piece in the encoder and then passed to the decoder (eg, from a transmitter communication device including a video encoder (or encoder) to include a video decoder ( Or a receiver communication device called a decoder).
一個實施方式根據維納濾波設計操作用於生成係數。另外,不管是否進行濾波處理並是否根據四叉樹結構將該決定傳遞給解碼器(例如,從包括視頻編碼器(或者稱為編碼器)的發射器通訊設備傳遞給包括視頻解碼器(或者稱為解碼器)的接收器通訊設備),都可以在編碼器中逐塊地應用,其中塊大小根據率失真優化來決定。要注意的是,利用2D濾波的實現方式可以根據編碼和解碼引入複雜度。例如,通過根據ALF及實現方式來使用2D濾波,在編碼器(其在發射器通訊設備中實現)和解碼器(其在接收器通訊設備中實現)中可能存在一定的增加的複雜性。 One embodiment operates to generate coefficients based on Wiener filtering design operations. In addition, whether or not filtering processing is performed and whether the decision is passed to the decoder according to a quadtree structure (eg, from a transmitter communication device including a video encoder (or encoder) to a video decoder (or The receiver communication device for the decoder) can be applied block by block in the encoder, where the block size is determined according to rate-distortion optimization. It should be noted that the implementation of 2D filtering can introduce complexity based on encoding and decoding. For example, by using 2D filtering in accordance with ALF and implementation, there may be some added complexity in the encoder (which is implemented in the transmitter communication device) and the decoder (which is implemented in the receiver communication device).
如針對其他實施方式所述,使用ALF可以提供根據此視頻處理的一系列改進之一,包括通過進行隨機量化去噪導致的峰值訊號噪音比(PSNR)改進客觀質量測量。另外,隨後編碼的視頻訊號的主觀質量可以通過照明補償來實現,根據ALF處理,照明補償可以根據進行偏移處理和縮放處理(例如,根據應用增益)來引入。 As described for other embodiments, the use of ALF can provide one of a series of improvements in accordance with this video processing, including improving the objective quality measurement by performing a peak signal to noise ratio (PSNR) resulting from random quantization denoising. Additionally, the subjective quality of the subsequently encoded video signal can be achieved by illumination compensation, which can be introduced according to ALF processing, based on performing offset processing and scaling processing (eg, based on application gain).
對一種類型的環內濾波器而言,使用自適應回路濾波器(ALF)可以提供根據此視頻處理的一系列改進之一,包括通過進行隨機量化去噪導致的峰值訊號噪音比(PSNR)改進客觀質量測量。另外,隨後編碼的視頻訊號的主觀質量可以通過照明補償來實現,根據自適應回路濾波器(ALF)處理,照明補償可以根據進行偏移 處理和縮放處理(例如,根據應用增益)來引入。 For one type of in-loop filter, the use of an adaptive loop filter (ALF) can provide one of a series of improvements based on this video processing, including improved peak-to-noise-to-noise ratio (PSNR) by random quantization denoising. Objective quality measurement. In addition, the subjective quality of the subsequently encoded video signal can be achieved by illumination compensation, which can be offset according to adaptive loop filter (ALF) processing. Processing and scaling processing (eg, based on application gain) is introduced.
圖片緩衝器(或者稱為數位圖片緩衝器或DPB)接收從ALF輸出的訊號;圖片緩衝器操作用於儲存當前幀(或圖片)和/或一個或多個其他幀(或圖片),諸如按照視頻編碼進行的根據幀內預測和/或幀間預測操作使用的幀(或圖片)。要注意的是,根據幀內預測,相對少量的儲存就足夠了,因為也許沒有必要儲存幀(或圖片)序列中的任何其他幀(或圖片)或當前幀(或圖片)。在根據視頻編碼進行幀間預測的時候,所儲存的訊息可以用來進行運動補償和/或運動估計。 A picture buffer (or digital picture buffer or DPB) receives the signal output from the ALF; the picture buffer operates to store the current frame (or picture) and/or one or more other frames (or pictures), such as A frame (or picture) used by video coding according to intra prediction and/or inter prediction operations. It is to be noted that, according to intra prediction, a relatively small amount of storage is sufficient, as it may not be necessary to store any other frames (or pictures) or current frames (or pictures) in the sequence of frames (or pictures). When inter-predicting based on video coding, the stored information can be used for motion compensation and/or motion estimation.
在一個可能的實施方式中,對於運動估計,將來自當前幀(或圖片)的相應的亮度樣本組(例如,16×16)與幀(或圖片)序列(例如根據幀間預測)中其他幀(或圖片)中的各個緩衝配對物進行比較。在一個可能的實現方式中,定位最匹配區域(例如預測參考)並產生向量偏移(例如運動向量)。在單個幀(或圖片)中,可以找到多個運動向量,但不是所有的運動向量都必須指向相同的方向。根據運動估計進行的一種或多種操作操作地生成一個或多個運動向量。 In one possible implementation, for motion estimation, a corresponding luma sample group (eg, 16×16) from a current frame (or picture) and other frames in a frame (or picture) sequence (eg, according to inter prediction) Each buffer counterpart in (or picture) is compared. In one possible implementation, the best matching region (eg, prediction reference) is located and a vector offset (eg, motion vector) is generated. Multiple motion vectors can be found in a single frame (or picture), but not all motion vectors must point in the same direction. One or more motion vectors are operatively generated based on one or more operations performed by the motion estimation.
運動補償操作地採用可以根據運動估計生成的一個或多個運動向量。識別並交付樣本的預測參考組以便從原始輸入視頻訊號中扣除,試圖希望產生相對(例如理想的多個)較低的能量冗餘。如果此類操作未導致產生較低的能量冗餘,就不必進行運動補償,變換操作可以僅對原始輸入視頻訊號進行操作,而不對冗餘進行操作(例如,根據將輸入視頻訊號直接提供給變換操作,以便不進行幀內預測,也不進行幀間預測的操作模式),或可以使用幀內預測並對幀內預測產生的冗餘進行變換操作。同樣地,如果運動估計和/或運動補償操作成功,運動向量還可以與相應的冗餘係數一起發送至熵編碼器,用於進行無損熵編碼。 Motion compensation operatively employs one or more motion vectors that can be generated from motion estimation. The predictive reference set of samples is identified and delivered for deduction from the original input video signal in an attempt to produce relatively (eg, ideally multiple) lower energy redundancy. If such an operation does not result in lower energy redundancy, motion compensation is not necessary, and the transform operation can operate only on the original input video signal without performing redundancy (eg, based on providing the input video signal directly to the transform) The operation is such that no intra prediction is performed, and the operation mode of inter prediction is not performed, or intra prediction is performed and the redundancy generated by the intra prediction is subjected to a transform operation. Likewise, if the motion estimation and/or motion compensation operation is successful, the motion vector can also be sent to the entropy encoder along with the corresponding redundancy coefficients for lossless entropy coding.
來自整個視頻編碼操作的輸出是輸出位元流。要注意的是, 輸出位元流當然可以根據生成連續時間訊號(其可以經由通訊通道傳輸)來進行一定處理。例如,某些實施方式在無線通訊系統中操作。在這種情況下,輸出位元流可以在無線通訊設備(其用於生成能夠經由通訊通道傳輸的連續時間訊號等)中進行適當的數位類比轉換、頻率變換、縮放、過濾、調製、符號映射和/或任何其他操作。 The output from the entire video encoding operation is the output bit stream. To be careful of, The output bit stream can of course be processed according to the generation of a continuous time signal (which can be transmitted via the communication channel). For example, some embodiments operate in a wireless communication system. In this case, the output bit stream can be subjected to appropriate digital analog conversion, frequency transform, scaling, filtering, modulation, symbol mapping in a wireless communication device (which is used to generate continuous time signals that can be transmitted via a communication channel, etc.) And/or any other operation.
參照圖6的實施方式600,針對該圖描述了一種視頻編碼器的可替換實施方式,該視頻編碼器進行預測、變化以及編碼處理以便產生壓縮的輸出位元流。這種視頻編碼器可以根據一個或多個視頻編碼協議、標準和/或推薦作法(諸如ISO/IEC 14496-10-MPEG-4的第10部分,AVC(高級視頻編碼)(或者稱為H.264/MPEG-4的第10部分或AVC(高級視頻編碼),ITU H.264/MPEG4-AVC)進行操作並與之兼容。 Referring to the embodiment 600 of FIG. 6, an alternative embodiment of a video encoder is described for the video encoder that performs prediction, variation, and encoding processes to produce a compressed output bitstream. Such a video encoder may be based on one or more video coding protocols, standards, and/or recommended practices (such as Part 10 of ISO/IEC 14496-10-MPEG-4, AVC (Advanced Video Coding) (or H. Part 10 of 264/MPEG-4 or AVC (Advanced Video Coding), ITU H.264/MPEG4-AVC) operates and is compatible.
要注意的是,對應視頻解碼器(諸如位於通訊通道另一端的設備內的視頻解碼器)操作用於進行解碼、逆變換及重建的互補處理以便產生相應的解碼視頻序列,該序列(理想地)表示輸入視頻訊號。 It is noted that a corresponding video decoder, such as a video decoder located within the device at the other end of the communication channel, operates complementary processing for decoding, inverse transform, and reconstruction to produce a corresponding decoded video sequence, which is ideally ) indicates the input video signal.
在將該圖與先前的圖進行比較時,同樣將提供給幀內預測塊的來自逆量化和逆變換(例如IDCT)塊的訊號路徑輸出提供給去塊濾波器。將來自去塊濾波器的輸出提供給被實現為處理來自逆變換塊的輸出的一個或多個其他環內濾波器(例如根據自適應回路濾波器(ALF)、採樣點自適應偏移(SAO)濾波器和/或任何其他濾波器類型實現)。例如,在一個可能的實施方式中,在解碼圖片儲存在圖片緩衝器(有時稱為DPB、數位圖片緩衝器)之前,SAO濾波器應用於解碼圖片。 When the map is compared to the previous graph, the signal path output from the inverse quantization and inverse transform (e.g., IDCT) blocks supplied to the intra prediction block is also supplied to the deblocking filter. Providing the output from the deblocking filter to one or more other in-loop filters implemented to process the output from the inverse transform block (eg, based on adaptive loop filter (ALF), sample point adaptive offset (SAO) ) Filters and/or any other filter type implementation). For example, in one possible implementation, the SAO filter is applied to the decoded picture before the decoded picture is stored in a picture buffer (sometimes referred to as a DPB, digital picture buffer).
就被實現來生成輸出位元流的任何視頻編碼器架構而言,要注意的是,這類架構可以在多種通訊設備中的任何一個中實現。輸出位元流可以經歷額外的處理(包括誤差校正碼(ECC)、前向 糾錯(FEC)等),從而生成其中具有額外的冗餘交易的修改輸出位元流。同樣,如關於此數位訊號可以理解,可以根據生成適用於或適於經由通訊通道進行傳輸的連續時間訊號進行任何適當的處理。也就是說,這樣的視頻編碼器架構可以在用於經由一個或多個通訊通道傳輸一個或多個訊號的通訊設備中實現。可以對由這種視頻編碼器架構生成的輸出位元流進行額外的處理,從而生成可以發射到通訊通道中的連續時間訊號。 With regard to any video encoder architecture that is implemented to generate an output bitstream, it is noted that such an architecture can be implemented in any of a variety of communication devices. The output bit stream can undergo additional processing (including error correction code (ECC), forward direction Error Correction (FEC), etc., to generate a modified output bit stream with additional redundant transactions therein. Again, as can be appreciated with respect to this digital signal, any suitable processing can be performed in accordance with the generation of continuous time signals suitable for or suitable for transmission via the communication channel. That is, such a video encoder architecture can be implemented in a communication device for transmitting one or more signals via one or more communication channels. Additional processing of the output bitstream generated by such a video encoder architecture can be performed to generate a continuous time signal that can be transmitted into the communication channel.
圖7是示出了幀內預測處理的實施方式700的示圖。針對該圖可以看出,視頻數據的當前塊(例如通常呈正方形並且通常包括N×N畫素)進行處理以便對其中的各個畫素進行估計。根據幀內預測採用位於當前塊的上方及左邊的事先編碼畫素。從某種角度來說,幀內預測方向可以被視為對應於從當前畫素延伸至位於當前畫素的上方或左邊的參考畫素的向量。根據H.264/AVC應用於編碼的幀內預測的詳情在上文通過引用併入的對應標準內進行了規定(例如,國際電信聯盟,ITU-T,TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU,H.264(03/2010),SERIES H:AUDIOVISUAL AND MULTIMEDIA SYSTEMS,Infrastructure of audiovisual services-Coding of moving video,Advanced video coding for generic audiovisual services,Recommendation ITU-T H.264(或稱為International Telecomm ISO/IEC 14496-10-MPEG-4的第10部分,AVC(高級視頻編碼),H.264/MPEG-4的第10部分或AVC(高級視頻編碼),ITU H.264/MPEG4-AVC或等效文獻)。 FIG. 7 is a diagram showing an embodiment 700 of intra prediction processing. As can be seen for this figure, the current block of video data (e.g., typically square and typically including NxN pixels) is processed to estimate individual pixels therein. Pre-coded pixels located above and to the left of the current block are used according to intra prediction. In some ways, the intra prediction direction can be considered to correspond to a vector that extends from the current pixel to a reference pixel located above or to the left of the current pixel. The details of the intra prediction applied to the encoding according to H.264/AVC are specified in the corresponding standards incorporated by reference above (for example, International Telecommunication Union, ITU-T, TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU, H.264 ( 03/2010), SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS, Infrastructure of audiovisual services-Coding of moving video, Advanced video coding for generic audiovisual services, Recommendation ITU-T H.264 (or International Telecomm ISO/IEC 14496-10 - Part 10 of MPEG-4, AVC (Advanced Video Coding), Part 10 of H.264/MPEG-4 or AVC (Advanced Video Coding), ITU H.264/MPEG4-AVC or equivalent).
冗餘(其是當前畫素和參考或預測畫素之間的差值)為經過編碼的冗餘。針對該圖可以看出,幀內預測利用常見幀(或圖片)內的畫素進行操作。當然要注意的是,給定畫素可以具有與其相關聯的各自不同的分量,且每個分量可能存在各自不同的樣本組。 Redundancy, which is the difference between the current pixel and the reference or predicted pixel, is coded redundancy. As can be seen from this figure, intra prediction operates using pixels within a common frame (or picture). It is of course to be noted that a given pixel may have its own distinct component associated with it, and each component may have a different set of samples.
圖8是示出了幀間預測處理的實施方式800的示圖。與幀內預測不同,幀間預測用於基於當前幀(或圖片)內的當前的畫素 組以及位於幀(或圖片)序列內的一個或多個其他幀(或圖片)內的一組或多組參考或預測畫素來識別運動向量(例如,幀間預測方向)。可以看出,運動向量從幀(或圖片)序列內的當前幀(或圖片)延伸至另一個幀(或圖片)。幀間預測可以使用子畫素插值,使得預測畫素值對應於參考幀或圖片中的多個畫素的函數。 FIG. 8 is a diagram showing an embodiment 800 of inter prediction processing. Unlike intra prediction, inter prediction is used based on the current pixel within the current frame (or picture). A group or sets of reference or predicted pixels within one or more other frames (or pictures) within a frame (or picture) sequence to identify motion vectors (eg, inter prediction directions). It can be seen that the motion vector extends from the current frame (or picture) within the frame (or picture) sequence to another frame (or picture). Inter prediction may use subpixel interpolation such that the predicted pixel value corresponds to a function of a plurality of pixels in a reference frame or picture.
可以根據幀間預測處理來計算冗餘,儘管這樣的冗餘不同於根據幀內預測處理計算的冗餘。根據幀間預測處理,每個畫素的冗餘再次對應於當前畫素和預測畫素值之間的差。然而,根據幀間預測處理,當前畫素和參考或預測畫素不位於相同幀(或圖片)內。儘管該圖示出了關於一個或多個先前的幀或圖片所採用的幀間預測,但同樣要注意的是,可替換實施方式可以利用對應於當前幀之前和/或之後的幀進行操作。例如,根據適當的緩衝和/或內存管理,可以對多個幀進行儲存。當對給定的幀進行操作時,可以根據在給定幀之前和/或之後的其他幀來生成參考。 Redundancy may be calculated according to inter prediction processing, although such redundancy is different from redundancy calculated according to intra prediction processing. According to the inter prediction process, the redundancy of each pixel again corresponds to the difference between the current pixel and the predicted pixel value. However, according to the inter prediction process, the current pixel and the reference or predicted pixel are not located in the same frame (or picture). Although the figure shows inter prediction with respect to one or more previous frames or pictures, it is also noted that alternative embodiments may operate with frames that correspond to before and/or after the current frame. For example, multiple frames can be stored according to appropriate buffering and/or memory management. When operating on a given frame, references can be generated from other frames before and/or after a given frame.
結合CU,基本單元可以用於預測劃分模式(即,預測單元或PU)。同樣要注意的是,僅為最後深度CU限定PU,且相應大小局限於CU的大小。 In conjunction with the CU, the base unit can be used to predict the partitioning mode (ie, prediction unit or PU). It is also noted that the PU is only limited for the last depth CU and the corresponding size is limited to the size of the CU.
圖9和圖10是分別示出了視頻解碼架構的各個實施方式900和1000的示圖。 9 and 10 are diagrams showing various embodiments 900 and 1000 of a video decoding architecture, respectively.
一般來說,這種視頻解碼架構對輸入位元流進行操作。當然要注意的是,這種輸入位元流可以根據通訊設備從通訊通道接收的訊號來生成。可以對從通訊通道接收的連續時間訊號進行各種操作,包括諸如可以適當根據生成輸入位元流的數位採樣、解調、縮放、濾波等。而且,可以實現一種或多種誤差校正碼(ECC)、前向糾錯(FEC)等的某些實施方式可以根據ECC、FEC等進行適當的解碼,從而生成輸入位元流。也就是說,在已經根據生成對應輸出位元流(例如,可以從發射器通訊設備或收發器通訊設備的發射器部分進行發射的輸出位元流)進行額外的冗餘的某些 實施方式中,可以根據生成輸入位元流來進行適當的處理。總體來說,這樣的視頻解碼架構遺憾的是要處理輸入位元流,從而盡可能緊密地並完全在理想情況下生成對應於原始輸入視頻訊號的輸出視頻訊號,用於輸出至一個或多個視頻顯示可用設備。 In general, this video decoding architecture operates on input bitstreams. Of course, it should be noted that this input bit stream can be generated based on the signal received by the communication device from the communication channel. Various operations can be performed on the continuous time signals received from the communication channel, including, for example, digital sampling, demodulation, scaling, filtering, etc., which can be appropriately generated from the input bit stream. Moreover, certain embodiments in which one or more error correction codes (ECC), forward error correction (FEC), etc., may be implemented, may be appropriately decoded in accordance with ECC, FEC, etc., to generate an input bit stream. That is, some additional redundancy has been performed in accordance with the generation of a corresponding output bit stream (eg, an output bit stream that can be transmitted from a transmitter communication device or a transmitter portion of a transceiver communication device) In an embodiment, appropriate processing can be performed based on generating an input bit stream. In general, such a video decoding architecture unfortunately handles the input bit stream to generate an output video signal corresponding to the original input video signal as closely and completely as possible, for output to one or more The video shows the available devices.
參照圖9的實施方式900,一般來說,諸如熵解碼器(例如,其可以根據CABAC、CAVLC等實現)的解碼器根據進行編碼(如在視頻編碼器架構中進行的)的互補處理來處理輸入位元流。輸入位元流可以被視為(盡可能緊密地並完全在理想情況下)由視頻編碼器架構生成的壓縮輸出位元流。當然,在實際應用中,在經由一個或多個通訊鏈路傳輸的訊號中可能已經遭受了一部分誤差。熵解碼器對輸入位元流進行處理並提取適當的係數,諸如DCT係數(例如,表示色度、亮度等訊息),並將係數提供給逆量化及逆變換塊。如果採用DCT變換,那麼逆量化及逆變換塊就可以被實現來進行逆DCT(IDCT)操作。隨後,A/D塊濾波器被實現來生成對應於輸出視頻訊號的各個幀和/或圖片。可以將這些幀和/或圖片提供給圖片緩衝器或數位圖片緩衝器(DPB),以便用來進行包括運動補償的其他操作。一般來說,這樣的運動補償操作可以被視為對應於與視頻編碼相關聯的幀間預測。同樣,還可以對從逆量化及逆變換塊輸出的訊號進行幀間預測。類似於視頻編碼,這種視頻解碼器架構可以被實現來進行模式選擇,根據對輸入位元流進行解碼,不通過幀內預測或幀間預測進行,通過幀間預測進行或通過幀內預測進行,從而生成輸出視頻訊號。 Referring to implementation 900 of FIG. 9, in general, a decoder such as an entropy decoder (eg, which may be implemented in accordance with CABAC, CAVLC, etc.) processes according to complementary processing of encoding (as performed in a video encoder architecture) Enter the bit stream. The input bit stream can be viewed (as closely and completely ideally as possible) of the compressed output bit stream generated by the video encoder architecture. Of course, in practical applications, some errors may have been experienced in the signals transmitted via one or more communication links. The entropy decoder processes the input bitstream and extracts appropriate coefficients, such as DCT coefficients (e.g., information representing chrominance, luminance, etc.), and provides the coefficients to the inverse quantization and inverse transform blocks. If a DCT transform is employed, the inverse quantization and inverse transform blocks can be implemented to perform an inverse DCT (IDCT) operation. The A/D block filter is then implemented to generate individual frames and/or pictures corresponding to the output video signal. These frames and/or pictures may be provided to a picture buffer or digital picture buffer (DPB) for use in performing other operations including motion compensation. In general, such motion compensation operations can be considered to correspond to inter predictions associated with video coding. Similarly, it is also possible to inter-predict the signals output from the inverse quantization and inverse transform blocks. Similar to video coding, such a video decoder architecture can be implemented for mode selection, based on decoding of input bitstreams, not by intra prediction or inter prediction, by inter prediction or by intra prediction. , thereby generating an output video signal.
參照圖10的實施方式1000,在某些可選實施方式中,諸如可以根據用於生成輸出位元流的視頻編碼來實現的一個或多個環內濾波器(例如,根據自適應回路濾波器(ALF)、採樣點自適應偏移(SAO)濾波器和/或任何其他濾波器類型實現),對應的一個或多個環內濾波器可以在視頻解碼器架構中實現。在一個實施方式中,在去塊濾波器之後適當實現一個或多個環內濾波器。 Referring to embodiment 1000 of FIG. 10, in some alternative implementations, such as one or more in-loop filters that may be implemented in accordance with video coding used to generate an output bitstream (eg, according to an adaptive loop filter) (ALF), Sample Point Adaptive Offset (SAO) filter and/or any other filter type implementation), the corresponding one or more in-loop filters can be implemented in a video decoder architecture. In one embodiment, one or more in-loop filters are suitably implemented after the deblocking filter.
根據某些可能的實施方式,在完成解碼圖片的去塊濾波處理之後(例如,根據在圖6中的其他環內濾波器內實現的SAO濾波),可以執行採樣點自適應偏移(SAO)處理。基於被定義為一個或多個完整最大編碼單元(LCU)的區域執行該處理。 According to some possible implementations, after completing the deblocking filtering process of the decoded picture (eg, according to the SAO filtering implemented in other in-loop filters in FIG. 6), sample point adaptive offset (SAO) may be performed. deal with. This processing is performed based on an area defined as one or more complete maximum coding units (LCUs).
圖11示出了頻帶偏移採樣點自適應偏移(SAO)濾波處理的實施方式1100。該圖示出了頻帶偏移SAO的概念。在應用每個偏移之後,由此產生的畫素被裁剪為有效的8位畫素範圍[0,255]。在該圖中,將偏移應用到四個連續、有源的帶;不修改剩餘的帶。當然,在其他實施方式中,此等偏移可以應用於非連續帶。 FIG. 11 illustrates an embodiment 1100 of a band offset sample point adaptive offset (SAO) filtering process. This figure shows the concept of the band offset SAO. After each offset is applied, the resulting pixels are cropped to a valid 8-bit pixel range [0, 255]. In this figure, the offset is applied to four consecutive, active bands; the remaining bands are not modified. Of course, in other embodiments, such offsets can be applied to discontinuous bands.
圖12示出了視頻編碼架構的可替換實施方式1200。在此實施方式1200中,任何一個或多個其他環內濾波器(例如,根據自適應回路濾波器(ALF)、採樣點自適應偏移(SAO)濾波器和/或任何其他濾波器類型實現的環內濾波器)可以實現為處理逆量化及逆變換塊(例如,在去塊濾波器之前)的輸出。換句話說,在此等實施方式中,在去塊處理之前可以應用一個或多個其他環內濾波器(例如,一個實施方式中的SAO濾波器)。在可替換實施方式中,在去塊處理之前可以實現此等環內濾波器(例如,根據自適應回路濾波器(ALF)、採樣點自適應偏移(SAO)濾波器和/或任何其他濾波器類型實現的環內濾波器)。然而,在去塊處理之前,本發明的各方面、實施方式和/或其等同物操作地應用此等環內濾波器(例如,根據自適應回路濾波器(ALF)、採樣點自適應偏移(SAO)濾波器和/或任何其他濾波器類型實現的環內濾波器),如圖12所示。 FIG. 12 illustrates an alternative embodiment 1200 of a video coding architecture. In this embodiment 1200, any one or more other in-loop filters (eg, implemented according to an adaptive loop filter (ALF), a sample point adaptive offset (SAO) filter, and/or any other filter type) The in-loop filter can be implemented to process the output of the inverse quantization and inverse transform blocks (eg, before the deblocking filter). In other words, in such embodiments, one or more other in-loop filters (e.g., SAO filters in one embodiment) may be applied prior to the deblocking process. In alternative embodiments, such in-loop filters may be implemented prior to deblocking processing (eg, according to adaptive loop filter (ALF), sample point adaptive offset (SAO) filters, and/or any other filtering In-loop filter implemented by the type of the device). However, prior to deblocking, aspects, embodiments, and/or their equivalents of the present invention operationally apply such in-loop filters (eg, based on adaptive loop filter (ALF), sample point adaptive offset (SAO) filter and/or any other filter type implemented in-loop filter), as shown in Figure 12.
根據有些實施方式,當SAO可操作且開啟時(例如,針對來自去塊處理的輸出實現此SAO的實施方式),可能會出現一些不合意的塊狀假影。在這些實施方式中,主要是因為兩個相鄰的LCU正在使用不同的帶偏移值。為了減輕這個問題,在去塊處理之前可以應用此SAO,及去塊處理可以用於在此等情況下減少任何不 合意且出現的塊狀假影。在這種情況下,去塊處理過程中使用的邊界強度及變量β及tC也由SAO參數決定。 According to some embodiments, when the SAO is operational and on (eg, implementations of this SAO for output from the deblocking process), some undesirable blockiness artifacts may occur. In these embodiments, it is mainly because two adjacent LCUs are using different band offset values. To alleviate this problem, this SAO can be applied before the deblocking process, and the deblocking process can be used to reduce any undesired and appearing blockiness artifacts in such cases. In this case, the boundary strengths and variables β and t C used in the deblocking process are also determined by the SAO parameters.
從某些角度來看,帶偏移SAO基本上可視為是校正濾波器(例如,某些實施方式中的直方圖校正濾波器)。基於強度值來分類畫素以產生畫素的分佈。例如,針對直方圖實施方式,基於強度值將畫素(例如,一個或多個最大編碼單元(LCU)的畫素)歸類為直方圖倉或“帶”。將整個畫素範圍(0-255)分成32個均勻的帶,並將特定偏移加到每個帶中的所有畫素上。編碼器從範圍[-7,7]選擇要應用的偏移。 From some perspectives, the band offset SAO can be considered substantially as a correction filter (e.g., a histogram correction filter in some embodiments). The pixels are classified based on the intensity values to produce a distribution of pixels. For example, for a histogram implementation, pixels (eg, pixels of one or more largest coding units (LCUs)) are classified as histogram bins or "bands" based on intensity values. The entire pixel range (0-255) is divided into 32 uniform bands, and a specific offset is added to all the pixels in each band. The encoder selects the offset to apply from the range [-7, 7].
儘管偏移可應用於全部32個帶,為了簡化帶偏移處理並減少開銷,可以利用任何LCU中的帶偏移SAO對縮減集(例如,僅4個連續的帶)進行實際修改。編碼器選擇四個連續的帶,偏移針對這四個連續的帶進行傳輸。不修改剩餘的28個帶(零偏移)。由於可能存在32個帶,因此具有非零偏移的第一個帶以位元流表示。band_position參數攜帶有該訊息。剩餘的三個有源帶可以由的(band_position+i)%32確定。注意此處的模運算,如果第一帶是29、30或31,剩餘的帶將繞回0。 Although the offset can be applied to all 32 bands, in order to simplify the band offset processing and reduce overhead, the reduced set (eg, only 4 consecutive bands) can be actually modified with the band offset SAO in any LCU. The encoder selects four consecutive bands for which the offset is transmitted. The remaining 28 bands (zero offset) are not modified. Since there may be 32 bands, the first band with a non-zero offset is represented by a bit stream. The band_position parameter carries this message. The remaining three active bands can be determined by (band_position+i)%32. Note the modulo operation here. If the first band is 29, 30 or 31, the remaining bands will wrap around 0.
圖13示出了採樣點自適應偏移(SAO)帶偏移模式下的傳輸頻帶偏移的指示(自適應和/或顯式信令)的各個實施方式1300。根據適應性地指明SAO頻帶偏移模式下的傳輸頻帶偏移的數量可以完成這樣的操作。例如,SAO頻帶偏移模式下的傳輸頻帶偏移的數量可能與LCU的尺寸相關(例如,使得SAO頻帶偏移模式下的傳輸頻帶偏移的數量可以是LCU尺寸的函數)。例如,如果LCU尺寸減小,傳輸頻帶的數量也減小。再如,4個傳輸頻帶偏移可以用於64×64LCU,3個傳輸頻帶偏移可以用於32×32LCU,2個傳輸頻帶偏移可以用於16×16LCU。一般情況下,根據LCU各自不同的尺寸,可以指出SAO頻帶偏移模式下的傳輸頻帶偏移的各自不同的數量。 FIG. 13 illustrates various embodiments 1300 of an indication (adaptive and/or explicit signaling) of a transmission band offset in a sample point adaptive offset (SAO) band offset mode. Such an operation can be accomplished in accordance with adaptively indicating the number of transmission band offsets in the SAO band offset mode. For example, the number of transmission band offsets in the SAO band offset mode may be related to the size of the LCU (eg, such that the number of transmission band offsets in the SAO band offset mode may be a function of the LCU size). For example, if the LCU size is reduced, the number of transmission bands is also reduced. As another example, four transmission band offsets can be used for a 64x64 LCU, three transmission band offsets can be used for a 32x32 LCU, and two transmission band offsets can be used for a 16x16 LCU. In general, depending on the respective sizes of the LCUs, the respective different numbers of transmission band offsets in the SAO band offset mode can be indicated.
每個LCU尺寸的傳輸頻帶偏移的數量也可以按照SPS(序列參數集)、PPS(圖片參數大小)、APS(自適應參數集)、宏塊條頭、LCU數據和/或使用其他部分顯式地以訊號通知。 The number of transmission band offsets per LCU size may also be in accordance with SPS (Sequence Parameter Set), PPS (Picture Parameter Size), APS (Adaptive Parameter Set), Macroblock Header, LCU Data, and/or other parts of the display. The signal is notified by the signal.
圖14示出了SAO頻帶偏移模式下的頻帶粒度的指示(自適應和/或顯式信令)的各個實施方式1400。根據適應性地指明SAO頻帶偏移模式下的粒度可以完成這樣的操作。 Figure 14 illustrates various embodiments 1400 of indication of frequency band granularity (adaptive and/or explicit signaling) in SAO band offset mode. Such an operation can be accomplished in accordance with adaptively indicating the granularity in the SAO band offset mode.
在某些實施方式中,將整個畫素範圍(0-255)分成32個均勻的頻帶。可以利用任何LCU中的頻帶偏移SAO對僅4個頻帶進行實際修改。編碼器(例如,發射器通訊設備)選擇四個連續的頻帶,偏移針對這四個連續的頻帶進行傳輸。不修改剩餘的28個頻帶(例如,零偏移)。在每個頻帶中,將特定偏移加到所有畫素上。 In some embodiments, the entire pixel range (0-255) is divided into 32 uniform frequency bands. The actual modification can be made to only 4 frequency bands using the band offset SAO in any LCU. An encoder (eg, a transmitter communication device) selects four consecutive frequency bands for which the offset is transmitted. The remaining 28 bands (eg, zero offset) are not modified. In each frequency band, a specific offset is added to all pixels.
由於可以改變LCU的尺寸(例如,64×64、32×32或16×16),頻帶的粒度可以是自適應的。例如,LCU的尺寸越小,粒度就越粗糙。再如,如果LCU尺寸為32×32,則可以將[0,255]均勻分為16個頻帶,每個頻帶涵蓋16個連續的強度值。一般來說,可以根據LCU各自不同的尺寸指明SAO頻帶偏移模式下的各自不同的頻帶粒度。 Since the size of the LCU can be changed (for example, 64x64, 32x32, or 16x16), the granularity of the frequency band can be adaptive. For example, the smaller the size of the LCU, the coarser the granularity. As another example, if the LCU size is 32 x 32, then [0, 255] can be evenly divided into 16 bands, each band covering 16 consecutive intensity values. In general, different frequency band sizes in the SAO band offset mode can be indicated according to different sizes of the LCUs.
每個LCU尺寸的頻帶粒度也可以按照SPS(序列參數集)、PPS(圖片參數大小)、APS(自適應參數集)、宏塊條頭、LCU數據和/或使用其他部分清楚地以訊號通知。 The bandwidth of each LCU size can also be clearly signaled according to SPS (Sequence Parameter Set), PPS (Picture Parameter Size), APS (Adaptive Parameter Set), Macroblock Header, LCU Data and/or other parts. .
圖15示出了隱式頻帶指數信令的實施方式1500。例如,可以基於當前LCU的畫素值推斷出(例如,基於LCU的分析確定,推斷地確定,等等)此訊息,而沒有明確地以訊號通知頻帶指數。例如,通過生成LCU的畫素值直方圖,頻帶偏移可以用於畫素的數量占主導地位的頻帶。此等頻帶指數不一定是連續的(例如,頻帶指數可以使得具有不連續的分佈,以便至少兩個連續的頻帶指數可以間隔至少一個頻帶指數值,換句話說,頻帶指數彼此不 一定是連續的)。 FIG. 15 illustrates an implementation 1500 of implicit band index signaling. For example, this message can be inferred based on the pixel values of the current LCU (eg, LCU-based analysis determination, inferential determination, etc.) without explicitly signaling the band index. For example, by generating a histogram of the pixel values of the LCU, the band offset can be used for the frequency band in which the number of pixels is dominant. These band indices are not necessarily continuous (eg, the band index may be such that there is a discontinuous distribution such that at least two consecutive band indices may be separated by at least one band index value, in other words, the band indices are not mutually exclusive Must be continuous).
圖16示出了隱式頻帶指數信令的可替換實施方式1600。在示出了極簡化實施方式的極簡化圖中,存在只具有兩個灰階的LCU。直方圖(例如,如讀者所理解的描述畫素分佈的可能的一種方式)說明50%的畫素具有灰階25,50%的畫素具有灰階205。因此,兩個頻帶偏移就足夠,代替了原來的四個。 Figure 16 illustrates an alternative embodiment 1600 of implicit band index signaling. In an extremely simplified diagram showing a very simplified implementation, there is an LCU with only two gray levels. A histogram (eg, a possible way of describing the pixel distribution as understood by the reader) states that 50% of the pixels have a gray level of 25, and 50% of the pixels have a gray level 205. Therefore, two band offsets are sufficient instead of the original four.
圖17示出了頻帶偏移編碼的實施方式1700。在頻帶偏移模式下,由於sao_band_position表示具有非零偏移的頻帶偏移的開始,因此第一偏移值sao_offset[cldx][saoDepth][x0][y0][0]必須為非零(例如,在某些情況下,可能的最小的值可以是取1的值)。所以,而不直接編碼sao_offset[cldx][saoDepth][x0][y0][0],bs(sao_offset[cldx][saoDepth][x0][y0][0])1及sao_offset[cldx][saoDepth][x0][y0][0]的符號位可以單獨編碼,其中abs是計算絕對值的函數。 FIG. 17 illustrates an embodiment 1700 of band offset coding. In the band offset mode, since sao_band_position represents the start of the band offset with a non-zero offset, the first offset value sao_offset[cldx][saoDepth][x0][y0][0] must be non-zero (eg In some cases, the smallest possible value may be a value of 1). Therefore, instead of directly encoding sao_offset[cldx][saoDepth][x0][y0][0],bs(sao_offset[cldx][saoDepth][x0][y0][0])1 and sao_offset[cldx][saoDepth The sign bits of [x0][y0][0] can be encoded separately, where abs is a function for calculating absolute values.
圖18及圖19示出了一個或多個設備(例如,通訊設備、接收器和/或解碼器設備、發射器和/或編碼器設備等)的操作方法的各個實施方式。 18 and 19 illustrate various embodiments of methods of operation of one or more devices (eg, communication devices, receivers and/or decoder devices, transmitters, and/or encoder devices, etc.).
參照圖18的方法1800,方法1800首先通過通訊設備的輸入端從至少一個額外通訊設備接收視頻訊號及多個頻帶偏移,如方框1810所示。 Referring to method 1800 of FIG. 18, method 1800 first receives a video signal and a plurality of band offsets from at least one additional communication device via an input of the communication device, as indicated by block 1810.
方法1800繼續分析與至少一個最大編碼單元(LCU)(該最大編碼單元與視頻訊號相關聯)相關聯的多個畫素以識別用於識別多個頻帶指數的畫素值分佈,如方框1820所示。 Method 1800 continues by analyzing a plurality of pixels associated with at least one maximum coding unit (LCU) (which is associated with a video signal) to identify a pixel value distribution for identifying a plurality of frequency band indices, such as block 1820 Shown.
方法1800然後根據視頻訊號或基於此的訊號的濾波處理將多個頻帶偏移應用到多個頻帶指數,如方框1830所示。 Method 1800 then applies a plurality of band offsets to the plurality of band indices based on the video signal or filtering process based on the signals, as indicated by block 1830.
參照圖19的方法1900,該方法1900首先通過通訊設備的輸入端從至少一個額外通訊設備接收視頻訊號及多個頻帶偏移,如方框1910所示。 Referring to method 1900 of FIG. 19, method 1900 first receives a video signal and a plurality of band offsets from at least one additional communication device via an input of the communication device, as indicated by block 1910.
方法1900繼續分析與至少一個最大編碼單元(LCU)(該最大編碼單元與視頻訊號相關聯)相關聯的多個畫素以識別用於識別多個頻帶指數的畫素值分佈,如方框1920所示。 The method 1900 continues by analyzing a plurality of pixels associated with at least one maximum coding unit (LCU) (which is associated with a video signal) to identify a pixel value distribution for identifying a plurality of frequency band indices, such as block 1920. Shown.
方法1900繼續對視頻訊號或基於此的訊號執行採樣點自適應偏移(SAO)濾波處理以生成第一濾波訊號,使得SAO濾波處理包括將多個頻帶偏移應用到多個頻帶指數,如方框1930所示。 The method 1900 continues with performing a sample point adaptive offset (SAO) filtering process on the video signal or the signal based thereon to generate a first filtered signal, such that the SAO filtering process includes applying a plurality of band offsets to the plurality of band indices, such as Block 1930 is shown.
方法1900繼續對第一濾波訊號執行去塊濾波處理以生成第二濾波訊號,如方框1940所示。 The method 1900 continues with performing a deblocking filtering process on the first filtered signal to generate a second filtered signal, as indicated by block 1940.
同樣要注意的是,諸如利用其中實現的基頻處理模組和/或處理模組和/或其中的其他部件,可以在多種通訊設備中的任意一種中進行文中針對各個方法所述的各操作和功能。例如,此基頻處理模組和/或處理模組可以生成此等訊號並執行如本文所述的此等操作、處理等,還可以執行如本文所述的各操作和分析,或如本文所述的任何其他操作和功能等,或各自的等同物。 It should also be noted that the operations described herein for each method may be performed in any of a variety of communication devices, such as with the baseband processing module and/or processing module implemented therein and/or other components therein. and function. For example, the baseband processing module and/or processing module can generate such signals and perform such operations, processes, etc. as described herein, and can also perform various operations and analyses as described herein, or as herein Any other operations, functions, etc., or equivalents thereof.
在某些實施方式中,此基頻處理模組和/或處理模組(其可以在相同設備或不同設備中實現)可以執行根據本發明的各方面的此處理、操作等,和/或如本文所述的任何其他操作和功能等,或各自的等同物。在某些實施方式中,此處理通過第一設備中的第一處理模組以及第二設備中的第二處理模組協作地執行。在其他實施方式中,此處理、操作等全部由給定的一個設備中的基頻處理模組和/或處理模組執行。在甚至其他實施方式中,使用單個設備中的至少第一處理模組及第二處理模組執行此處理、操作等。 In some embodiments, the baseband processing module and/or processing module (which may be implemented in the same device or in different devices) may perform such processing, operations, etc., in accordance with aspects of the present invention, and/or as Any other operations and functions, etc., or equivalents thereof, as described herein. In some embodiments, the processing is performed cooperatively by the first processing module in the first device and the second processing module in the second device. In other embodiments, such processing, operations, and the like are all performed by a baseband processing module and/or processing module in a given device. In even other embodiments, the processing, operations, etc. are performed using at least a first processing module and a second processing module of a single device.
同樣,如本文所使用的術語“基本上”和“近似地”為其對應術語提供行業接收的容差和/或物品間的相關性。行業接收的容差的範圍小於1%-50%並對應於(但不限於)分量值、集成電路工藝變量、溫度變量、上升和下降時間和/或熱噪音。物品之間的相關性的差別為幾個百分點至數量級。如本文同樣所使用的術語“可操作地耦接至”、“耦接至”和/或“耦接”包括物品之間直 接耦接和/或物品之間通過居間物品間接耦接(例如,物品包括(但不限於)部件、元件、電路和/或模組),其中,對間接耦接而言,居間物品不修改訊號訊息但可以調節其電流電平、電壓電平和/或功率電平。如本文進一步所使用的推測耦接(即,當一個元件通過推斷與另一個元件耦接)包括以與“耦接至”相同的方式在兩個物品之間直接和間接耦接。如本文更進一步所使用的術語“可操作用於”或“可操作地耦接至”表明物品包括一種或多種電源連接、輸入、輸出等以便在啟動時執行一個或多個對應功能且可以進一步包括與一個或多個其他物品的推測耦接。如本文更進一步所使用的術語“與…相關聯”包括另一物品中嵌入的獨立物品和/或一個物品的直接和/或間接耦接。如本文所使用的術語“與…媲美”表明兩個或兩個以上物品、訊號等之間的比較提供所需關係。例如,當所述關係為訊號1比訊號2的數量級大時,當訊號1的數量級比訊號2的數量級大時或當訊號2的數量級比訊號1的數量級小時,可以實現有利比較。 Likewise, the terms "substantially" and "approximately" as used herein are used to provide industry-received tolerances and/or inter-item correlations for their corresponding terms. Industry-accepted tolerances range from less than 1% to 50% and correspond to, but are not limited to, component values, integrated circuit process variables, temperature variables, rise and fall times, and/or thermal noise. The difference in correlation between items is a few percentage points to the order of magnitude. The terms "operably coupled to", "coupled to", and/or "coupled" as used herein are intended to include the The couplings and/or items are indirectly coupled by intervening items (eg, items include, but are not limited to, components, components, circuits, and/or modules), wherein for indirect coupling, the intervening items are not modified Signal messages can be adjusted for their current level, voltage level and/or power level. Predictive coupling as used further herein (ie, when one element is inferred to be coupled to another element) includes direct and indirect coupling between two items in the same manner as "coupled to." The term "operable for" or "operably coupled to" as used further herein indicates that the item includes one or more power connections, inputs, outputs, etc. to perform one or more corresponding functions upon startup and may further Includes speculation coupling with one or more other items. The term "associated with", as used further herein, includes direct and/or indirect coupling of separate items and/or an item embedded in another item. The term "comparable to" as used herein means that a comparison between two or more items, signals, etc., provides the desired relationship. For example, when the relationship is that the signal 1 is greater than the magnitude of the signal 2, an advantageous comparison can be achieved when the magnitude of the signal 1 is greater than the magnitude of the signal 2 or when the magnitude of the signal 2 is less than the magnitude of the signal 1.
如本文同樣所使用的術語“處理模組”、“模組”、“處理電路”和/或“處理單元”(例如,包括可以操作、實現和/或用於編碼、解碼、基頻處理的各種模組和/或電路等)可以是單個處理設備或多個處理設備。這種處理設備可以是微處理器、微控制器、數位訊號處理器、微電腦、中央處理器、現場可編程門陣列、可編程邏輯設備、狀態機、邏輯電路、類比電路、數位電路和/或基於電路的硬編碼和/或操作指令操作訊號(類比和/或數位)的任何設備。處理模組、模組、處理電路和/或處理單元可以具有相關聯的儲存器和/或集成儲存元件,該儲存器和/或集成儲存元件可以是單個儲存設備、多個儲存設備和/或處理模組的嵌入電路、模組、處理電路和/或處理單元。這種儲存設備可以是只讀儲存器(ROM)、隨機存取儲存器(RAM)、易失性儲存器、非易失性儲存器、靜態儲存器、動態儲存器、閃存、高速緩衝儲存器和/或儲 存數位訊息的任意設備。要注意的是,如果處理模組、模組、處理電路和/或處理單元包括一個以上的處理設備,那麼處理設備就可以中心定位(例如,通過有線和/或無線總線結構直接耦接在一起)或可以分布式定位(例如,通過局域網和/或廣域網進行的間接耦接雲端計算)。進一步要注意的是,如果處理模組、模組、處理電路和/或處理單元通過狀態機、類比電路、數位電路和/或邏輯電路實現一種或多種功能,可以將儲存有相應操作指令的儲存器和/或儲存器元件嵌在包括狀態機、類比電路、數位電路和/或邏輯電路的電路內或外部。進一步要注意的是,儲存器元件可以儲存,處理模組、模組、處理電路和/或處理單元執行對應於一個或多個圖中示出的至少一部分步驟和/或功能的硬編碼和/或操作指令。製品中可以包括有這種儲存器設備或儲存器元件。 The terms "processing module," "module," "processing circuit," and/or "processing unit" as used herein (eg, include operations, implementations, and/or for encoding, decoding, and baseband processing). Various modules and/or circuits, etc.) may be a single processing device or multiple processing devices. Such processing devices may be microprocessors, microcontrollers, digital signal processors, microcomputers, central processing units, field programmable gate arrays, programmable logic devices, state machines, logic circuits, analog circuits, digital circuits, and/or Any device that operates on signals (analog and/or digits) based on hard-coded and/or operational instructions of the circuit. The processing module, module, processing circuit and/or processing unit may have associated storage and/or integrated storage elements, which may be a single storage device, multiple storage devices, and/or An embedded circuit, module, processing circuit, and/or processing unit of the processing module. Such storage devices may be read only memory (ROM), random access memory (RAM), volatile storage, nonvolatile storage, static storage, dynamic storage, flash memory, cache storage. And / or storage Any device that stores a digital message. It should be noted that if the processing module, module, processing circuit, and/or processing unit includes more than one processing device, the processing device can be centrally located (eg, directly coupled together by a wired and/or wireless bus structure) Or can be distributedly located (eg, indirectly coupled to cloud computing over a local area network and/or wide area network). It is further noted that if the processing module, module, processing circuit, and/or processing unit implements one or more functions through a state machine, analog circuit, digital circuit, and/or logic circuit, the storage storing the corresponding operational command may be stored. The memory and/or memory elements are embedded within or external to circuitry including state machines, analog circuits, digital circuits, and/or logic circuits. It is further noted that the storage elements can be stored, and the processing modules, modules, processing circuits, and/or processing units perform hard coding and/or corresponding to at least a portion of the steps and/or functions illustrated in one or more of the figures. Or operation instructions. Such a reservoir device or reservoir element can be included in the article.
上文在示出了指定函數的性能及其關係的方法步驟的幫助下已對本發明進行了描述。為了便於描述,本文任意限定了這些功能構建塊和方法步驟的界限和順序。只要適當執行指定功能和關係,就可以限定替代界限和順序。任何替代界限和順序都在要求保護的本發明的範圍和精神範圍內。此外,為了便於描述,任意限定了這些功能構建塊的界限。只要適當執行某些重要功能,就可以限定替代界限。類似地,本文也任意限定了流程框圖以便示出某些重要功能。從所用的程度來看,另外規定了流程框圖界限和順序,並仍然執行某些重要功能。功能構建塊和流程圖塊和順序的替代定義在要求保護的本發明的範圍和精神範圍內。本領域的普通技術人員還將明白,本文中的功能構建塊及其他說明性塊、模組和部件可以被實現為利用離散部件、專用集成電路、執行適當軟件等的處理器或其任意組合來示出。 The invention has been described above with the aid of method steps showing the performance of the specified functions and their relationships. For convenience of description, the boundaries and order of these functional building blocks and method steps are arbitrarily defined herein. Alternative boundaries and sequences can be defined as long as the specified functions and relationships are properly performed. Any alternative limits and sequences are within the scope and spirit of the invention as claimed. Moreover, the boundaries of these functional building blocks are arbitrarily defined for ease of description. As long as some important functions are properly performed, the alternative boundaries can be defined. Similarly, the flow block diagram is also arbitrarily defined herein to illustrate certain important functions. From the point of view of the use, the boundaries and sequence of the process block diagram are additionally specified, and some important functions are still performed. Alternative definitions of functional building blocks and flowchart blocks and sequences are within the scope and spirit of the claimed invention. Those of ordinary skill in the art will also appreciate that the functional building blocks and other illustrative blocks, modules, and components herein can be implemented as discrete components, application specific integrated circuits, processors executing appropriate software, etc., or any combination thereof. show.
可能已經至少部分地針對一個或多個實施方式對本發明進行了描述。本發明的實施方式在本文中用來說明本發明,其一方面、其特點、其概念和/或其實例。體現本發明的裝置、製品、機器和/ 或工藝的物理實施方式可以包括參照文中所討論的一個或多個實施方式描述的一個或多個方面、特點、概念、實例等。此外,從圖到圖,實施方式可以併入可以使用相同或不同參考編號的相同或類似命名的功能、步驟、模組等,正因如此,所述功能、步驟、模組等可以是相同或類似的功能、步驟、模組等或可以是不同的功能、步驟、模組等。 The invention may have been described, at least in part, in one or more embodiments. Embodiments of the invention are used herein to describe the invention, its aspects, its features, its concepts, and/or examples thereof. Devices, articles, machines and/or embodying the invention Or physical implementations of the process may include one or more aspects, features, concepts, examples, etc. described with reference to one or more embodiments discussed herein. In addition, from the figures to the figures, embodiments may incorporate the same or similarly named functions, steps, modules, etc., which may use the same or different reference numbers, and as such, the functions, steps, modules, etc. may be the same or Similar functions, steps, modules, etc. may be different functions, steps, modules, and the like.
除非從反面特別說明,傳遞給本文中所顯示的任何一個圖中的元件的訊號、來自該元件的訊號和/或元件之間的訊號可以是類比訊號或數位訊號、連續時間訊號或離散時間訊號以及單端訊號或差分訊號。例如,如果訊號路徑被顯示為單端路徑,則還表示差分訊號路徑。類似地,如果訊號路徑被顯示為差分路徑,則還表示單端訊號路徑。儘管本文對一個或多個特定架構進行了描述,但同樣可以實現其他架構,其他架構使用如本領域的普通技術人員認可的一個或多個數據總線(未明確示出)、元件之間的直接連接和/或其他元件之間的間接耦接。 Unless specifically stated to the contrary, the signals transmitted to the components of any of the figures shown herein, signals from the components, and/or signals between the components may be analog or digital signals, continuous time signals, or discrete time signals. And single-ended signals or differential signals. For example, if the signal path is displayed as a single-ended path, it also represents a differential signal path. Similarly, if the signal path is displayed as a differential path, it also represents a single-ended signal path. Although one or more specific architectures are described herein, other architectures may be implemented that use one or more data buses (not explicitly shown) as recognized by one of ordinary skill in the art, directly between components. Indirect coupling between connections and/or other components.
術語“模組”用於對本發明的各個實施方式進行描述。模組包括經由硬件實現以便執行一個或多個模組功能,諸如處理一個或多個輸入訊號以產生一個或多個輸出訊號的功能塊。實現模組的硬件本身可以結合軟件和/或固件進行操作。如本文所使用的模組可以包含一個或多個子模組,每個子模組本身就是模組。 The term "module" is used to describe various embodiments of the invention. The module includes functional blocks implemented via hardware to perform one or more module functions, such as processing one or more input signals to produce one or more output signals. The hardware that implements the module itself can operate in conjunction with software and/or firmware. A module as used herein may include one or more sub-modules, each of which is itself a module.
儘管本文明確描述了本發明的各個功能和特點的特定組合,但這些特點和功能的其他組合同樣是可能的。本發明不受本文所公開的特定實例的限制並明確結合其他組合。 Although specific combinations of various functions and features of the present invention are explicitly described herein, other combinations of these features and functions are equally possible. The present invention is not limited by the specific examples disclosed herein and is explicitly combined with other combinations.
200‧‧‧通訊系統 200‧‧‧Communication system
201‧‧‧訊息位元 201‧‧‧Message Bits
202‧‧‧編碼信息位元 202‧‧‧ encoded information bits
203‧‧‧離散值調製符號序列 203‧‧‧Discrete value modulation symbol sequence
204‧‧‧連續時間傳輸訊號 204‧‧‧Continuous time transmission signal
205‧‧‧過濾的連續時間傳輸訊號 205‧‧‧Filtered continuous time transmission signal
206‧‧‧連續時間接收訊號 206‧‧‧Continuous time to receive signals
207‧‧‧過濾的連續時間接收訊號 207‧‧‧Filtered continuous time reception signal
208‧‧‧離散時間接收訊號 208‧‧‧Discrete time receiving signal
209‧‧‧度量值 209‧‧‧ metrics
210‧‧‧最佳估計離散值調製符號和其中編碼的訊息位元 210‧‧‧Best estimate of discrete-valued modulation symbols and the information bits encoded therein
220‧‧‧編碼器和符號映射器 220‧‧‧Encoder and symbol mapper
222‧‧‧編碼器 222‧‧‧Encoder
224‧‧‧符號映射器 224‧‧‧symbol mapper
230‧‧‧發射驅動器 230‧‧‧Transmission driver
232‧‧‧數位類比轉換器(DAC) 232‧‧‧Digital Analog Converter (DAC)
234‧‧‧發射濾波器 234‧‧‧Transmission filter
260‧‧‧類比前端(AFE) 260‧‧‧ analog front end (AFE)
262‧‧‧接收濾波器 262‧‧‧ Receive Filter
264‧‧‧類比數位轉換器(ADC) 264‧‧‧ Analog Digital Converter (ADC)
270‧‧‧度量生成器 270‧‧‧Metric Generator
280‧‧‧解碼器 280‧‧‧Decoder
280a‧‧‧處理模組 280a‧‧‧Processing Module
280b‧‧‧處理模組 280b‧‧‧Processing module
297‧‧‧發射器 297‧‧‧transmitter
298‧‧‧接收器 298‧‧‧ Receiver
299‧‧‧通訊通道 299‧‧‧Communication channel
Claims (9)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261597683P | 2012-02-10 | 2012-02-10 | |
US201261598326P | 2012-02-13 | 2012-02-13 | |
US201261603190P | 2012-02-24 | 2012-02-24 | |
US13/623,765 US9380320B2 (en) | 2012-02-10 | 2012-09-20 | Frequency domain sample adaptive offset (SAO) |
US13/758,169 US9282328B2 (en) | 2012-02-10 | 2013-02-04 | Sample adaptive offset (SAO) in accordance with video coding |
Publications (2)
Publication Number | Publication Date |
---|---|
TW201338556A TW201338556A (en) | 2013-09-16 |
TWI524739B true TWI524739B (en) | 2016-03-01 |
Family
ID=48928077
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW102105283A TWI524739B (en) | 2012-02-10 | 2013-02-08 | Sample adaptive offset (sao) in accordance with video coding |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN103248888A (en) |
TW (1) | TWI524739B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103442230B (en) * | 2013-08-27 | 2018-04-27 | 复旦大学 | The Lagrange multiplier obtaining value method of SAO mode adjudgings suitable for the encoder of HEVC standard |
CN103647975B (en) * | 2013-12-02 | 2017-01-11 | 国家广播电影电视总局广播科学研究院 | Improved sample adaptive offset filtering method based on histogram analysis |
CN111263149B (en) | 2013-12-12 | 2021-10-26 | 三星电子株式会社 | Video encoding method and apparatus, and video decoding method and apparatus |
FR3018417B1 (en) * | 2014-03-04 | 2017-07-28 | Sagemcom Broadband Sas | METHOD FOR MODIFYING A BINARY VIDEO STREAM |
CN110177283B (en) | 2014-04-04 | 2021-08-03 | 北京三星通信技术研究有限公司 | Method and device for processing pixel identification |
CN105898566A (en) * | 2016-04-29 | 2016-08-24 | 乐视控股(北京)有限公司 | Video content presenting switching method and device, and mobile play terminal |
JP7147788B2 (en) | 2018-01-05 | 2022-10-05 | 株式会社ソシオネクスト | Encoding method, decoding method, encoding device, decoding device, encoding program and decoding program |
US11683487B2 (en) * | 2019-03-26 | 2023-06-20 | Qualcomm Incorporated | Block-based adaptive loop filter (ALF) with adaptive parameter set (APS) in video coding |
WO2020260668A1 (en) | 2019-06-28 | 2020-12-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Video decoder, video encoder, methods for encoding and decoding video signals and computer program adjusting one or more denoising operations |
JP7200074B2 (en) * | 2019-09-20 | 2023-01-06 | Kddi株式会社 | Image decoding device, image decoding method and program |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8660174B2 (en) * | 2010-06-15 | 2014-02-25 | Mediatek Inc. | Apparatus and method of adaptive offset for video coding |
-
2013
- 2013-02-08 TW TW102105283A patent/TWI524739B/en not_active IP Right Cessation
- 2013-02-18 CN CN2013100528431A patent/CN103248888A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
TW201338556A (en) | 2013-09-16 |
CN103248888A (en) | 2013-08-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11800086B2 (en) | Sample adaptive offset (SAO) in accordance with video coding | |
TWI524739B (en) | Sample adaptive offset (sao) in accordance with video coding | |
TWI542219B (en) | Frequency domain sample adaptive offset (sao) | |
TWI504221B (en) | Signaling of prediction size unit in accordance with video coding | |
US9432700B2 (en) | Adaptive loop filtering in accordance with video coding | |
US9693064B2 (en) | Video coding infrastructure using adaptive prediction complexity reduction | |
US20130343447A1 (en) | Adaptive loop filter (ALF) padding in accordance with video coding | |
TWI538478B (en) | Video coding sub-block sizing based on infrastructure capabilities and current conditions | |
TWI493885B (en) | Unified binarization for cabac/cavlc entropy coding | |
US11019351B2 (en) | Video coding with trade-off between frame rate and chroma fidelity | |
US20130235926A1 (en) | Memory efficient video parameter processing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM4A | Annulment or lapse of patent due to non-payment of fees |