WO2019120532A1 - Procédé et appareil de commande de débit binaire adaptatif dans un réseau de communication sans fil - Google Patents
Procédé et appareil de commande de débit binaire adaptatif dans un réseau de communication sans fil Download PDFInfo
- Publication number
- WO2019120532A1 WO2019120532A1 PCT/EP2017/083993 EP2017083993W WO2019120532A1 WO 2019120532 A1 WO2019120532 A1 WO 2019120532A1 EP 2017083993 W EP2017083993 W EP 2017083993W WO 2019120532 A1 WO2019120532 A1 WO 2019120532A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- version
- ran
- client device
- segment
- delivering
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 238000004891 communication Methods 0.000 title claims description 30
- 230000003044 adaptive effect Effects 0.000 title claims description 8
- 230000005540 biological transmission Effects 0.000 claims abstract description 63
- 238000012544 monitoring process Methods 0.000 claims abstract description 19
- 238000012545 processing Methods 0.000 claims description 36
- 230000007246 mechanism Effects 0.000 claims description 5
- GGWBHVILAJZWKJ-UHFFFAOYSA-N dimethyl-[[5-[2-[[1-(methylamino)-2-nitroethenyl]amino]ethylsulfanylmethyl]furan-2-yl]methyl]azanium;chloride Chemical compound Cl.[O-][N+](=O)C=C(NC)NCCSCC1=CC=C(CN(C)C)O1 GGWBHVILAJZWKJ-UHFFFAOYSA-N 0.000 claims 19
- 230000007423 decrease Effects 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 11
- 239000000872 buffer Substances 0.000 description 9
- 238000004590 computer program Methods 0.000 description 4
- 230000009977 dual effect Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000015654 memory Effects 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 230000010267 cellular communication Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 230000002542 deteriorative effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- GVVPGTZRZFNKDS-JXMROGBWSA-N geranyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CO[P@](O)(=O)OP(O)(O)=O GVVPGTZRZFNKDS-JXMROGBWSA-N 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003936 working memory Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
- H04N21/23439—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/61—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
- H04L65/612—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for unicast
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/70—Media network packetisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/75—Media network packet handling
- H04L65/752—Media network packet handling adapting media to network capabilities
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/75—Media network packet handling
- H04L65/762—Media network packet handling at the source
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/647—Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
- H04N21/64723—Monitoring of network processes or resources, e.g. monitoring of network load
- H04N21/64738—Monitoring network characteristics, e.g. bandwidth, congestion level
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/647—Control signaling between network components and server or clients; Network processes for video distribution between server and clients, e.g. controlling the quality of the video stream, by dropping packets, protecting content from unauthorised alteration within the network, monitoring of network load, bridging between two different networks, e.g. between IP and wireless
- H04N21/64746—Control signals issued by the network directed to the server or the client
- H04N21/64761—Control signals issued by the network directed to the server or the client directed to the server
- H04N21/64769—Control signals issued by the network directed to the server or the client directed to the server for rate control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
Definitions
- the present invention relates to delivering streaming content, such as video-on-demand (VoD) content, and particularly relates to delivering streaming content via a radio access network (RAN).
- VoIP video-on-demand
- RAN radio access network
- Adaptive Bit Rate (ABR) technology encodes each video stream into multiple representations, also referred to as“representation levels”.
- representation levels In context of video content, the different representation levels correspond to different bitrates and resolutions.
- representations are packaged into segments, e.g., with each segment containing 2-10 seconds worth of video playback, and the segments comprising a selected representation level are streamed to the video player of a client device, for playout.
- the video player selects a given representation level in view of, e.g., user preferences, device capabilities, and communication link capabilities. At least that latter variable represents significant dynamic variability in the wireless context.
- a tablet, computer, smartphone, or other video playback device may connect to the content server via a wireless network, such as a cellular Radio Access Network (RAN).
- RAN Radio Access Network
- the phrase“radio conditions” refers to the particulars of the communication link between the playback device and the RAN
- the term“network conditions” alludes to broader conditions in the RAN, such as the“loading” level of the RAN, at least in the cell or network area relevant to serving the playback device.
- the playback device may or may not have direct knowledge of the network conditions, it nonetheless experiences the effects of network loading or congestion, e.g., by failing to receive streaming content at an appropriate delivery rate, which can be discerned at the playback device via monitoring of the receive buffer status.
- the playback device When the playback device experiences problems at the current representation level, it may automatically fall back to a lower representation level, with the fall back requiring renegotiation signaling between the playback device and the content server. Further, the playback device must flush its buffers of content at the old representation level and reset certain aspects of its decoding processes. Thus, while fall back offers a mechanism for continuing the content stream under worsening radio or network conditions, it is recognized herein that falling back imposes signaling and other burdens, and can have deleterious effects on the user experience.
- both versions have the same representation level but have different transmission bandwidth requirements.
- the representation level is a video resolution and one version of the content stream has higher video quality at the defined resolution but generally requires more transmission bandwidth than the other version.
- a control node having access to both versions dynamically selects which version to send for delivery to a client device via a Radio Access Network (RAN) supporting wireless delivery to the client device, in dependence on monitoring RAN performance.
- RAN Radio Access Network
- dynamic version selection on the network side reduces the need for the client device to reselect to a lower representation level responsive to decreases in the available transmission bandwidth in the RAN.
- An example network node includes communication circuitry configured for
- the example network node further includes processing circuitry that is operatively associated with the communication circuitry.
- the processing circuitry is configured to monitor a performance of the RAN with respect to delivering a content stream to a client device via the RAN, and to receive a request from the client device, requesting a segment of the content stream at a defined representation level. Further, the processing circuitry is configured to send a first or second version of the segment to the RAN for the client device, in dependence on the monitored performance, the first and second versions being at the defined representation level but having different transmission bandwidth requirements.
- Such operations may be ongoing, e.g., carried out over many segments of the content stream.
- a method of operation in a network node includes monitoring a performance of a RAN, with respect to delivering a content stream to a client device via the RAN, receiving a request from the client device, requesting a segment of the content stream at a defined representation level, and sending a first or second version of the segment to the RAN or the client device, in dependence on the monitored performance.
- the first and second versions of the content segment are at the same defined representation level, but they have different transmission bandwidth requirements.
- the content stream is a video stream
- the defined representation level is a defined resolution
- one version of the segment is higher quality than the other but requires greater transmission bandwidth than the other version.
- the term“higher quality” denotes additional or enhanced information content, such as further refinement layers in a layer-encoded video stream, lower coding rates, etc.
- Figure 1 is a block diagram of one embodiment of a communication network.
- Figure 2 is a block diagram of one embodiment of a Radio Access Network (RAN).
- Figure 3 is a logic flow diagram of one embodiment of a method of operation at a network node, such as the control node introduced in Figure 1.
- RAN Radio Access Network
- Figure 4 is a block diagram of one embodiment of a network node, such as the control node introduced in Figure 1.
- Figure 5 is block diagram of another embodiment of a communication network.
- Figure 1 illustrates a communication network 10 that provides communication services to a client device 12, which is denoted as“CD 12” in the diagram.
- the communication network 10 couples the client device 12 to one or more external networks 14, such as the Internet.
- the external network(s) 14 provide the client device 12 with access to a plethora of communication services, such as streaming services available from an origin content source 16.
- the origin content source 16 provides movies, television shows, or other video content for delivery to the client device 12 via the communication network 10.
- the communication network 10 in one or more of example embodiments comprises a Third Generation Partnership Project (3 GPP) cellular communication network.
- the client device 12 shall be understood as broadly connoting essentially any wireless communication apparatus having the requisite wireless communication circuitry and associated processing functionality necessary for connecting to the communication network 10 via one or more of the involved Radio Access Technologies (RATs), along with further processing circuitry and user interface circuitry and equipment for receiving and playing back one or more types of content streams.
- RATs Radio Access Technologies
- the illustrated communication network 10 includes a Radio Access Network (RAN) 20 and an associated Core Network (CN) 22.
- the CN 22 includes a network node 30 that is referred to herein as the“control node 30”, along with a multi-version content source 32, e.g., a configured computer server or the like.
- a multi-version content source 32 e.g., a configured computer server or the like.
- the depiction of the CN 22 is simplified to focus the discussion.
- the CN 22 includes other nodes, such as mobility management notes, packet gateway notes, etc., to support the routing of user traffic to and from the client device 12 and any number of other devices connected to the communication network 10.
- the multi- version content source 32 is configured to receive a content stream, denoted as“CS” in the illustration, which may be provided by an external content provider, such as NETFLIX.
- CS content stream
- NETFLIX external content provider
- the diagram assumes that the content stream is provided to the multi- version content source 32 at a defined representation level, such as may have been initially negotiated between the client device 12 in the origin content source 16 that provides the content stream.
- the client device and/or the origin content source 16 may renegotiate the representation level during streaming and that the operations of interest herein may be applied to any given representation level currently in use.
- the multi- version content source 32 For the current representation level in use for the content stream, the multi- version content source 32 generates first and second versions, denoted as“VI of CS” and“V2 of CS” in the diagram. Although the two versions are at the same representation level, they have different transmission bandwidth requirements.
- the term“transmission bandwidth requirement” refers to the transmission bandwidth required for transmitting content segments according to the required timing. While the number of bits required to represent a given content segment may vary as a function of the complexity of the content encoded within the segment, for the two content stream versions contemplated here, one version of the content stream is larger than the other version of the content stream. In the video context, one version is encoded for higher video quality than the other version, although the representation levels are the same. Higher-quality encoding may include fewer predictive frames, lower coding or compression rates, additional video enhancement layers, etc.
- ABR Adaptive Bit Rate
- CBR Constant Bit Rate
- ERICSSON AB has developed two versions of optimized ABR, for improved quality with same bandwidth and reduced bandwidth for the same quality. That is, for the same representation level, ABR encoding/re-coding produces two versions of the content stream.
- the multi- version encoding/re-coding uses a Variable Bit Rate (VBR) allocation in ABR content streams, in a manner that is transparent to the playback devices and to the delivery network.
- VBR Variable Bit Rate
- a first version is optimized or otherwise“tuned” for video quality
- a second version is optimized or otherwise“tuned” for bandwidth reduction.
- the VBR encoding tunes the bandwidth consumption to only what is needed for the targeted quality, with image bitrate being defined by the instant content complexity, to reclaim bandwidth when complexity is low.
- image bitrate being defined by the instant content complexity
- the multi- version content source 32 receives a content stream at a defined representation level and generates first and second versions of the content stream, with one version having a generally higher quality than the other version, and with one version having generally lower transmission bandwidth requirements than the other version. It should also be understood that such operations in one or more embodiments applied to cached or pre-fetched content. That is, the multi- version content source 32 stores, or has access to, previously generated and stored versions of a given content stream for each of one or more available representation levels.
- the network node 30 control node 30— operates as an edge-node in the diagram, controlling the delivery of a content stream to a client device 12 through the RAN 20.
- the control node 30 includes communication circuitry 40 that is configured for communicating with one or more other nodes in the RAN 20, and with one or more other nodes operative as content sources.
- the communication circuitry 40 includes communication interface circuits and associated protocol processors configured for communicating with a RAN node 34 in the RAN 20, and further includes communication interface circuits and associated protocol processors configured for communicating with the multi-version content source 32 in the CN 22.
- the RAN node 34 is a radio base station— e.g., and eNB in an FTE-based RAN or a gNB in a 5G-based RAN.
- the RAN node 34 may be a Radio Network Controller (RNC), or may be some other node associated with delivering the content stream to the client device 12 via the wireless link.
- RNC Radio Network Controller
- the control node 30 in the illustrated example further includes processing circuitry 42 that is operatively associated with the communication circuitry 40 and configured to monitor a performance of the RAN 20 with respect to delivering a content stream to a client device 12 via the RAN 20, receive a request from the client device 12, requesting a segment of the content stream at a defined representation level, and send a first or second version (VI or V2) of the segment to the RAN 20 for the client device 12, in dependence on the monitored performance— i.e., the monitored RAN communication performance.
- the first and second versions are at the defined representation level but have different transmission bandwidth requirements.
- the content stream is a video stream that comprises segments of video content, and the one version has a higher video quality than the other version and one version has a lower transmission bandwidth requirement than the other version.
- the processing circuitry 42 in an example embodiment is configured to receive successive requests from the client device 12, requesting successive segments of the content stream at the defined representation level, and is configured to send the first or second version of each successive segment in dependence on the monitored performance for a time relevant to the sending of each successive segment.
- the processing circuitry 42 in at least one implementation, is configured to dynamically decide on a per segment basis whether an available bandwidth in the RAN 20, as indicated by the monitored performance, is sufficient to support delivery to the client device 12 of the version of the segment having the higher transmission bandwidth requirements.
- the content stream is a video stream, for example.
- the defined representation level is a defined video resolution.
- given video content may be available in one or more resolutions, and, for a given resolution being streamed to the client device 12, the control node dynamically chooses between delivering first or second versions of the involved content segments, in dependence on a monitored performance of the RAN 20.
- the first and second versions are respective adaptive bit rate video segments.
- the first version is coded for a lower transmission bandwidth requirement at the defined representation level, as compared to the second version
- the second version is coded for a higher video quality at the defined representation level, as compared to the first version.
- “first” and“second” do not connote encoding orders or priorities, and instead operate as labels for conveniently referring to the different versions of the content.
- the processing circuitry 42 in an example embodiment is configured to monitor the performance of the RAN 20 by monitoring one or more transmission parameters on an ongoing basis.
- the transmission parameter(s) in question vary in dependence on a bandwidth available in the RAN 20 for delivering the content stream to the client device 12, and the processing circuitry 42 is configured to send the first or second version of the segment based on choosing the version having the lower transmission bandwidth requirements, if the available bandwidth is deemed insufficient for delivering the version having the higher transmission bandwidth requirements.
- the one or more transmission parameters comprise at least one of: a Transmission Control Protocol (TCP) congestion parameter associated with delivering the content stream to the client device 12 via the RAN 20, a parameter associated with delivering the content stream to the client device 12 via the RAN 20 using a Quick User Datagram Protocol Internet Connection (QUIC) delivery mechanism, a Round Trip Time (RTT) parameter associated with delivering the content stream to the client device 12 via the RAN 20, and a throughput parameter associated with delivering the content stream to the client device 12 via the RAN 20.
- TCP Transmission Control Protocol
- QUIC Quick User Datagram Protocol Internet Connection
- RTT Round Trip Time
- the processing circuitry 42 may be configured to monitor the performance of the RAN 20 based on receiving explicit feedback from the RAN 20 on an ongoing basis.
- the explicit feedback directly or indirectly indicates a bandwidth available in the RAN 20 for delivering the content stream to the client device 12, and the processing circuitry 42 is configured to send the first or second version of a requested segment based on choosing the version having the lower transmission bandwidth requirements, if the available bandwidth is deemed insufficient for delivering the version having the higher transmission bandwidth requirements.
- the processing circuitry 42 is configured to request the first and second versions of the content segment from a content source node, e.g., the multi-version content source 32.
- the first version of a content segment is a bandwidth-optimized version and the second version is a quality-optimized version.
- the processing circuitry 42 is configured to check whether an observed or indicated performance of the RAN 20 with respect to delivering one or more prior segments of the content stream is sufficient for delivering the quality-optimized version of the segment and, if so, send the quality-optimized version, and otherwise send the bandwidth-optimized version.
- the processing circuitry 42 comprises fixed circuitry, programmed circuitry, or some combination of both.
- the processing circuitry 42 comprises digital processing circuitry that is programmatically configured based on execution of stored computer program instructions.
- the control node 30 includes storage 44, which stores one or more computer programs 46 comprising such computer program instructions, and which may further store one or more supporting items of configuration data 48.
- the storage 44 comprises or includes a non-transitory computer-readable medium storing the computer program(s) 46, for execution by one or more microprocessors or other digital processors included in the processing circuitry 42.
- execution“specially adapts” the processing circuitry 42 to perform the processing described herein.
- the processing circuitry 42 comprises one or more microprocessor-based circuits, DSP-based circuits, FPGA-based circuits, ASIC-based circuits, etc.
- the storage 44 may comprise a mix of memory types or storage devices.
- the storage 44 may include FLASH memory or a Solid State Disk (SSD) for non-volatile storage of program instructions, provisioning data, etc., and may include SRAM or DRAM for use as working memory for program execution, data processing, etc., during live operation.
- SSD Solid State Disk
- Figure 1 depicts the control node 30 as part of the CN 22, one or more other contemplated embodiments involve implementation of the control node 30 as part of the RAN 20, such as suggested in Figure 2. In either case, there may be more than one control node 30 implemented in the network 10. For example, there may be different control nodes 30 for different geographic or logical areas of the network 10.
- Figure 3 depicts an example method 300 of operation in a network node, such as may be carried out by the network node 30.
- the method 300 includes monitoring (Block 302) a performance of a RAN 20, with respect to delivering a content stream to a client device 12 via the RAN 20, receiving (Block 304) a request from the client device 12, requesting a segment of the content stream at a defined representation level, and sending (Block 306) a first or second version of the segment to the RAN 20 for the client device 12, in dependence on the monitored performance.
- the first and second versions are both at the same defined representation level but have different transmission bandwidth requirements.
- the method 300 may be performed in an order different than that suggested by the logic flow diagram, and it may be performed for more than one client device 12 being served the same or different content streams. As regards a given client device 12 and a given content stream, one or more steps or operations shown in Figure 3 may be performed on a background or ongoing basis, e.g., the method 300 may include tracking RAN performance over some running window, to allow version control decisions to be made on filtered parameters.
- receiving (Block 304) a request from a client device 12 may comprise receiving successive requests, and requesting successive segments of a content stream at a defined representation level.
- the method 300 would include sending the first or second version of each successive segment in dependence on the monitored performance for a time relevant to the delivery of each successive segment.
- Such operations allow the network node 30 to change which segment version is selected for delivery to the client device 12 responsive to dynamic changes in the radio link to the client device 12 and/or changing loading conditions in the RAN 20.
- sending (Block 306) the first or second version of each successive segment in dependence on the monitored performance for the time relevant to the delivery of each successive segment comprises dynamically deciding on a per segment basis whether an available bandwidth in the RAN 20, as indicated by the monitored performance, is sufficient to support delivery of the version of the segment having the higher transmission bandwidth requirement.
- the content stream in question may be a video stream, in which case the defined representation level is a defined video resolution.
- the first and second versions of each content segment are respective adaptive bit rate video segments, with the first version coded for a lower transmission bandwidth requirement at the defined representation level, as compared to the second version, and with the second version coded for a higher video quality at the defined representation level, as compared to the first version.
- the monitoring step (Block 302) of the method 300 comprises, for example, monitoring one or more transmission parameters on an ongoing basis.
- the transmission parameter(s) vary in dependence on a bandwidth available in the RAN 20 for delivering the content stream to the client device 12.
- the step of sending (Block 306) the first or second version of a segment of a content stream comprises choosing the version having the lower transmission bandwidth requirements, if the available bandwidth is deemed insufficient for delivering the version having the higher transmission bandwidth requirements.
- Non-limiting examples of such transmission parameters include at least one of: a Transmission Control Protocol (TCP) congestion parameter associated with delivering the content stream to the client device 12 via the RAN 20, a parameter associated with delivering the content stream to the client device 12 via the RAN 20 using a Quick User Datagram Protocol Internet Connection (QUIC) delivery
- TCP Transmission Control Protocol
- QUIC Quick User Datagram Protocol Internet Connection
- RTT Round Trip Time
- the method 300 uses explicit feedback from the RAN 20 for performance monitoring.
- the monitoring step (Block 302) in one or more embodiments comprises or includes receiving explicit feedback from the RAN 20 on an ongoing basis.
- the explicit feedback directly or indirectly indicates a bandwidth available in the RAN 20 for delivering the content stream to the client device 12.
- sending (Block 306) the first or second version of the segment comprises choosing the version having the lower transmission bandwidth requirements, if the available bandwidth is deemed insufficient for delivering the version having the higher transmission bandwidth requirements.
- FIG. 4 illustrates another embodiment of the control node 30, such as may be implemented via computer processing.
- the control node 30 comprises a monitoring module 400 that is configured to monitor a performance of a RAN 20, with respect to delivering a content stream to a client device 12 via the RAN 20, and one or more communication modules 402 that are configured to receive a request from the client device 12, requesting a segment of the content stream at a defined representation level, and send 306 a first or second version of the segment to the RAN 20 for the client device 12.
- the control node 30 includes a decision module 404 that decides whether the first or second version should be sent, in dependence on the monitored performance.
- the first and second versions of the content segment are at the same defined representation level but have different transmission bandwidth requirements.
- Figure 5 illustrates a further example communication network 500, which may be a more detailed example of the network 10 introduced in Figure 1. It shall be understood that the various entities illustrated in Figure 5 are realized in various processing circuitry, e.g., in one or more servers or computer systems adapted for the requisite processing and having corresponding inter node communication interfaces.
- the network 500 includes an edge delivery node 501 that operates as a version of the above-described control node 30.
- the edge delivery node 501 delivers selected versions of requested content to a client device 12 through a RAN 20, which includes one or more forms of the RAN node 34.
- the network 500 also includes or is associated with an optimized ABR encoder 502, which may be a version or form of the earlier-described multi-version content source 32.
- the network 500 includes a live packager 504, a multi-version content (MC) sender 506, a video-on-demand (VOD) ingestor 508, an optimized ABR transcoder and packager 510, a content origin entity 512, a core delivery entity 514.
- MC multi-version content
- VOD video-on-demand
- Figure 5 provides for live content streaming, or VOD streaming, or both.
- the content stream of interest is encoded in two versions, one targeting quality and one targeting reduced transmission bandwidth (at least as compared to the version targeting quality).
- Such versions may be produced or otherwise be available for each of two or more different representation levels, e.g., resolutions, of the content stream in question.
- the edge delivery node 501 receives the two segments of the same
- such operations are transparent to the client device 12, with the client device 12 requesting a content segment and receiving one or the other version of the requested segment in dependence on the monitored performance of the RAN 20. That is, the client device 12 does not make version-specific requests and need not know about the different versions. Instead, the client device 12 merely makes content segment requests directed to the
- the edge delivery node 501 interacts with the RAN 20 and/or uses internal heuristics based on transport layer metrics to predict the RAN bandwidth available for delivering a requested content segment to the client device 12. In at least some embodiments, the edge delivery node 501 selects, as a default choice, the quality-optimized content segments for delivery to the client device 12 via the RAN 20, and then changes its selection over to the bandwidth-optimized content segments in response to the monitored RAN performance falling below some threshold.
- the edge delivery node 501 may monitor RAN performance indirectly, such as by monitoring for buffer underflow events at the client device 12, where the client device 12 experiences buffer underflow as a consequence of the requested content segments being delivered to it at a rate high enough to support uninterrupted playout at the client device 12.
- Buffer underflows may arise with deteriorating radio conditions on the radio link between the client device 12 and the RAN 20 and/or may arise as a consequence of RAN congestion, at least in the portion of the RAN 20 involved in delivering the content stream to the client device 12.
- the occurrence of buffer underflow events at the client device 12 may be understood as an implicit indication that the bandwidth currently available in the RAN 20 for delivering the content stream to the client device 12 is insufficient for the quality-optimized version of the content segments.
- Dynamic selection between the two versions of content segments by the edge delivery node 501 reduces the probability that the client device 12 will switch to a lower resolution of the content stream— i.e., reduces the number of instances in which the client device 12 will switch to a less demanding representation level.
- the edge delivery node 501 uses or otherwise depends on optimized ABR that encodes live MPEG2-Transport Stream (TS) content streams into multi bitrate ABR representations.
- TS live MPEG2-Transport Stream
- Each representation level is encoded into a quality-optimized version (referred to as Optimized ABR vl) and into a bandwidth-optimized version (referred to as Optimized ABR v2), resulting in a dual- version stream representation for each representation level of the original content stream.
- Optimized ABR vl quality-optimized version
- Optimized ABR v2 bandwidth-optimized version
- the optimized ABR encoder 502 performs such encoding based on receiving live content streams in the MPEG2-TS format, and it provides the resulting multi- version content streams to the live packager 504.
- the live packager 504 generates dual- version segments of the content stream for each representation level.
- the URL of the dual segments of the same representation is the same except for their version number (vl or v2), which is indicated in the URL as a query parameter.
- the live packager 504 also generates the manifest containing the URLs of the generated segments without any indication of their version, i.e. URLs that do not include any version.
- the version information is used within the network 500 for version selection as described herein, but is stripped by the edge delivery node 501 for delivery of the content segments to the client device 12— i.e., the versioning is transparent to the client device 12.
- the dual- version content segments of live content streams generated from the live packager 504 are pushed to edge delivery with multicast ABR (MABR).
- Each representation and each version of the representation is allocated a group address, for example. For instance, a given live stream is sent on six group addresses, if the corresponding original MPEG-TS live stream is encoded at three representation levels and two versions of each representation level are produced.
- the live content segments as well as the manifest are pushed to the content origin entity 512 for unicast ABR streaming of live content, i.e., pulling the appropriate version of live content segments from the content origin entity 512, instead of pushing with MABR. That is, live content can be delivered either as unicast end to end via the nodes 502, 504, 512, 514, and 501, or for efficiency reasons, at least some live content can be delivered using MABR in the core, with unicast at the network edge.
- the edge delivery node 501 stores the dual- version content segments for a short time, e.g. three times the segment length in seconds.
- the edge delivery node 501 receives a GET request from a client device 12 for a content segment from a particular representation of a content stream, it first checks the bandwidth available to the client device 12 and sends the quality-optimized version of the requested content segment if there is enough bandwidth for the delivery, and otherwise sends the bandwidth-optimized version of the requested content segment.
- the“bandwidth available to the client device 12” may be checked directly or indirectly, and it may be a predictive value, e.g., one based on the most recent assessment of RAN conditions, or based on a running assessment of RAN conditions.
- the edge delivery node 501 monitors, for example, one or more transmission parameters associated with ongoing content stream delivery to the client device 12, such as congestion-related parameters, buffer level feedback from the client device 12, etc., to infer the available bandwidth in the RAN 20 for delivering a currently-requested content segment to the client device 12. Additionally, or alternatively, the edge delivery node 501 may receive explicit feedback from the RAN 20 (suggested by the“RAN INTERACTION” label in the diagram), indicating RAN conditions.
- VOD content high-resolution VOD content may be ingested by the network 500 via the VOD ingestor entity 508, which provides the VOD content to the optimized ABR transcoder and packager 510.
- the entity 510 transcodes and packages the VOD content into multibit rate and multi- version representations in the same way as described previously for the live streams input to the optimized ABR encoder 502.
- the entity 510 generates dual version content segments for each representation level of a given VOD content stream, where the URLs of the two versions of the same representation are the same except for their version number (vl or v2).
- either the quality-optimized or the bandwidth optimized version of a given content stream may be requested for the current representation level, based on indicating the version number in the URL.
- versioning information may be stripped for delivery to the client device 12.
- the packager 510 also generates the manifest containing the URLs of the generated segments without any indication of their version.
- the VOD segments as well as the manifest are pushed to content origin entity 512, for unicast ABR streaming to the client device 12 via the edge delivery node 501, which couples to the content origin entity 512 through a core delivery entity 514.
- the core delivery entity 514 for example, provides a level of content caching.
- the edge delivery node 501 when the edge delivery node 501 receives a GET request (a URL) from a client device 12 for a content segment at a particular representation level, it fetches the dual versions of the requested content segment from the content origin entity 512, if those segments are not precached/prepositioned at the edge delivery node 501. The edge delivery node 501 then checks the bandwidth available to the client device 12 and sends the quality-optimized version of the requested content segment if there is enough bandwidth for the delivery, and otherwise sends the bandwidth-optimized version of the requested content segment.
- a GET request a URL
- saying that there is“enough” or“not enough” bandwidth available in the RAN 20 for delivery of the quality-optimized version to the client device 12 may constitute a simple threshold comparison by the edge delivery node 501.
- any one or more of monitored delays or round-trip-times, monitored buffer levels, monitored congestion window sizes, or the like may be compared to a corresponding“go” or“no-go” threshold.
- monitored congestion level as a specific example, if the parameter or parameters monitored as an indication of RAN congestion exceed a threshold value or values corresponding to a congestion level deemed too high, then the edge delivery node 501 would conclude that there is not enough bandwidth to deliver the quality-optimized version of the requested content segment on a timely basis.
- the edge delivery node 501 is configured in one or more embodiments to intelligently manage its dual- version content storage.
- the size of a given content segment depends on the involved image complexity. That is, as a rule, simpler or more uniform video content encodes at a higher compression ratio than more complex video content. While the quality-optimized version of a content segment will in general be larger than the bandwidth-optimized version of the same content segment, the degree or extent of the size difference depends on the underlying complexity of the encoded video content. When image complexity is low, there may not be a significant difference in the sizes of the two versions.
- At least one embodiment of the edge delivery node 501 includes processing circuitry that is configured to evaluate the size difference between quality-optimized and bandwidth-optimized versions of the same content segment, and keep only version if the size difference is less than a defined threshold value, e.g., ten percent. For example, the edge delivery node 501 retains only the quality-optimized version of the content segment. Of course, such retention decisions are made for each segment, thus, for a given content stream representation level, the edge delivery node 501 may store some content segments in dual version form and store others in single version form. The overall effect of such processing reduces the amount of memory or storage required to cache or buffer dual- version content at the edge delivery node 501.
- a defined threshold value e.g., ten percent.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Computer Security & Cryptography (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Des procédés et un appareil décrits à titre d'exemple dans ce document sélectionnent entre différentes versions d'un flux de contenu, par exemple, sur une base par segment, pour une distribution sans fil à un dispositif client. De manière avantageuse, les deux versions ont le même niveau de représentation mais ont des exigences de bande passante de transmission différentes. À titre d'exemple, le niveau de représentation est une résolution vidéo et une version du flux de contenu présente une qualité vidéo supérieure à la résolution définie mais nécessite généralement plus de bande passante de transmission que l'autre version. Un nœud de commande ayant accès aux deux versions sélectionne dynamiquement la version à envoyer pour une distribution à un dispositif client par l'intermédiaire d'un réseau d'accès radio (RAN) prenant en charge une distribution sans fil au dispositif client, en fonction de la surveillance de la performance du RAN. Notamment, une sélection de version dynamique sur le côté réseau réduit le besoin que le dispositif client sélectionne à nouveau à un niveau de représentation inférieur en réponse à des diminutions de la largeur de bande de transmission disponible dans le RAN.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2017/083993 WO2019120532A1 (fr) | 2017-12-21 | 2017-12-21 | Procédé et appareil de commande de débit binaire adaptatif dans un réseau de communication sans fil |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2017/083993 WO2019120532A1 (fr) | 2017-12-21 | 2017-12-21 | Procédé et appareil de commande de débit binaire adaptatif dans un réseau de communication sans fil |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019120532A1 true WO2019120532A1 (fr) | 2019-06-27 |
Family
ID=60937735
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2017/083993 WO2019120532A1 (fr) | 2017-12-21 | 2017-12-21 | Procédé et appareil de commande de débit binaire adaptatif dans un réseau de communication sans fil |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2019120532A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114885208A (zh) * | 2022-03-21 | 2022-08-09 | 中南大学 | Ndn网络下可伸缩流媒体传输的动态自适应方法、设备及介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013151674A1 (fr) * | 2012-03-13 | 2013-10-10 | Google Inc. | Transmission multimédia en continu adaptative prédictive |
US20140143823A1 (en) * | 2012-11-16 | 2014-05-22 | James S. Manchester | Situation-dependent dynamic bit rate encoding and distribution of content |
US20150023404A1 (en) * | 2013-07-16 | 2015-01-22 | Cisco Technology, Inc. | Quality Optimization with Buffer and Horizon Constraints in Adaptive Streaming |
US20150282000A1 (en) * | 2012-10-01 | 2015-10-01 | Telefonaktiebolaget L M Ericsson (Publ) | Apparatus and method relating to the streaming of content to one or more user devices |
-
2017
- 2017-12-21 WO PCT/EP2017/083993 patent/WO2019120532A1/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013151674A1 (fr) * | 2012-03-13 | 2013-10-10 | Google Inc. | Transmission multimédia en continu adaptative prédictive |
US20150282000A1 (en) * | 2012-10-01 | 2015-10-01 | Telefonaktiebolaget L M Ericsson (Publ) | Apparatus and method relating to the streaming of content to one or more user devices |
US20140143823A1 (en) * | 2012-11-16 | 2014-05-22 | James S. Manchester | Situation-dependent dynamic bit rate encoding and distribution of content |
US20150023404A1 (en) * | 2013-07-16 | 2015-01-22 | Cisco Technology, Inc. | Quality Optimization with Buffer and Horizon Constraints in Adaptive Streaming |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114885208A (zh) * | 2022-03-21 | 2022-08-09 | 中南大学 | Ndn网络下可伸缩流媒体传输的动态自适应方法、设备及介质 |
CN114885208B (zh) * | 2022-03-21 | 2023-08-08 | 中南大学 | Ndn网络下可伸缩流媒体传输的动态自适应方法、设备及介质 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8812621B2 (en) | Reducing fetching load on cache servers in adaptive streaming | |
US10455404B2 (en) | Quality of experience aware multimedia adaptive streaming | |
US10034058B2 (en) | Method and apparatus for distributing video | |
US10034048B2 (en) | Multipath delivery for adaptive streaming | |
US10721715B2 (en) | Link-aware streaming adaptation | |
US9838459B2 (en) | Enhancing dash-like content streaming for content-centric networks | |
US11949512B2 (en) | Retransmission of data in packet networks | |
Thomas et al. | Enhancing MPEG DASH performance via server and network assistance | |
US20120140645A1 (en) | Method and apparatus for distributing video | |
US20140012981A1 (en) | Apparatus and methods for optimizing network data transmission | |
US9967768B2 (en) | Apparatus and method relating to the streaming of content to one or more user devices | |
US10063893B2 (en) | Controlling the transmission of a video data stream over a network to a network user device | |
CN107210999B (zh) | 链路感知流送自适应 | |
US10834161B2 (en) | Dash representations adaptations in network | |
US20150052236A1 (en) | Load based target alteration in streaming environments | |
CN109076062B (zh) | 边缘节点控制 | |
Afzal et al. | A holistic survey of multipath wireless video streaming | |
US20140325023A1 (en) | Size prediction in streaming enviroments | |
WO2019120532A1 (fr) | Procédé et appareil de commande de débit binaire adaptatif dans un réseau de communication sans fil | |
Seo et al. | Network-adaptive autonomic transcoding algorithm for seamless streaming media service of mobile clients |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17825842 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17825842 Country of ref document: EP Kind code of ref document: A1 |