THE CAPABILITY of real-time transmission of video over

Size: px
Start display at page:

Download "THE CAPABILITY of real-time transmission of video over"

Transcription

1 1124 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 Efficient Bandwidth Resource Allocation for Low-Delay Multiuser Video Streaming Guan-Ming Su, Student Member, IEEE, and Min Wu, Member, IEEE Abstract This paper studies efficient bandwidth resource allocation for streaming multiple MPEG-4 fine granularity scalability (FGS) video programs to multiple users. We begin with a simple single-user scenario and propose a rate-control algorithm that has low delay and achieves an excellent tradeoff between the average visual distortion and the quality fluctuation. The proposed algorithm employs two weight factors for adjusting the tradeoff, and the optimal choice of these factors is derived. We then extend to the multiuser case and propose a dynamic resource allocation algorithm with low delay and low computational complexity. By exploring the variations in the scene complexity of video programs as well as dynamically and jointly distributing the available system resources among users, our proposed algorithm provides low fluctuation of quality for each user, and can support consistent or differentiated quality among all users to meet applications needs. Experimental results show that compared to traditional look-ahead sliding-window approaches, our algorithm can achieve comparable visual quality and channel utilization at a much lower cost of delay, computation, and storage. Index Terms Dynamic resource allocation, fine granularity scalability (FGS) coding, multiuser video communications, rate control, visual quality fluctuation. I. INTRODUCTION THE CAPABILITY of real-time transmission of video over network paves way to a number of emerging applications, which allows us to communicate and entertain from almost every corner of the world. In such applications as digital video on-demand service, broad-band wireless video streaming and conferencing, and direct broadcast satellite (DBS) service, multiple encoded video programs will be transmitted or relayed through a central server. The overall bandwidth of the outbound video streams is limited by the server s outbound communication capacity. To efficiently share critical resources and meet a set of quality of service (QoS) requirements, a major concern for the server is how to allocate the bandwidth resource to each stream. There are two different strategies of resource allocation for multiple users, namely, a collection of single-user subsystems with static resource allocation among users, and a dynamic joint resource allocation system [1] [3]. In the first system, each user is allocated a fixed amount of bandwidth and the video transmitted for each user is kept Manuscript received December 31, 2003; revised July 8, This work was supported in part by the U.S. National Science Foundation under Award CCR Part of this work was presented at the IEEE International Conference on Communications, Paris, France, June This paper was recommended by Associate Editor H. Sun. The authors are with the Department of Electrical and Computer Engineering, University of Maryland, College Park, MD USA ( gmsu@eng.umd.edu; minwu@eng.umd.edu). Digital Object Identifier /TCSVT below the bound. This bit allocation strategy treats each user individually without dynamically sharing resources with other users. Since different video programs have different content complexity, at a given bit rate some may have unnecessarily high perceptual quality, while others may have low perceptual quality. In contrast, a system with dynamic resource allocation can leverage the variation of the content complexity in different video programs, aggregate the resource from all users into a common pool, and jointly allocate bandwidth resource to each user to achieve consistent perceptual quality to each other [4]. In this paper, we focus on the dynamic bandwidth resource allocation for a multiuser video streaming system. As with every resource allocation mechanism, we are concerned with the efficiency and effectiveness of the allocation. In particular, we are dealing with dynamically allocating bandwidth resource to a potentially large number of users in real-time video streaming applications. In order for a system to have good scalability to handle many users as well as to do real-time processing, the computational complexity of a resource allocator should be sufficiently low. The end-to-end delay is often stringent for accommodating interactive applications such as video conferencing. These motivate us to investigate efficient resource allocation strategies with low computational complexity, high scalability, and low delay. Resource allocation strategies are tied to the system s service objective. In a video transmission system, the perceptual quality of the received video is one of the most important aspects of quality of service experienced by the end users. There are two types of visual quality concern. The most common concern is the average visual quality, often measured in terms of the average mean-square-error (MSE) of all video frames, or the corresponding peak signal-to-noise ratio (PSNR) [5] [7]. The other important concern is the quality fluctuation, as substantial quality differences between nearby frames can bring annoying flickering and other artifacts to viewers even when the average PSNR is satisfactory. In many systems that employ a set of frames as an encoding unit (known as a group of pictures/frames), severe quality fluctuation may also appear at the boundaries between groups of frames [8]. The quality fluctuation can be measured by the mean absolute difference of the MSE between adjacent frames [6], [7], [9]. Most prior work targeted at optimizing one of the two measures. If the rate-distortion (R-D) characteristics of all video frames are identical, the bit rate allocated to each frame will be equal, leading to identical perceptual quality between frames and the above two measures can be simultaneously optimized [1]. In reality, however, a video has varying R-D characteristics, making it difficult to optimize the average quality and the quality fluctuation at the same time /$ IEEE

2 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING 1125 Our work aims at reaching an excellent tradeoff between these two quality criteria through a real-time low-delay algorithm. In addition to perceptual quality criteria, resource allocation strategies are also closely related to a system s adjustability on resources. For video transmission, the adjustability concerns how a video system can change video encoding/transmission rate to achieve desired visual quality or vice versa. A highly scalable video codec is desirable since it provides flexibility and convenience in reaching the desired visual quality and/or the desired bit rate. Recently, the fine granularity scalability (FGS) coding [10] [12] and fine granular scalability temporal (FGST) coding [13] have been added into the MPEG-4 video coding standard. The encoder generates a base layer at a low bit rate using a large quantization step and computes the residues between the original frame and the base layer. The bit planes of discrete cosine transform (DCT) transformed coefficients of these residues are then encoded sequentially as an enhancement layer, which is the FGS layer. The decoder can decode any truncated segment of the bit stream corresponding to each frame. The more bits the decoder receives and decodes, the higher the perceptual quality of video we observe. We adopt FGS codec in this work to allow convenient adjustment of rate and distortion. While a number of works have been devoted to bandwidth allocation for video, only a small amount of them address the multiuser problem. As we will review in Section II, the existing approaches do not always provide a good tradeoff on the two types of visual quality concerns (low average distortion and low fluctuation), or do not always keep delay low and scale well to accommodate a large number of users. In this paper, we propose an efficient algorithm for dynamic bandwidth resource allocation that addresses the above issues and provides superior performance over the prior art. Our work starts with a simple scenario where there is only a single user in the system. We employ two weight factors for adjusting the tradeoff between the overall distortion and the quality fluctuation, and derive the optimal choice of these factors for achieving an excellent tradeoff. We then extend the strategy to the multiuser case and propose a multiuser real-time resource allocation algorithm with low delay and low computational complexity. By exploring the variation in the scene complexity of each video program and jointly redistributing the system resources among users, our proposed algorithm provides low fluctuation of quality for each user, and depending on the applications needs, it can provide consistent or differentiated quality among all users. Our experimental results on 15 sequences of a total of nearly 6000 frames show that compared to conventional look-ahead sliding-window approaches, the proposed algorithm can achieve comparable perceptual quality and channel utilization at a much lower cost of delay, computation, and storage, and therefore is suitable for a variety of multiuser broad-band applications. The paper is organized as follows. We first review the prior work in Section II and provide preliminaries on the R-D model for FGS video in Section III. Section IV discusses the simple case of single-user bandwidth resource allocation and proposes an efficient real-time algorithm. We then extend the strategy to the multiuser scenario and present a new algorithm in Section V. Experimental results are shown in Section VI and conclusions drawn in Section VII. II. PRIOR WORK Rate control for single user can be considered as a special case of multiuser resource allocation. In general, a video encoded in variable bit rate (VBR) bitstream gives better perceptual quality than in constant bit rate (CBR) bitstream due to the variation of the scene complexity [14]. Most applications deliver a VBR video bitstream through a channel with a fixed amount of bandwidth (known as a CBR channel) because of its predictable traffic pattern as well as simple network management. However, VBR transmission has been shown to provide better source quality and network utilization [15]. To smoothen the traffic and alleviate the jitter caused by VBR coding and transmission, the system allocates buffers on both the transmitter side and the receiver side. The dynamics of the buffer is subject to two constraints to maintain the QoS. When the buffer overflows, we will start to lose data, which degrades the received visual quality; and when the decoder buffer underflows, the decoder has no data to keep up the decoding, which causes jitters. Therefore, a rate control algorithm must be applied to prevent the buffers from overflowing and underflowing [16]. For systems employing MPEG-1/2, H261, or H.263, the encoding rate is often changed by adjusting the quantization step size [16], [17]. To achieve high overall perceptual quality in the single-user scenario, rate control was formulated as an optimization problem in [5], [18] [20]. These approaches are suitable for off-line applications where the entire video content is known to the transmitter. The computation cost for handling a long video sequence is high due to the nature of integer and dynamic programming. To facilitate solving the rate control problems, several R-D models of existing video codecs have been exploited in the literature. An R-D based approach was proposed in [21] under the assumption that the DCT coefficients of a motion-compensated residue frame are uncorrelated and Laplacian distributed. A R-D model using intra-frame approximation and inter-frame dependency within one GOP was proposed in [6] to meet the perceptual requirement. A quadratic R-D model and rate control for MPEG-4 was studied in [22] and [23]. A rate control algorithm employing a linear correlation model was proposed in [24], whereby the correlation between the rate and the percentage of zeros among the quantized transform coefficients was explored. To simplify the selection of encoding and channel rates, wavelet-based embedded codecs were considered in the rate control problems of [7] and [25]. Sliding window is a general approach that can be used to keep track and allocate system resources. The work in [26] took advantage of the fine granularity of the MPEG-4 FGS codec and proposed a variable-size sliding window scheme to control how much FGS layer data is sent under different channel conditions. An R-D based rate control scheme for prestored video was studied in [9] using a three-level bit allocation for the base layer and employing a sliding window for the FGS layer rate control. An online algorithm using a look-ahead sliding window to achieve constant perceptual quality was proposed in [1]. To apply this scheme for transmitting real-time encoded video, we need to allocate extra storage to store several frames ahead, and perform bit allocation for the current frame by solving such an optimization problem that all frames within a

3 1126 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 look-ahead window have consistent and the highest possible perceptual quality subject to a given rate budget. Our studies show that to obtain a low fluctuation of quality, the window size should be no smaller than the size of half to one GOP, which leads to a nontrivial amount of delay that is often too long for real-time interactive applications. We will investigate in this paper how to overcome the problems of long delay and extra storage associated with the sliding window approach. Several works on joint rate control for multiple video programs employed MPEG-1/2 codecs [2] [4]. And the extension of the sliding window approach to multiple MPEG-4 FGS video programs was proposed in [1], employing a two-dimensional (2-D) window to address the multiuser problem. However, the computational complexity and extra storage for the look-ahead frames of the sliding window approach go up with the increase of the window size and the number of users. As the number of users increases in the system, the required computational resources to achieve a low fluctuation of quality become formidable. Our work in the current paper will overcome the problem of high computational complexity of the 2-D sliding window approach and improve the system scalability to accommodate many users. III. FGS RATE-DISTORTION MODEL AND SIMILARITY Existing rate control schemes for a single-layer video stream often employ an intra-frame R-D model. Laplacian and Gaussian distribution are typical approximations of DCT coefficients, leading to the frequent use of an exponential or a polynomial R-D model [1], [21]. In contrast to single-layer codecs, FGS codec is a two-layer embedded scheme with an enhancement layer encoded bit plane by bit plane. There is a need to model the statistical distribution of DCT bit planes and their R-D characteristics. Furthermore, due to the nature of the temporal redundancy in video, the predicatively encoded frames within one scene have highly similar R-D characteristics. In this section, we present R-D models for intra-frame and inter-frame of a FGS layer, which will be used in our work. A. Intra-frame Rate-Distortion Model As reviewed earlier, the MPEG-4 FGS standard employs bit-plane coding of the DCT residue between the original frame and the base layer. For a given bit plane in a frame, if the video is spatially stationary so that the length of the entropy encoded FGS symbols in all blocks is similar to each other, the decoded bit rate and the corresponding amount of reduced distortion will have an approximately linear relationship over the bit rate range of this bit plane. Previous studies in [1] and [9] and our experiments show that a piecewise linear line is a good approximation to the R-D curve of FGS video in the frame level. This piecewise linear line model can be described as for and (1) Here, represents the MSE between the th original frame and the decoded frame with rate, the distortion of Fig. 1. Inter-frame similarity of FGS R-D characteristics. The results for the odd and even scenes are presented in alternating colors. the th frame measured in mean square error after completely decoding the first DCT bit planes, the corresponding bit rate, and the total number of bit planes. We use and to represent the distortion and rate of the base layer, respectively. Since DCT is a unitary transform, measuring the mean square error between an original frame and its partially decoded version from FGS encoded stream is equivalent to calculating the average energy of the undecoded DCT bit planes in the FGS data stream, along with the residue between the original frame and the complete FGS data. Thus, all s and s can be obtained during the encoding process. B. Similarity in Interframe R-D Characteristics Another important characteristic of FGS video is that the R-D curves of FGS layer between two consecutive frames are similar when they are within the same scene. The rationale is as follows: for a video segment within a scene, the energy of the motion compensation residues between two adjacent frames are comparable. As the base layer is generated using a set of large quantization steps, it leaves most motion residues to be coded by the FGS layer. Therefore, after FGS encoding, the overall R-D characteristics between two adjacent frames are similar. We quantify the similarity of the R-D characteristics between frame and using where is a bit rate sampling interval,, and is the maximal available amount of FGS data for the th frame. A low value of implies high similarity in the R-D characteristics of the th and th frames. Fig. 1 shows the for a long video sequence consisting of 15 different standard QCIF clips. As we can see, the R-D models within each clip show a strong similarity. The value becomes large and suggests low similarity when transiting from one clip to another. (2)

4 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING 1127 TABLE I SUMMARY OF NOTATIONS Fig. 2. Block diagram of a single-user video streaming system. IV. LOW-DELAY BANDWIDTH RESOURCE ALLOCATION FOR SINGLE USER To facilitate the investigation of the resource allocation problem in a multiuser system, we first study in this section a special case that concerns only a single user in the system. We begin with a discussion on the mechanism of a single-user FGS streaming video system and the corresponding constraints. We formulate this system as a resource allocation problem with two perceptual objectives subject to the system constraints. An online bandwidth resource allocation algorithm with low delay and low fluctuation of quality is then proposed to achieve a tradeoff point between these two perceptual criteria. A. System Constraints Illustrated in Fig. 2 is a typical streaming video system. There are two subcomponents in the encoder. One is the base layer encoder and the other is the FGS layer encoder. We discretize the time line by dividing one second into time slots, where is the video frame rate. For the simplicity in system design and providing a primitive quality with low fluctuation, we set a large fixed quantization step for all frames in the base layer codec and only perform the rate control for the FGS layer. We denote the base layer rate as, i.e., a total of bits must be sent at the th time slot to ensure the baseline quality. The FGS encoder encodes the bit planes of the residue. Both encoders analyze the R-D characteristics of the incoming video frame and pass the necessary information, such as the R-D pairs, to the rate control module. After the rate control module determines the amount of FGS data to be transmitted, the encoded base layer and the truncated FGS layer bitstream are moved to the encoder buffer, where we denote the FGS data rate at the th time slot as. The channel then delivers video bitstream from the encoder buffer to the decoder buffer. Here we assume that the channel has a maximum rate for reliable transmission,, although it is not necessarily in its full load all the time. The amount of channel transmission rate at the th time slot, denoted as, is also determined by the rate control module. For simplicity, we assume that the transmission delay of every packet is fixed at time slots [5], [19]: if a packet is sent from the encoder buffer at the th time slot, it will arrive at the decoder buffer at the th time slot. The decoder fetches data from the decoder buffer, decodes it, and displays each decompressed video frame at its desired instant. Therefore, the major task of the rate control module is to determine and. To ease the discussion, we summarize the notations in Table I. There are three constraints imposed in this system, as studied in the literature [15], [16]. The first constraint is to prevent the encoder buffer of a limited size from overflow. At the th time slot, a data segment of size is taken from an encoder buffer and sent through the channel, and then a newly encoded frame with size is added to the encoder. The dynamics of the encoder buffer can thus be expressed as where is nonnegative and describes the occupancy of encoder buffer, and the maximal size of encoder buffer. In addition, the FGS rate should be nonnegative. For a given, we can rearrange inequality (3) as a constraint for The second constraint is on the channel transmission rate,. It is nonnegative and cannot exceed the maximal channel capacity,. That is The third constraint is on the occupancy of the decoder buffer, which should neither overflow nor underflow. We assume that the decoder fetches all the data that belongs to the next frame from the decoder buffer and decodes it within one time slot. In addition, we assume playback buffering of frames, i.e., the first frames are received and stored in the decoder buffer before the playback is started. The total end-to-end delay from the encoder buffer through the channel and decoder buffer to the decoder is thus frames delay. The decision on how much data is sent into the channel at the th time slot will directly affect the decoder buffer occupancy at the th time slot. To meet the constraint imposed on the decoder buffer occupancy at the th time slot, there is a corresponding limit on how much data can be sent through the channel at the th time slot. Denote the decoder buffer occupancy as, and the maximal (3) (4) (5)

5 1128 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 size of decoder buffer as the decoder buffer at the. We summarize the constraint on th time slot as Combining (5) and (6), we arrive at the following constraint for the channel transmission rate In summary, inequalities (4) and (7) are the fundamental constraints for a single-user FGS video streaming system. B. Criteria for Visual Quality We adopt two visual quality criteria for video sequences to measure the average distortion and the quality fluctuation. More specifically, the average received quality is measured by the average mean square error (avemse) of all frames in a video sequence where represents the MSE between the th original frame and the decoded frame with rate. To account for the fluctuation of quality between consecutive frames, a large value of which can be objectionable to viewers, we use the mean absolute difference of consecutive frames mean square error (madmse) to measure the perceptual fluctuation The higher the madmse is, the larger the perceptual fluctuation is. We also define the corresponding PSNR version of these two criteria and denote as avepsnr and madpsnr, respectively. (6) (7) (8) (9) C. Problem Formulation Our objective is to design a rate control strategy to achieve both low avemse (high avepsnr) and low madmse (low madpsnr) subject to the constraints of (4) and (7). For offline applications where the entire video content is readily available before the transmission, all R-D information is known and we can formulate this system as (10), shown at the bottom of the page. In this formulation, is a function reflecting the importance and relevance of the average distortion and the quality fluctuation in the human perceptual system. For example, a linear combination function of avemse and madmse is a simple choice of. An optimal solution can be found for the above offline problem using standard nonlinear programming with penalty functions. The complexity for searching for optimal solution would, however, be formidable except for short video clips. In addition, the offline solution is not applicable to online applications where the video content is not entirely available beforehand. If the variations of the R-D characteristics of video sources can be well captured by a finite-state Markovian chain, we can model this system using stochastic three-machine flowshop with finite buffers [27] and obtain an optimal rate control policy using dynamic programming techniques. However, it has been shown that a compressed video sequence trace has long-range dependence [28], which is different from the short-range dependence such as a Markovian process and cannot be handled well using existing solutions. Thus, in this paper, we focus on a sequential resource allocation solution that has a moderate amount of computational complexity and can accommodate online video applications. The strategy of choosing the effective encoding rate for the FGS layer and the channel transmission rate closely depends on the relative weights of the average distortion and the perceptual fluctuation in the objective function. To achieve low avemse alone, one may employ a greedy strategy to make the encoder buffer as full as possible all the time and make full use of the available channel bandwidth. This may lead to the desire to select at the upper bound in (7), namely (11) and to set the FGS rate at the upper bound in (4), which is denoted as and defined as (12) When the encoder buffer is always full, the amount of incoming data cannot exceed the maximal amount of data allowed to be sent through the channel at each time slot. This is equivalent to assigning the same bandwidth resource for transmitting each frame. When encountering intra-coded frames (or I-frames), which have a larger amount of data at the base layer than predictively coded frames, we will have very limited budget left for sending their associated FGS enhancement layers. The MSE of I-frames will thus be larger than the MSE of the other types of frames. This leads to a potential increase in madmse. On the other hand, low madmse may be achieved by assigning each frame a rate that corresponds to the same distortion,. To do so, we extract the R-D pairs from the FGS subject to (10)

6 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING 1129 encoder, approximate the R-D curve for each frame, and assign the rate for the FGS enhancement layer as. To prevent encoder buffer from overflowing when encountering I-frames or a new complex scene, we would have to allocate a small amount of data rate for the FGS layers of these I-frames. To keep the lowest madmse, other frames will also have a small amount of FGS-layer data. As a result, this second approach would not give a low avemse. Next, we present a new resource allocation algorithm that can achieve an improved tradeoff between the average distortion and the quality fluctuation. D. Proposed Resource Allocation Algorithm We introduce two weight factors in our proposed resource allocation algorithm to solve the above-mentioned problems. To overcome the quality fluctuation problem in the lowest-avemse scheme, we propose to use a fraction of the maximally allowed FGS data rate (determined by the buffer constraints) as the effective FGS encoding rate, i.e.,, where is a budget factor. Compared to adopting the full budget, the fractional budget can keep the encoder buffer occupancy low to accommodate future I-frames and other complex frames. As such, the rate budget available to the incoming I-frames will be close to the maximal encoder buffer size plus the full channel bandwidth, allowing for more FGS data of the I-frames to be sent to avoid a high increase in the madmse. To overcome the problem of low overall perceptual quality as in the lowest-madmse scheme, we relax the requirement of zero madmse fluctuation by taking partial consideration of both the rate that maintains zero madmse and the current occupancy of the encoder buffer. We quantify this strategy using a weight factor and allocate the FGS rate for the th frame as (13) where is the amount of FGS data needed to achieve the same perceptual quality as the previous frame and can be determined by (14) As we can see, the allocated FGS rate is determined using the two factors and. The lowest avemse scheme and madmse scheme are two special cases of this new strategy: when and, (13) becomes the lowest-avemse scheme; and when, (13) becomes the lowest-madmse scheme. We now examine how to select appropriate and to achieve a good tradeoff between low avemse and low madmse. 1) Selection of : We first fix and study the impact of on avepsnr and madpsnr when consecutive video frames have similar R-D characteristics. In this situation and with a fixed, when becomes larger, both the madpsnr and the avepsnr will increase. However, after passes a specific value,, the improvement of avepsnr is dramatically reduced while the quality fluctuation becomes more significant. This phenomenon is demonstrated in Fig. 3(a), where we use the first 200 Fig. 3. Impact of and w on visual quality for the first 400 frames of the grandmother sequence. (a) avepsnr and madpsnr for different. (b) avepsnr and madpsnr for different w. frames from the QCIF video clip of the grandmother as an example and set the factor at Given such trends of avepsnr and madpsnr for different, the value, indicated by a vertical line in Fig. 3(a), provides a good tradeoff between avepsnr and madpsnr. As shown in Appendix I, can be expressed as (15) where represents the average rate of the base layer, which can be approximated using a moving average of the bit rate statistics of the past frames. We can see that is an equilibratory operating point to keep the encoder buffer near empty and the channel utilization near full. We should notice that in reality, the consecutive video frames do not have exactly the same R-D characteristics. So if is set to be exactly, the system is on the verge between stable and unstable operation: the encoder buffer is nearly empty, and as the video content fluctuates, the buffer may underflow. Thus, to ensure a high utilization of channel bandwidth and high avepsnr, we should select a that is slightly above such that, where is a small positive constant. 2) Selection of : In general, a system with a high value of has low fluctuation of visual quality. When consecutive frames within a video segment have similar R-D characteristics, increasing affects only the madpsnr while the avepsnr has

7 1130 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 TABLE II PROPOSED SINGLE-USER RESOURCE ALLOCATION ALGORITHM (S-LDLF) Fig. 4. Selection of w according to the encoder buffer occupancy. little decrease until is close to one. To achieve low fluctuation of quality, high value of is preferred. This trend is illustrated in Fig. 3(b), where we again use the above-mentioned grandmother video clip as an example and set to a fixed value of 0.5. When two adjacent frames exhibit significant difference in R-D characteristics such as when arriving at scene boundary, we need to make adjustment to this system to handle the following frames. We consider two cases here. The first case is that the video sequence enters a new segment with more complex R-D characteristics than that of the previous segment, whereby the FGS rate required to maintain the same PSNR level as before is higher than the FGS rate for the previous sequence. To balance between the need of preventing the encoder buffer from overflowing and controlling the fluctuation of perceptual quality, we dynamically adjust the weight factor with respect to the encoder buffer occupancy. When the encoder buffer occupancy is lower than a threshold, we set at a high value to keep the distortion similar to that of the previous frame. When the encoder buffer occupancy is higher than threshold, we try to drain out the data from the buffer quickly by choosing as a concave and decreasing function of the buffer occupancy as shown in Fig. 4, so that the higher the buffer occupancy is, the lower is. As an example, the overall selection of can be chosen as (16) where is the step function, and and are positive constants. The second case is that the video sequence enters a new segment with simpler R-D characteristics than that of the past segment, whereby the FGS rate required to maintain the same PSNR level as before is lower than the FGS rate for the previous sequence. To balance between fully utilizing the available channel bandwidth resource and maintaining low fluctuation of quality, after detecting a change in R-D characteristics, we immediately adjust to a low value to utilize more available bandwidth and maintain this value for the following frames. As the scene transition is complete and the channel bandwidth becomes highly utilized again, we can adjust back to a high value to maintain constant quality. In summary, we adjust dynamically according to the encoder buffer occupancy and the detection of significant change in R-D characteristics. The changes in R-D characteristics can be identified by calculating the relative rate change between and, i.e., we check whether is greater than a threshold. The parameter will be chosen to be right above as in (15) and the channel transmission rate according to (11). We present the detailed algorithm in Table II. V. LOW-DELAY BANDWIDTH RESOURCE ALLOCATION FOR MULTIPLE USERS In this section, we extend the proposed bandwidth resource allocation algorithm from handling single user to multiple users. A simple way to deal with multiple users/sequences is to allocate a fixed amount of resource, including various buffers and channel bandwidth, to each user, and apply our proposed single-user approach to each individual user. We shall call this strategy multiple single-user approach. A more sophisticated approach allows for dynamically allocating resource among users and has the potential to improve the utilization of critical resources. Multiple users share the total channel bandwidth and buffer capacity, and a central resource allocation system dynamically distribute these system resources to handle the transmission of the video sequences from all users. We shall call this class of strategies dynamic multiuser approaches. We will focus on the dynamic multiuser approach and aim at achieving high average visual quality and low fluctuation of quality for each user. We will examine the scenarios of uniform quality of service among all users versus differentiated service. The performance of the dynamic multiuser strategy will be compared with the multiple single-user strategy through simulations in Section VI.

8 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING 1131 and (19) Since all users share the overall bandwidth, both the individual and the aggregate channel transmission rate should be nonnegative and not exceed the maximal capacity. These channel transmission rate constraints can be described as (20) Fig. 5. Block diagram of a multiuser video streaming system. A. System Constraints An -user system is depicted in Fig. 5. At the server side, each user has his/her own video encoder to encode a different video program in real time. For the th user, the corresponding encoder sends the measured parameters of the R-D model of the current th frame to the resource allocation module. The parameters are in the form of for the first th bitplane. Using the R-D model, the resource allocation module determines the amount of FGS data to be transmitted. The encoder of each user then moves both the base layer data at the rate of and the FGS layer bitstream truncated at the allocated rate to the shared server buffer whose maximal capacity is. Denote the occupancy of the shared server buffer at the time slot as and the amount of data left by the th user in the server buffer as. We can treat as a virtual encoder buffer for the th user, and the sum of all virtual encoder buffers occupancy equals to. All users also share a channel whose maximal outbound capacity is. The resource allocation module needs to determine the channel transmission rate allocated for each user s data at the time slot, which we denote as for the th user. Upon receiving the data packets of the video program intended for him/her, each end-user first stores them temporarily in the decoder buffer, then decodes and renders each frame on time. In summary, similar to the single user case, the duty of the multiuser resource allocation module is to determine and jointly for all users. In parallel to the single-user case, there are three sets of system constraints for multiuser resource allocation. The first set of constraints is on the server buffer, which should not overflow. In particular, the sum of all virtual encoder buffers should not exceed the capacity of the server buffer. The dynamics of the buffer occupancy can be extended from the single-user case. The constraints can be described as (17) The constraints of the FGS layer rates for each user can be extended from the single-user problem and described as (18) (21) The constraints of the decoder buffer are the same as the single-user case (22) where is the maximal size of the th decoder buffer, the channel transmission delay for user, and the prestored frame delay in the decoder buffer for user. Rearranging and combining (20) and (22), we obtain a simplified constraint for the individual channel transmission rate where (23) Inequalities (18), (19), (21), and (23) are fundamental constraints in a multiuser system. Under these constraints, we determine the rate of the FGS data and the channel transmission rate for each user in the system to achieve low fluctuation of perceptual quality of each program as well as the desired uniform or differentiated perceptual quality among all programs. B. Proposed Resource Allocation Algorithm Our proposed multiuser resource allocation algorithm first allocates the channel transmission rate for each user subject to (21) and (23). With a selected channel transmission rate, we extend rate control strategy that we have proposed for the single-user case to the multiuser case to determine the feasible range for FGS layer data of each user according to (18) and (19). Specifically, we use two weight factors and to achieve a tradeoff between average perceptual quality and quality fluctuations. 1) Selection of Channel Transmission Rate: As all users share the overall channel bandwidth in multiuser system, we need to dynamically adjust the transmission rate allocated for each user. Our strategy consists of two steps: first, we assign each user a lower bound of channel transmission rate to prevent all decoder buffers from underflowing. Second, to help drain out the virtual encoder buffers, we distribute the rest of the available bandwidth to each user proportional to his/her previous encoding rate. Thus, when a video program encounters an I-frame and leaves a large amount of data in its virtual encoder buffer at the previous

9 1132 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 time slot, our strategy will assign the corresponding user a high channel transmission rate to drain his/her virtual encoder buffer at the current time slot. 2) Selection of FGS Rate: As in the single-user strategy proposed in Section IV-D, to balance between low fluctuation of quality and high average quality, we introduce two weight factors to our multiuser algorithm, namely, and. We first take an aggregated view on how much total bit rate are spent in the base layer for all users, and on what the upper bound on total FGSrate is at the th time slot according to (19). This is as if the aggregated rates are applied to a single super-user. The factor is applied to to obtain a fractional FGS rate budget that helps overcome the quality fluctuation. Next, we distribute to each user. For applications that desire uniform quality among users, the fractional rate budget for each user is determined through the following optimization formulation TABLE III PROPOSED MULTIUSER RESOURCE ALLOCATION ALGORITHM (M-LDLF) subject to (24) Since the R-D functions are monotonically decreasing, this optimization problem with equality constraints can be easily solved using bisection search. The search algorithm calculates the total required rates to achieve a target distortion, and then increases the target distortion at the next iteration if the total required rates is higher than the rate constraint and vice versa. Finally, we determine the allocated FGS rate for each user using a similar linear combination as in (13) (25) where represents the FGS rate for the th user in the th frame (time slot) in order to maintain the same quality as the previous frame, and is the upper bound in (18). C. Differentiated Service (DS) Differentiated service (DS) refers to a service in which each user receives different quality according to his/her service agreement with the server. We consider a scenario that at the beginning of the service, each user submits his/her priority request, quantified by, such that the average distortion received by each user normalized by is constant: (26) In other words, a user who specifies a smaller value of (and possibly pays a premium fee in return) will receive a higher overall perceptual quality. This can be achieved by modifying the optimization problem in (24) as follows: subject to (27) The uniform quality problem of (24) is a special case of (27) when all s are the same. This generalized optimization problem can also be solved using bisection search. We present the complete multiuser algorithm in Table III. VI. EXPERIMENTAL RESULTS In this section, we examine the performance of the proposed low-delay resource allocation algorithm with low-fluctuation of quality (LDLF), and compare it with two alternatives. The first alternative is the constant-bitrate (CBR) approach, which assigns a constant bit rate to each frame. The second alternative is a look-ahead sliding-window algorithm (SWLF) with buffer constraints adapted from [1], the details of which are given in Appendix II. Three statistics are used to evaluate the proposed algorithm and the two alternatives: the average PSNR (avepsnr), the mean of absolute difference of PSNR (madpsnr), and the overall channel utilization (ChUtiliz). A. Experiment Setup We concatenate 15 QCIF ( ) video sequences to form one testing video sequence of 5760 frames. The 15 sequences are 300-frame Akiyo, 360-frame carphone, 480-frame Claire, 300-frame coastguard, 300-frame container, 390-frame foreman, 870-frame grandmother, 330-frame hall objects,

10 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING frame Miss American, 960-frame mother and daughter, 300-frame MPEG4 news, 420-frame salesman, 300-frame silent, 150-frame Suzie, and 150-frame Trevor. The base layer is generated by MPEG-4 encoder with a fixed quantization step of 30 and the GOP pattern is 29 P-frames after one I-frame. All frames of FGS layer have up to six bit planes. For users in this system, we allocate K bits for the server buffer and the shared maximal channel capacity is kb/s. Each user has a small decoder buffer of 400 kb. For each user, the transmission delay,, is 3 frames and initial playback delay,, is 3 frames. The parameters (,,,,, ) used in the LDLF algorithm are set to. B. Experimental Results for Single-User Rate Control For the single-user system, the video content is picked from frame 301 to 2100, corresponding to the video sequences of carphone, Claire, coastguard, container, and foreman. Fig. 6 shows the avepsnr, madpsnr, and ChUtiliz using the three different algorithms. The solid line with triangle makers represents the results of single-user SWLF algorithm (S-SWLF) with different window sizes. The solid line represents the results of CBR approach, which encodes each frame at a constant bit rate of 32 kb. The dotted and dashed lines represent the results of the proposed single-user LDLF algorithm (S-LDLF) with and (0.98,0.3), respectively. As the CBR and S-LDLF algorithms do not have the window parameter, we plot their results as horizontal lines to allow for the performance comparison among CBR, S-LDLF, and S-SWLF of different window sizes. As we can see from Fig. 6, CBR approach provides the highest average perceptual quality and channel utilization. However, CBR has the worst fluctuation of visual quality. In contrast, our experiment shows that variable rate control for video, such as S-LDLF and S-SWLF, can provide more consistent quality. For the proposed S-LDLF algorithm, a higher value of gives smaller madpsnr and a little lower avepsnr as expected. We also observe that madpsnr decreases when the window size increases in S-SWLF algorithm. To provide sufficient smoothening, the window size of the S-SWLF algorithm needs to be at least the size of half to one GOP to cover an I-frame of high data rate, which is frames in our experiment. We compare S-SWLF with window size 30 frames with S-LDLF with, and present the PSNR results for each frame in Fig. 7. We can see that S-SWLF and S-LDLF have similar performance in terms of visual quality. However, to achieve this comparable performance, S-SWLF requires about one-second more delay and a corresponding large amount of extra storage for the look-ahead frames on the encoder side, while the proposed S-LDLF algorithm requires no extra delay and storage on the encoder side. C. Experimental Results for Multiuser Resource Allocation For the multiuser system, the content program for each user is 1200 frames long and starts from a randomly selected I-frame of the testing video source. If the length of this video source Fig. 6. Comparison of CBR, S-SWLF, and the proposed S-LDLF rate control algorithms for video frame from the testing sequence. (a) avepsnr. (b) madpsnr. (c) Channel utilization. is not long enough, we loop from the beginning of the testing sequence. We repeat the simulation multiple times for a total of about user cases to obtain the averaged results for the systems with 4, 8, 16, 32, 64, and 128 users. We first demonstrate the performance when all users request the same level of visual quality. Fig. 8 shows the average of all users avepsnr, madpsnr, and ChUtiliz for different number of users using four algorithms, namely, the CBR algorithm (CBR), the multiuser SWLF algorithm (M-SWLF) with different window sizes, the multiple single-user approach using the above S-LDLF (M S-LDLF) with, and the proposed multiuser LDLF (M-LDLF) approach with. The CBR approach assigns each user a fixed encoding rate, 32 kb/frame. The M S-LDLF system provides individual encoder buffer (80 kb) and channel bandwidth (960 kb/s) to each user using S-LDLF algorithm. As we can see from Fig. 8, the CBR approach suffers from much higher fluctuation of visual quality than any other approaches, suggesting once again the need of variable rate control. Among

11 1134 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 Fig. 7. Frame-by-frame PSNR comparison of the CBR, S-SWLF, and the proposed S-LDLF approaches. the three approaches providing VBR video, the two joint resource allocation approaches for multiuser system, M-SWLF and M-LDLF, provide smaller fluctuation of visual quality than the individual resource allocation scheme (M S-LDLF). The more users a system has, the higher possibility we can take the advantage of the variations of video content similar to those in multiplexing [4] and offer desired quality to each user through dynamic bandwidth allocation. Comparing these two dynamic multiuser algorithms, the fluctuation of visual quality of M-LDLF algorithm is between the quality fluctuations of the M-SWLF algorithm with window sizes of 15 and 30 frames; and the performance of both visual quality measurements, avepsnr and madpsnr, of the M-LDLF algorithm approaches the results of M-SWLF with window size 30 frames when the number of users increases. To compare the frame-by-frame PSNR of the three algorithms, M S-LDLF, M-SWLF, and M-LDLF, we simulate the scenario in which the content program for user is 1200 frames long and starts from frame of the testing video source. Fig. 9 shows the frame-by-frame PSNR of the first and tenth users in the M S-LDLF, M-SWLF, and M-LDLF systems when there are 16 and 32 users in the systems, respectively. Again, we see that the dynamic multiuser approaches (M-LDLF and M-SWLF) can provide more uniform quality than the multiple single-user approach both within a scene and when crossing scene boundaries. When the number of users increases, the gain from joint resource allocation is more significant, providing more uniform quality and less quality fluctuation. Between the two dynamic multiuser approaches, our proposed M-LDLF approach can achieve similar perceptual quality to that of M-SWLF approach with large window size (30 frames); however, similar to the single-user case, the prior work M-SWLF needs a longer delay (1 s) and a substantially larger storage than the propose approach. This additional storage is for keeping the look-ahead data of all users. In the example illustrated above, M-SWLF needs an extra storage of 30 frames/user 32 users 960 frames. As a result, the proposed approach Fig. 8. Comparison of CBR, M-SWLF, M S-LDLF, and M-LDLF systems under uniform service for all users. Shown here are three statistics averaged among all users. (a) Average avepsnr among all users. (b) Average madpsnr among all users. (c) Average channel utilization among all users. has higher system scalability than the M-SWLF approach. This makes the proposed scheme an attractive choice for building a large system to accommodate many users. We now use Fig. 10 to demonstrate the differentiated service of a 32-user system by keeping the same system settings as in Fig. 9 except setting the differentiated service priority for the th user as follows: (28) Fig. 10(a) illustrates the average MSE for each user. As we can see, the proposed algorithm can achieve the required differentiated service priority, which is almost a linear line as we have designed for in (28). Fig. 10(b) and (c) highlight the received

12 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING 1135 Fig. 9. Frame-by-frame PSNR results of the first and tenth user in the M S-LDLF, M-SWLF, and M-LDLF systems. (a) First user in a 16-user system. (b) Tenth user in a 16-user system. (c) First user in a 32-user system. (d) Tenth user in a 32-user system. Fig. 10. Results of a 32-user system using M-LDLF with differentiated service. (a) avemse for each user. (b) PSNR for the first user. (c) PSNR for the 32nd user. PSNR for the first and the last user in this system, who request the lowest and the highest video quality, respectively. VII. CONCLUSION In summary, we have proposed an efficient bandwidth resource allocation algorithm for streaming multiple MPEG-4 FGS video sequences. By exploring the intra- and inter-frame R-D characteristics of MPEG-4 FGS codec, we present a control policy to achieve an excellent tradeoff between the average quality and quality fluctuation criteria. We demonstrate that multiuser systems with dynamic joint resource allocation provide more consistent quality than the multiple single-user approaches that do not dynamically share resources. Evaluating the video quality in terms of the average distortion and the quality fluctuation, our algorithm gives excellent performance comparable to those by the general look-ahead sliding-window approach. But compared to the existing approaches, our algorithm has higher system scalability, as it does not need a delay of dozens of frames long and does not require extra storage proportional to the number of users. Therefore, the proposed multiuser resource allocation algorithm with low

13 1136 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 9, SEPTEMBER 2005 delay and low fluctuation of quality can serve as an efficient and effective building block for real-time multiuser broad-band communications. APPENDIX I IMPACT OF BUDGET FACTOR ON PERCEPTUAL QUALITY In this appendix, we present detailed rationale behind the bandwidth resource allocation algorithm proposed in Section IV-D. In particular, for a video scene consisting of similar R-D characteristics, we analyze the trend of the avemse and the madmse as changes, and derive the result of (15) for. Consider an -frame video clip with similar visual contents, whereby the R-D model of each frame within the clip are similar to each other. As such, it is reasonable to assume that the feasible range of FGS data rates for all frames are within the same bit plane, and. The R-D model can be expressed as follows for rates falling in the range of interest for all (29) where and are constants. We denote and as the average rate of base layer and FGS layer within this video sequence, respectively. Here is fixed due to the use of contant quantization step for the base layer of all frames. Let us first consider madmse, which can be represented as follows using (29) (30) where. We examine the absolute difference of FGS rates between two frames at time slot (31) With a fixed, a larger would usually result in larger hence larger madmse by (30). Next, we consider avemse. With the R-D model in (29), we can express avemse as follows: (32) Summing up from to using (12) and (13) and taking the average, we obtain (33) where,, represent the average of their corresponding rates or buffer occupancy. Bringing, the condition of encoder buffer occupancy, and the conservation law of data flow that the overall input flow should be equal to the overall output flow we arrive at (34) (35) Taking (35) into (32), we can represent avemse as a function of (36) Thus, if the system has a larger, the average distortion (avemse) will decrease. Note that does not affect avemse as long as. The above analysis shows that a larger reduces avemse but leads to larger madmse. To complete the derivation for,we observe from (34) that since is fixed, increasing is equivalent to increasing. But when approaches the channel capacity, and avemse cannot be further improved. Thus, there exists such that its corresponding from (35) is equal to, i.e., (37) Recalling the results in (30) and (36), the selection of can give an excellent tradeoff between avemse and madmse. Solving (37) for, we arrive at the tradeoff point (38) APPENDIX II REVIEW OF AN ALTERNATIVE SLIDING-WINDOW ALGORITHM The sliding-window algorithm is originally proposed in [1] and only concerns the channel capacity without considering the buffer and delay constraints. To determine the bit rate for a frame, the sliding window algorithm requires the complete bit rate and R-D information of frames ahead. The algorithm then distributes the FGS rates to the current frame by solving an optimization problem that all frames within the look-ahead window have the uniform and highest possible quality subject to a FGS rate budget for all frames in this sliding window, denoted. This FGS rate budget is obtained by subtracting all base layer rates within the window from an overall rate budget,. The rate budget is updated by removing the bandwidth used for the previous frame and adding the currently available channel transmission rate. That is (39) (40) For a fair comparison with our proposed algorithms, we modify the sliding window approach by adding the delay and encoder/decoder buffer constraints. We also make two further modifications to fit in the scenarios considered in this paper. The first modification is on. When the sliding window does not across the scene boundary, it is reasonable to assume that the frames within the sliding window have similar R-D characteristics. Under this assumption, the estimated total transmission rate for these frames in the sliding window is, where is the upper bound in (7). To keep the occupancy

14 SU AND WU: EFFICIENT BANDWIDTH RESOURCE ALLOCATION FOR LOW-DELAY MULTIUSER VIDEO STREAMING 1137 of the encoder buffer low, the data left in the encoder buffer at time slot is flushed out during the next time slots. Thus, the modified FGS rate budget for a sliding window is (41) Second, we observe that when a sliding window enters a segment with simple R-D characteristics from a past segment with complex R-D characteristics, the encoder buffer may overflow and force the system to drop more FGS layer data, which leads to several severe quality fluctuations near the R-D characteristics dissimilarity boundaries (which often coincide with scene changes). To overcome this problem and allow fair comparison with our proposed schemes, we detect the change in R-D characteristics once it appears in the sliding window and adaptively reduce the window size so that the sliding window does not across this dissimilarity boundary. After passing the boundary, the size of the sliding window is restored to the original size. REFERENCES [1] X. M. Zhang, A. Vetro, Y. Q. Shi, and H. Sun, Constant quality constrained rate allocation for FGS video coded videos, IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 2, pp , Feb [2] L. Wang and A. Vincent, Bit allocation and constraints for joint coding of multiple video programs, IEEE Trans. Circuits Syst. Video Technol., vol. 9, no. 6, pp , Sep [3] L. Boroczky, A. Y. Nagi, and E. F. Westermann, Joint rate control with look-ahead for multi-program video coding, IEEE Trans. Circuits Syst. Video Technol., vol. 10, no. 7, pp , Oct [4] M. Wu, R. Joyce, H.-S. Wong, L. Guan, and S.-Y. Kung, Dynamic resource allocation via video content and short-term traffic statistics, IEEE Trans. Multimedia, vol. 3, no. 2, pp , Jun [5] C.-Y. Hsu, A. Ortega, and A. R. Reibman, Joint selection of source and channel rate for VBR video transmission under ATM policing constraints, IEEE J. Sel. Areas Commun., vol. 15, pp , Aug [6] L.-J. Lin and A. Ortega, Bit-rate control using piecewise approximated rate-distortion characteristics, IEEE Trans. Circuits Syst. Video Technol., vol. 8, no. 4, pp , Aug [7] Y. Yang and S. S. Hemami, Rate control for VBR video over ATM: Simplification and implementation, IEEE Trans. Circuits Syst. Video Technol., vol. 11, no. 9, pp , Sep [8] J. Xu, Z. Xiong, S. Li, and Y.-Q. Zhang, Memory-constrained 3-D wavelet transform for video coding without boundary effects, IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 9, pp , Sep [9] L. Zhao, J. Kim, and C.-C. J. Kuo, MPEG-4 FGS video streaming with constant-quality rate control and differentiated forwarding, in Proc. SPIE Conf. Visual Communications and Image Processing, 2002, pp [10] ISO/IEC JTC1/SC29/WG11 MPEG-4 Video Verification Model version 18.0 N3908, Jan [11] W. Li, Overview of fine granularity scalability in MPEG-4 video standard, IEEE Trans. Circuits Syst. Video Technol., vol. 11, no. 3, pp , Mar [12] H. M. Radha, M. van der Schaar, and Y. Chen, The MPEG-4 finegrained scalable video coding method for multimedia streaming over IP, IEEE Trans. Multimedia, vol. 3, no. 1, pp , Mar [13] M. van der Schaar and H. M. Radha, A hybrid temporal-snr finegranular scalability for internet video, IEEE Trans. Circuits Syst. Video Technol., vol. 11, no. 3, pp , Mar [14] T. V. Lakshman, A. Ortega, and A. R. Reibman, VBR video: Tradeoffs and potentials, Proc. IEEE, vol. 86, no. 5, pp , May [15] M.-T. Sun and A. R. Reibman, Compressed Video Over Networks. New York: Marcel Dekker, [16] A. R. Reibman and B. G. Haskell, Constraints on variable bit-rate video for ATM networks, IEEE Trans. Circuits Syst. Video Technol., vol. 2, no. 4, pp , Dec [17] W. Ding, Joint encoder and channel rate control of VBR video over ATM networks, IEEE Trans. Circuits Syst. Video Technol., vol. 7, no. 2, pp , Apr [18] E. C. Reed and J. S. Lim, Optimal multidimensional bit-rate control for video communication, IEEE Trans. Image Process., vol. 11, no. 8, pp , Aug [19] C. E. Luna, L. P. Kondi, and A. K. Katsaggelos, Maximizing user utility in video streaming applications, IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 2, pp , Feb [20] X. S. Zhou and S.-P. Liou, Optimal nonlinear sampling for video streaming at low bit rates, IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 6, pp , Jun [21] J. Ribas-Corbera and S. Lei, Rate control in DCT video coding for low-delay communications, IEEE Trans. Circuits Syst. Video Technol., vol. 9, no. 1, pp , Feb [22] H.-J. Lee, T. Chiang, and Y.-Q. Zhang, Scalable rate control for MPEG-4 video, IEEE Trans. Circuits Syst. Video Technol., vol. 10, no. 6, pp , Sep [23] K. N. Ngan, T. Meier, and Z. Chen, Improved single-video-object rate control for MPEG-4, IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 5, pp , May [24] Z. He and S. K. Mitra, A linear source model and a unified rate control algorithm for DCT video coding, IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 11, pp , Nov [25] H. Radha, Y. Chen, K. Parthasarathy, and R. Cohen, Scalable internet video using MPEG-4, Signal Process.: Image Commun., vol. 15, pp , Sep [26] R. Cohen and H. Radha, Streaming fine-grained scalable video over packet-based networks, in IEEE Global Telecommunications Conf., vol. 1, 2000, pp [27] S. P. Sethi and Q. Zhang, Hierarchical Decision Making in Stochastic Manufacturing Systems. Cambridge, MA: Birkhäuser Boston, [28] J. Beran, R. Sherman, M. S. Taqqu, and W. Willinger, Long-range dependence in variable-bit-rate video traffic, IEEE Trans. Commun., vol. 43, no. 2, pp , Feb Guan-Ming Su (S 04) received the B.S.E. degree from National Taiwan University, Taipei, Taiwan, R.O.C., in 1996 and the M.S. degree from University of Maryland, College Park, in 2001, both in electrical engineering. He is currently working toward the Ph.D. degree in the Department of Electrical and Computer Engineering at the University of Maryland, College Park. He was with the Research and Development Department, Qualcomm, Inc, San Diego, CA, during the summer of His research interests are multimedia communications and multimedia signal processing. Min Wu (S 95 M 01) received the B.E. degree in electrical engineering and the B.A. degree in economics from Tsinghua University, Beijing, China, in 1996 (both with the highest honors), and the M.A. and Ph.D. degrees in electrical engineering from Princeton University, Princeton, NJ, in 1998 and 2001, respectively. She was with NEC Research Institute and Signafy, Inc. in 1998, and with Panasonic Information and Networking Laboratories in Since 2001, she has been an Assistant Professor of the Department of Electrical and Computer Engineering, the Institute of Advanced Computer Studies, and the Institute of Systems Research at the University of Maryland, College Park. Her research interests include information security, multimedia signal processing, and multimedia communications. She co-authored a book Multimedia Data Hiding (New York: Springer-Verlag, 2003) and holds four U.S. patents on multimedia security. Dr. Wu received a CAREER award from the U.S. National Science Foundation in 2002, and a George Corcoran Education Award from University of Maryland in 2003, a TR100 Young Investigator Award from the MIT Technology Review Magazine in 2004, a EURASIP Best Paper Award, and a U.S. Office of Naval Research Young Investigator Award in She is a Member of the IEEE Technical Committees on Multimedia Signal Processing and on Multimedia Systems and Applications, Publicity Chair of 2003 IEEE International Conference on Multimedia and Expo, and a guest editor of the Special Issue on Media Security and Rights Management for the EURASIP Journal on Applied Signal Processing.

Efficient Bandwidth Resource Allocation for Low-Delay Multiuser MPEG-4 Video Transmission

Efficient Bandwidth Resource Allocation for Low-Delay Multiuser MPEG-4 Video Transmission Efficient Bandwidth Resource Allocation for Low-Delay Multiuser MPEG-4 Video Transmission Guan-Ming Su and Min Wu Department of Electrical and Computer Engineering, University of Maryland, College Park,

More information

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010 1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010 Delay Constrained Multiplexing of Video Streams Using Dual-Frame Video Coding Mayank Tiwari, Student Member, IEEE, Theodore Groves,

More information

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Mohamed Hassan, Taha Landolsi, Husameldin Mukhtar, and Tamer Shanableh College of Engineering American

More information

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Constant Bit Rate for Video Streaming Over Packet Switching Networks International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Constant Bit Rate for Video Streaming Over Packet Switching Networks Mr. S. P.V Subba rao 1, Y. Renuka Devi 2 Associate professor

More information

Minimax Disappointment Video Broadcasting

Minimax Disappointment Video Broadcasting Minimax Disappointment Video Broadcasting DSP Seminar Spring 2001 Leiming R. Qian and Douglas L. Jones http://www.ifp.uiuc.edu/ lqian Seminar Outline 1. Motivation and Introduction 2. Background Knowledge

More information

Analysis of Video Transmission over Lossy Channels

Analysis of Video Transmission over Lossy Channels 1012 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 18, NO. 6, JUNE 2000 Analysis of Video Transmission over Lossy Channels Klaus Stuhlmüller, Niko Färber, Member, IEEE, Michael Link, and Bernd

More information

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions 1128 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 10, OCTOBER 2001 An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions Kwok-Wai Wong, Kin-Man Lam,

More information

Behavior Forensics for Scalable Multiuser Collusion: Fairness Versus Effectiveness H. Vicky Zhao, Member, IEEE, and K. J. Ray Liu, Fellow, IEEE

Behavior Forensics for Scalable Multiuser Collusion: Fairness Versus Effectiveness H. Vicky Zhao, Member, IEEE, and K. J. Ray Liu, Fellow, IEEE IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. 1, NO. 3, SEPTEMBER 2006 311 Behavior Forensics for Scalable Multiuser Collusion: Fairness Versus Effectiveness H. Vicky Zhao, Member, IEEE,

More information

Chapter 10 Basic Video Compression Techniques

Chapter 10 Basic Video Compression Techniques Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video compression 10.2 Video Compression with Motion Compensation 10.3 Video compression standard H.261 10.4 Video compression standard

More information

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS Item Type text; Proceedings Authors Habibi, A. Publisher International Foundation for Telemetering Journal International Telemetering Conference Proceedings

More information

Multimedia Communications. Image and Video compression

Multimedia Communications. Image and Video compression Multimedia Communications Image and Video compression JPEG2000 JPEG2000: is based on wavelet decomposition two types of wavelet filters one similar to what discussed in Chapter 14 and the other one generates

More information

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY (Invited Paper) Anne Aaron and Bernd Girod Information Systems Laboratory Stanford University, Stanford, CA 94305 {amaaron,bgirod}@stanford.edu Abstract

More information

Multimedia Communications. Video compression

Multimedia Communications. Video compression Multimedia Communications Video compression Video compression Of all the different sources of data, video produces the largest amount of data There are some differences in our perception with regard to

More information

An Overview of Video Coding Algorithms

An Overview of Video Coding Algorithms An Overview of Video Coding Algorithms Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Video coding can be viewed as image compression with a temporal

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Lesson 27 H.264 standard Lesson Objectives At the end of this lesson, the students should be able to: 1. State the broad objectives of the H.264 standard. 2. List the improved

More information

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences Michael Smith and John Villasenor For the past several decades,

More information

Bit Rate Control for Video Transmission Over Wireless Networks

Bit Rate Control for Video Transmission Over Wireless Networks Indian Journal of Science and Technology, Vol 9(S), DOI: 0.75/ijst/06/v9iS/05, December 06 ISSN (Print) : 097-686 ISSN (Online) : 097-5 Bit Rate Control for Video Transmission Over Wireless Networks K.

More information

Video coding standards

Video coding standards Video coding standards Video signals represent sequences of images or frames which can be transmitted with a rate from 5 to 60 frames per second (fps), that provides the illusion of motion in the displayed

More information

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes Digital Signal and Image Processing Lab Simone Milani Ph.D. student simone.milani@dei.unipd.it, Summer School

More information

Bridging the Gap Between CBR and VBR for H264 Standard

Bridging the Gap Between CBR and VBR for H264 Standard Bridging the Gap Between CBR and VBR for H264 Standard Othon Kamariotis Abstract This paper provides a flexible way of controlling Variable-Bit-Rate (VBR) of compressed digital video, applicable to the

More information

Understanding Compression Technologies for HD and Megapixel Surveillance

Understanding Compression Technologies for HD and Megapixel Surveillance When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Lesson 24 MPEG-2 Standards Lesson Objectives At the end of this lesson, the students should be able to: 1. State the basic objectives of MPEG-2 standard. 2. Enlist the profiles

More information

CONSTRAINING delay is critical for real-time communication

CONSTRAINING delay is critical for real-time communication 1726 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 7, JULY 2007 Compression Efficiency and Delay Tradeoffs for Hierarchical B-Pictures and Pulsed-Quality Frames Athanasios Leontaris, Member, IEEE,

More information

MPEG has been established as an international standard

MPEG has been established as an international standard 1100 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 Fast Extraction of Spatially Reduced Image Sequences from MPEG-2 Compressed Video Junehwa Song, Member,

More information

Motion Video Compression

Motion Video Compression 7 Motion Video Compression 7.1 Motion video Motion video contains massive amounts of redundant information. This is because each image has redundant information and also because there are very few changes

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Recommendation ITU-T H.261 Fernando Pereira The objective of this lab session about Recommendation ITU-T H.261 is to get the students familiar with many aspects

More information

The H.263+ Video Coding Standard: Complexity and Performance

The H.263+ Video Coding Standard: Complexity and Performance The H.263+ Video Coding Standard: Complexity and Performance Berna Erol (bernae@ee.ubc.ca), Michael Gallant (mikeg@ee.ubc.ca), Guy C t (guyc@ee.ubc.ca), and Faouzi Kossentini (faouzi@ee.ubc.ca) Department

More information

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard Ram Narayan Dubey Masters in Communication Systems Dept of ECE, IIT-R, India Varun Gunnala Masters in Communication Systems Dept

More information

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005. Wang, D., Canagarajah, CN., & Bull, DR. (2005). S frame design for multiple description video coding. In IEEE International Symposium on Circuits and Systems (ISCAS) Kobe, Japan (Vol. 3, pp. 19 - ). Institute

More information

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1 MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1 Toshiyuki Urabe Hassan Afzal Grace Ho Pramod Pancha Magda El Zarki Department of Electrical Engineering University of Pennsylvania Philadelphia,

More information

Dynamic bandwidth allocation scheme for multiple real-time VBR videos over ATM networks

Dynamic bandwidth allocation scheme for multiple real-time VBR videos over ATM networks Telecommunication Systems 15 (2000) 359 380 359 Dynamic bandwidth allocation scheme for multiple real-time VBR videos over ATM networks Chae Y. Lee a,heem.eun a and Seok J. Koh b a Department of Industrial

More information

Visual Communication at Limited Colour Display Capability

Visual Communication at Limited Colour Display Capability Visual Communication at Limited Colour Display Capability Yan Lu, Wen Gao and Feng Wu Abstract: A novel scheme for visual communication by means of mobile devices with limited colour display capability

More information

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4 Contents List of figures List of tables Preface Acknowledgements xv xxi xxiii xxiv 1 Introduction 1 References 4 2 Digital video 5 2.1 Introduction 5 2.2 Analogue television 5 2.3 Interlace 7 2.4 Picture

More information

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora MULTI-STATE VIDEO CODING WITH SIDE INFORMATION Sila Ekmekci Flierl, Thomas Sikora Technical University Berlin Institute for Telecommunications D-10587 Berlin / Germany ABSTRACT Multi-State Video Coding

More information

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Implementation of an MPEG Codec on the Tilera TM 64 Processor 1 Implementation of an MPEG Codec on the Tilera TM 64 Processor Whitney Flohr Supervisor: Mark Franklin, Ed Richter Department of Electrical and Systems Engineering Washington University in St. Louis Fall

More information

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter?

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter? Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter? Yi J. Liang 1, John G. Apostolopoulos, Bernd Girod 1 Mobile and Media Systems Laboratory HP Laboratories Palo Alto HPL-22-331 November

More information

Dual frame motion compensation for a rate switching network

Dual frame motion compensation for a rate switching network Dual frame motion compensation for a rate switching network Vijay Chellappa, Pamela C. Cosman and Geoffrey M. Voelker Dept. of Electrical and Computer Engineering, Dept. of Computer Science and Engineering

More information

Feasibility Study of Stochastic Streaming with 4K UHD Video Traces

Feasibility Study of Stochastic Streaming with 4K UHD Video Traces Feasibility Study of Stochastic Streaming with 4K UHD Video Traces Joongheon Kim and Eun-Seok Ryu Platform Engineering Group, Intel Corporation, Santa Clara, California, USA Department of Computer Engineering,

More information

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding Free Viewpoint Switching in Multi-view Video Streaming Using Wyner-Ziv Video Coding Xun Guo 1,, Yan Lu 2, Feng Wu 2, Wen Gao 1, 3, Shipeng Li 2 1 School of Computer Sciences, Harbin Institute of Technology,

More information

Relative frequency. I Frames P Frames B Frames No. of cells

Relative frequency. I Frames P Frames B Frames No. of cells In: R. Puigjaner (ed.): "High Performance Networking VI", Chapman & Hall, 1995, pages 157-168. Impact of MPEG Video Trac on an ATM Multiplexer Oliver Rose 1 and Michael R. Frater 2 1 Institute of Computer

More information

Digital Video Telemetry System

Digital Video Telemetry System Digital Video Telemetry System Item Type text; Proceedings Authors Thom, Gary A.; Snyder, Edwin Publisher International Foundation for Telemetering Journal International Telemetering Conference Proceedings

More information

PACKET-SWITCHED networks have become ubiquitous

PACKET-SWITCHED networks have become ubiquitous IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 13, NO. 7, JULY 2004 885 Video Compression for Lossy Packet Networks With Mode Switching and a Dual-Frame Buffer Athanasios Leontaris, Student Member, IEEE,

More information

Delay Cognizant Video Coding: Architecture, Applications and Quality Evaluations

Delay Cognizant Video Coding: Architecture, Applications and Quality Evaluations Draft to be submitted to IEEE Transactions on Image Processing. Please send comments to Yuan-Chi Chang at yuanchi@eecs.berkeley.edu. Delay Cognizant Video Coding: Architecture, Applications and Quality

More information

Principles of Video Compression

Principles of Video Compression Principles of Video Compression Topics today Introduction Temporal Redundancy Reduction Coding for Video Conferencing (H.261, H.263) (CSIT 410) 2 Introduction Reduce video bit rates while maintaining an

More information

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder. Video Transmission Transmission of Hybrid Coded Video Error Control Channel Motion-compensated Video Coding Error Mitigation Scalable Approaches Intra Coding Distortion-Distortion Functions Feedback-based

More information

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Research Article. ISSN (Print) *Corresponding author Shireen Fathima Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2014; 2(4C):613-620 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)

More information

Pattern Smoothing for Compressed Video Transmission

Pattern Smoothing for Compressed Video Transmission Pattern for Compressed Transmission Hugh M. Smith and Matt W. Mutka Department of Computer Science Michigan State University East Lansing, MI 48824-1027 {smithh,mutka}@cps.msu.edu Abstract: In this paper

More information

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding Min Wu, Anthony Vetro, Jonathan Yedidia, Huifang Sun, Chang Wen

More information

Interleaved Source Coding (ISC) for Predictive Video Coded Frames over the Internet

Interleaved Source Coding (ISC) for Predictive Video Coded Frames over the Internet Interleaved Source Coding (ISC) for Predictive Video Coded Frames over the Internet Jin Young Lee 1,2 1 Broadband Convergence Networking Division ETRI Daejeon, 35-35 Korea jinlee@etri.re.kr Abstract Unreliable

More information

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS Habibollah Danyali and Alfred Mertins School of Electrical, Computer and

More information

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards COMP 9 Advanced Distributed Systems Multimedia Networking Video Compression Standards Kevin Jeffay Department of Computer Science University of North Carolina at Chapel Hill jeffay@cs.unc.edu September,

More information

Chapter 2 Introduction to

Chapter 2 Introduction to Chapter 2 Introduction to H.264/AVC H.264/AVC [1] is the newest video coding standard of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG). The main improvements

More information

INTERNATIONAL TELECOMMUNICATION UNION. SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Coding of moving video

INTERNATIONAL TELECOMMUNICATION UNION. SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Coding of moving video INTERNATIONAL TELECOMMUNICATION UNION CCITT H.261 THE INTERNATIONAL TELEGRAPH AND TELEPHONE CONSULTATIVE COMMITTEE (11/1988) SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Coding of moving video CODEC FOR

More information

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks July 22 nd 2008 Vineeth Shetty Kolkeri EE Graduate,UTA 1 Outline 2. Introduction 3. Error control

More information

Adaptive Key Frame Selection for Efficient Video Coding

Adaptive Key Frame Selection for Efficient Video Coding Adaptive Key Frame Selection for Efficient Video Coding Jaebum Jun, Sunyoung Lee, Zanming He, Myungjung Lee, and Euee S. Jang Digital Media Lab., Hanyang University 17 Haengdang-dong, Seongdong-gu, Seoul,

More information

IN OBJECT-BASED video coding, such as MPEG-4 [1], an. A Robust and Adaptive Rate Control Algorithm for Object-Based Video Coding

IN OBJECT-BASED video coding, such as MPEG-4 [1], an. A Robust and Adaptive Rate Control Algorithm for Object-Based Video Coding IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 14, NO. 10, OCTOBER 2004 1167 A Robust and Adaptive Rate Control Algorithm for Object-Based Video Coding Yu Sun, Student Member, IEEE,

More information

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS ABSTRACT FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS P J Brightwell, S J Dancer (BBC) and M J Knee (Snell & Wilcox Limited) This paper proposes and compares solutions for switching and editing

More information

TERRESTRIAL broadcasting of digital television (DTV)

TERRESTRIAL broadcasting of digital television (DTV) IEEE TRANSACTIONS ON BROADCASTING, VOL 51, NO 1, MARCH 2005 133 Fast Initialization of Equalizers for VSB-Based DTV Transceivers in Multipath Channel Jong-Moon Kim and Yong-Hwan Lee Abstract This paper

More information

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264 Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264 Ju-Heon Seo, Sang-Mi Kim, Jong-Ki Han, Nonmember Abstract-- In the H.264, MBAFF (Macroblock adaptive frame/field) and PAFF (Picture

More information

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING Harmandeep Singh Nijjar 1, Charanjit Singh 2 1 MTech, Department of ECE, Punjabi University Patiala 2 Assistant Professor, Department

More information

CONTEMPORARY hybrid video codecs use motion-compensated

CONTEMPORARY hybrid video codecs use motion-compensated IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 18, NO. 2, FEBRUARY 2008 249 Transactions Letters Dual Frame Motion Compensation With Uneven Quality Assignment Vijay Chellappa, Pamela

More information

Improvement of MPEG-2 Compression by Position-Dependent Encoding

Improvement of MPEG-2 Compression by Position-Dependent Encoding Improvement of MPEG-2 Compression by Position-Dependent Encoding by Eric Reed B.S., Electrical Engineering Drexel University, 1994 Submitted to the Department of Electrical Engineering and Computer Science

More information

II. SYSTEM MODEL In a single cell, an access point and multiple wireless terminals are located. We only consider the downlink

II. SYSTEM MODEL In a single cell, an access point and multiple wireless terminals are located. We only consider the downlink Subcarrier allocation for variable bit rate video streams in wireless OFDM systems James Gross, Jirka Klaue, Holger Karl, Adam Wolisz TU Berlin, Einsteinufer 25, 1587 Berlin, Germany {gross,jklaue,karl,wolisz}@ee.tu-berlin.de

More information

Drift Compensation for Reduced Spatial Resolution Transcoding

Drift Compensation for Reduced Spatial Resolution Transcoding MERL A MITSUBISHI ELECTRIC RESEARCH LABORATORY http://www.merl.com Drift Compensation for Reduced Spatial Resolution Transcoding Peng Yin Anthony Vetro Bede Liu Huifang Sun TR-2002-47 August 2002 Abstract

More information

AN UNEQUAL ERROR PROTECTION SCHEME FOR MULTIPLE INPUT MULTIPLE OUTPUT SYSTEMS. M. Farooq Sabir, Robert W. Heath and Alan C. Bovik

AN UNEQUAL ERROR PROTECTION SCHEME FOR MULTIPLE INPUT MULTIPLE OUTPUT SYSTEMS. M. Farooq Sabir, Robert W. Heath and Alan C. Bovik AN UNEQUAL ERROR PROTECTION SCHEME FOR MULTIPLE INPUT MULTIPLE OUTPUT SYSTEMS M. Farooq Sabir, Robert W. Heath and Alan C. Bovik Dept. of Electrical and Comp. Engg., The University of Texas at Austin,

More information

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique Dhaval R. Bhojani Research Scholar, Shri JJT University, Jhunjunu, Rajasthan, India Ved Vyas Dwivedi, PhD.

More information

Temporal Error Concealment Algorithm Using Adaptive Multi- Side Boundary Matching Principle

Temporal Error Concealment Algorithm Using Adaptive Multi- Side Boundary Matching Principle 184 IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.12, December 2008 Temporal Error Concealment Algorithm Using Adaptive Multi- Side Boundary Matching Principle Seung-Soo

More information

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences Comparative Study of and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences Pankaj Topiwala 1 FastVDO, LLC, Columbia, MD 210 ABSTRACT This paper reports the rate-distortion performance comparison

More information

Chapter 2. Advanced Telecommunications and Signal Processing Program. E. Galarza, Raynard O. Hinds, Eric C. Reed, Lon E. Sun-

Chapter 2. Advanced Telecommunications and Signal Processing Program. E. Galarza, Raynard O. Hinds, Eric C. Reed, Lon E. Sun- Chapter 2. Advanced Telecommunications and Signal Processing Program Academic and Research Staff Professor Jae S. Lim Visiting Scientists and Research Affiliates M. Carlos Kennedy Graduate Students John

More information

In MPEG, two-dimensional spatial frequency analysis is performed using the Discrete Cosine Transform

In MPEG, two-dimensional spatial frequency analysis is performed using the Discrete Cosine Transform MPEG Encoding Basics PEG I-frame encoding MPEG long GOP ncoding MPEG basics MPEG I-frame ncoding MPEG long GOP encoding MPEG asics MPEG I-frame encoding MPEG long OP encoding MPEG basics MPEG I-frame MPEG

More information

Popularity-Aware Rate Allocation in Multi-View Video

Popularity-Aware Rate Allocation in Multi-View Video Popularity-Aware Rate Allocation in Multi-View Video Attilio Fiandrotti a, Jacob Chakareski b, Pascal Frossard b a Computer and Control Engineering Department, Politecnico di Torino, Turin, Italy b Signal

More information

Implementation of MPEG-2 Trick Modes

Implementation of MPEG-2 Trick Modes Implementation of MPEG-2 Trick Modes Matthew Leditschke and Andrew Johnson Multimedia Services Section Telstra Research Laboratories ABSTRACT: If video on demand services delivered over a broadband network

More information

SAVE: An Algorithm for Smoothed Adaptive Video over Explicit Rate Networks

SAVE: An Algorithm for Smoothed Adaptive Video over Explicit Rate Networks SAVE: An Algorithm for Smoothed Adaptive Video over Explicit Rate Networks N.G. Duffield, K. K. Ramakrishnan, Amy R. Reibman AT&T Labs Research Abstract Supporting compressed video efficiently on networks

More information

RATE-REDUCTION TRANSCODING DESIGN FOR WIRELESS VIDEO STREAMING

RATE-REDUCTION TRANSCODING DESIGN FOR WIRELESS VIDEO STREAMING RATE-REDUCTION TRANSCODING DESIGN FOR WIRELESS VIDEO STREAMING Anthony Vetro y Jianfei Cai z and Chang Wen Chen Λ y MERL - Mitsubishi Electric Research Laboratories, 558 Central Ave., Murray Hill, NJ 07974

More information

Video 1 Video October 16, 2001

Video 1 Video October 16, 2001 Video Video October 6, Video Event-based programs read() is blocking server only works with single socket audio, network input need I/O multiplexing event-based programming also need to handle time-outs,

More information

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

ELEC 691X/498X Broadcast Signal Transmission Fall 2015 ELEC 691X/498X Broadcast Signal Transmission Fall 2015 Instructor: Dr. Reza Soleymani, Office: EV 5.125, Telephone: 848 2424 ext.: 4103. Office Hours: Wednesday, Thursday, 14:00 15:00 Time: Tuesday, 2:45

More information

RECOMMENDATION ITU-R BT.1203 *

RECOMMENDATION ITU-R BT.1203 * Rec. TU-R BT.1203 1 RECOMMENDATON TU-R BT.1203 * User requirements for generic bit-rate reduction coding of digital TV signals (, and ) for an end-to-end television system (1995) The TU Radiocommunication

More information

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Selective Intra Prediction Mode Decision for H.264/AVC Encoders Selective Intra Prediction Mode Decision for H.264/AVC Encoders Jun Sung Park, and Hyo Jung Song Abstract H.264/AVC offers a considerably higher improvement in coding efficiency compared to other compression

More information

Video Over Mobile Networks

Video Over Mobile Networks Video Over Mobile Networks Professor Mohammed Ghanbari Department of Electronic systems Engineering University of Essex United Kingdom June 2005, Zadar, Croatia (Slides prepared by M. Mahdi Ghandi) INTRODUCTION

More information

Express Letters. A Novel Four-Step Search Algorithm for Fast Block Motion Estimation

Express Letters. A Novel Four-Step Search Algorithm for Fast Block Motion Estimation IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 6, NO. 3, JUNE 1996 313 Express Letters A Novel Four-Step Search Algorithm for Fast Block Motion Estimation Lai-Man Po and Wing-Chung

More information

Efficient Implementation of Neural Network Deinterlacing

Efficient Implementation of Neural Network Deinterlacing Efficient Implementation of Neural Network Deinterlacing Guiwon Seo, Hyunsoo Choi and Chulhee Lee Dept. Electrical and Electronic Engineering, Yonsei University 34 Shinchon-dong Seodeamun-gu, Seoul -749,

More information

WITH the rapid development of high-fidelity video services

WITH the rapid development of high-fidelity video services 896 IEEE SIGNAL PROCESSING LETTERS, VOL. 22, NO. 7, JULY 2015 An Efficient Frame-Content Based Intra Frame Rate Control for High Efficiency Video Coding Miaohui Wang, Student Member, IEEE, KingNgiNgan,

More information

Interactive multiview video system with non-complex navigation at the decoder

Interactive multiview video system with non-complex navigation at the decoder 1 Interactive multiview video system with non-complex navigation at the decoder Thomas Maugey and Pascal Frossard Signal Processing Laboratory (LTS4) École Polytechnique Fédérale de Lausanne (EPFL), Lausanne,

More information

NUMEROUS elaborate attempts have been made in the

NUMEROUS elaborate attempts have been made in the IEEE TRANSACTIONS ON COMMUNICATIONS, VOL. 46, NO. 12, DECEMBER 1998 1555 Error Protection for Progressive Image Transmission Over Memoryless and Fading Channels P. Greg Sherwood and Kenneth Zeger, Senior

More information

Analysis of a Two Step MPEG Video System

Analysis of a Two Step MPEG Video System Analysis of a Two Step MPEG Video System Lufs Telxeira (*) (+) (*) INESC- Largo Mompilhet 22, 4000 Porto Portugal (+) Universidade Cat61ica Portnguesa, Rua Dingo Botelho 1327, 4150 Porto, Portugal Abstract:

More information

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT CSVT -02-05-09 1 Color Quantization of Compressed Video Sequences Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 Abstract This paper presents a novel color quantization algorithm for compressed video

More information

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices Shantanu Rane, Pierpaolo Baccichet and Bernd Girod Information Systems Laboratory, Department

More information

Interframe Bus Encoding Technique for Low Power Video Compression

Interframe Bus Encoding Technique for Low Power Video Compression Interframe Bus Encoding Technique for Low Power Video Compression Asral Bahari, Tughrul Arslan and Ahmet T. Erdogan School of Engineering and Electronics, University of Edinburgh United Kingdom Email:

More information

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER PERCEPTUAL QUALITY OF H./AVC DEBLOCKING FILTER Y. Zhong, I. Richardson, A. Miller and Y. Zhao School of Enginnering, The Robert Gordon University, Schoolhill, Aberdeen, AB1 1FR, UK Phone: + 1, Fax: + 1,

More information

Packet Scheduling Algorithm for Wireless Video Streaming 1

Packet Scheduling Algorithm for Wireless Video Streaming 1 Packet Scheduling Algorithm for Wireless Video Streaming 1 Sang H. Kang and Avideh Zakhor Video and Image Processing Lab, U.C. Berkeley E-mail: {sangk7, avz}@eecs.berkeley.edu Abstract We propose a class

More information

Delay allocation between source buffering and interleaving for wireless video

Delay allocation between source buffering and interleaving for wireless video Shen et al. EURASIP Journal on Wireless Communications and Networking (2016) 2016:209 DOI 10.1186/s13638-016-0703-4 RESEARCH Open Access Delay allocation between source buffering and interleaving for wireless

More information

Delay Cognizant Video Coding: Architecture, Applications and Quality Evaluation

Delay Cognizant Video Coding: Architecture, Applications and Quality Evaluation Delay Cognizant Video Coding: Architecture, Applications and Quality Evaluation Yuan-Chi Chang a,d, David G. Messerschmitt a, Thom Carney b,c and Stanley A. Klein b a Department of Electrical Engineering

More information

SERIES J: CABLE NETWORKS AND TRANSMISSION OF TELEVISION, SOUND PROGRAMME AND OTHER MULTIMEDIA SIGNALS Measurement of the quality of service

SERIES J: CABLE NETWORKS AND TRANSMISSION OF TELEVISION, SOUND PROGRAMME AND OTHER MULTIMEDIA SIGNALS Measurement of the quality of service International Telecommunication Union ITU-T J.342 TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU (04/2011) SERIES J: CABLE NETWORKS AND TRANSMISSION OF TELEVISION, SOUND PROGRAMME AND OTHER MULTIMEDIA

More information

T he Electronic Magazine of O riginal Peer-Reviewed Survey Articles ABSTRACT

T he Electronic Magazine of O riginal Peer-Reviewed Survey Articles ABSTRACT THIRD QUARTER 2004, VOLUME 6, NO. 3 IEEE C OMMUNICATIONS SURVEYS T he Electronic Magazine of O riginal Peer-Reviewed Survey Articles www.comsoc.org/pubs/surveys NETWORK PERFORMANCE EVALUATION USING FRAME

More information

Real Time PQoS Enhancement of IP Multimedia Services Over Fading and Noisy DVB-T Channel

Real Time PQoS Enhancement of IP Multimedia Services Over Fading and Noisy DVB-T Channel Real Time PQoS Enhancement of IP Multimedia Services Over Fading and Noisy DVB-T Channel H. Koumaras (1), E. Pallis (2), G. Gardikis (1), A. Kourtis (1) (1) Institute of Informatics and Telecommunications

More information

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015

Optimization of Multi-Channel BCH Error Decoding for Common Cases. Russell Dill Master's Thesis Defense April 20, 2015 Optimization of Multi-Channel BCH Error Decoding for Common Cases Russell Dill Master's Thesis Defense April 20, 2015 Bose-Chaudhuri-Hocquenghem (BCH) BCH is an Error Correcting Code (ECC) and is used

More information

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC Motion Compensation Techniques Adopted In HEVC S.Mahesh 1, K.Balavani 2 M.Tech student in Bapatla Engineering College, Bapatla, Andahra Pradesh Assistant professor in Bapatla Engineering College, Bapatla,

More information

Overview: Video Coding Standards

Overview: Video Coding Standards Overview: Video Coding Standards Video coding standards: applications and common structure ITU-T Rec. H.261 ISO/IEC MPEG-1 ISO/IEC MPEG-2 State-of-the-art: H.264/AVC Video Coding Standards no. 1 Applications

More information

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm International Journal of Signal Processing Systems Vol. 2, No. 2, December 2014 Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm Walid

More information

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET) International Journal of Electronics and Communication Engineering & Technology (IJECET), ISSN 0976 ISSN 0976 6464(Print)

More information