| |||||||||||||||||||||||||
|
|
Joan Van Tassel, Ph.D. and Steve W. RoseThe Evolution of the Interactive Broadband ServerThis is Part I of a two-part article discussing video enabled servers for metropolitan areas. The division is based on two architectural approaches for building interactive broadband servers. In this first part, we discuss developments leading up to the creation of these devices, and then turn to servers that are constructed by aggregating conventional single bus computers with other necessary components.In Part II, we cover servers based on massively parallel architectures and describe what we feel is the most appropriate model for future architectures. While we have tried to be objective, we do have a point of view that we will develop in the second article. This article is predominantly hardware oriented. Although common software platforms are important, the nature and scale of the task at hand make the selection of appropriate hardware more immediate to the successful deployment of an interactive broadband server. What is an interactive broadband server (IBS)? It is a device that delivers many different kinds of data and provides many simultaneous services. These include but are not limited to:
In addition, the IBS must accurately track, account for, store, and bill for all services while providing for network management! Other terms used to describe a fundamentally similar device have been Video Server, Media Server, and Metropolitan Media Server. It is clear that IBS is at the heart of the new interactive broadband businesses that companies in the cable television, telephone, computer, and wireless industries plan to launch. Attractive programs, services, and applications such as high-speed Internet access over cable, video on demand, interactive shopping and games, and many others depend on the availability of a reliable, cost-effective IBS. The primary difference between an interactive broadband server and a large conventional computer server, used in many organizations for client/server purposes, is the ability of IBS to provide thousands of simultaneous isochronous data streams. Isochronous ("same time") refers to data streams that are time-sensitive and must be delivered continuously without interruption or they are incoherent. An example of isochronous data streams would be real-time video and audio which are retransmitted as soon as they are received, such as a live television signal. STAKES AND STAKEHOLDERSMany different interests are watching the evolution of IBS. Cable, wireless cable, telephone, and computer companies all want to provide new programs, services, and applications to their customers. Equipment suppliers and content providers hope to provide products, if dependable standards for them are available. Finally, regulators and consumer groups want to clarify such issues as universal access, rate structure, privacy, and security.The high level of interest is a consequence of the anticipated size of the video on demand market. Existing markets are substantial. The advertising revenues for broadcast and cable television were about $30 billion in 1995, according to estimates by the Television Advertising Bureau. The National Cable Television Association reports that cable revenues were $26 billion in 1995. Conservative estimates from Satellite Business News magazine indicate there will be 12 million to 13 million direct broadcast satellite (DBS) subscribers by 2000; assuming an average bill of $40 a month, satellite delivery revenues would be more than $6 billion by the end of the decade. Wireless cable revenue is expected to be $600 million in 1996. Finally, Paul Kagan & Associates report that the videocassette rental market earned $10 billion in 1995, and sell-through of videos to consumers was about $6 billion. Packaged programming for other stand-alone devices also brings in significant revenue. Consumers spent $6 billion for interactive games in 1995 says research firm DataQuest and about $8 billion to $9 billion for dedicated game players. The computer is an increasingly profitable venue for video material. There are now 25 million CD-ROM-equipped multimedia machines in consumer hands worldwide and more than 5,000 titles. DataQuest estimates that the market for CD-ROM games is now about $660 million and growing rapidly. Based on these figures plus a large grain of salt, the video market was more than $60 billion in 1995. While that is less than the telephony market (about $100 billion annually) and even less than the income of power utility companies (about $200 billion), it is still a substantial enough figure for the various stakeholders to concern themselves with the design, deployment, and implementation of an interactive broadband platform, of which the server is a central element. REQUISITE EARLY DEVELOPMENTSIn order to understand the evolution of interactive broadband servers, it is helpful to understand the environment which made it possible, and the elements designers had to work with in building them.BROADBAND NETWORK ENVIRONMENTS
The Cable Industry -- Hybrid Fiber/Coax. Time Warner Cable pioneered a technology in the late 1980s which has become known as hybrid fiber/coax (HFC). It divides existing coax-based systems into neighborhoods of about 500 to 2,000 subscribers each and sends an individual optical fiber from the headend to each neighborhood. No one in the neighborhood is more than four amplifiers away from the optical fiber. By minimizing the number of cascaded amplifiers and upgrading them, it is possible to more than double the bandwidth delivered to each subscriber -- from about 500 Megahertz currently to as much as 1.2 gigahertz, while reducing system noise. Due to the reduced noise and direct connection to each neighborhood, getting information back from subscribers becomes practical and fundamentally one-way systems become two-way systems. The HFC architecture is being widely adopted by cable companies and some telephone companies, and won the group at Time Warner Cable an Emmy in 1994. HFC is significant because it allowed companies to consider delivering custom material to each household. The greater bandwidth provided by the HFC architecture led the press to refer to "the five hundred channel cable universe." However, this characterization missed the real innovation, which was that operators could deliver 500 different programs simultaneously to each neighborhood node of 500 subscribers. Put another way, the new technologies make it possible to advance from delivering 50 of the same channels to 50,000 viewers to offering an individual channel to each of 50,000 interactive viewers. Further, because of the way that HFC expands the available bandwidth, cable operators can provide hybrid service: New digital programs can be delivered to individual households over the new bandwidth from a server and long-term storage, while leaving intact the old analog services on the existing bandwidth. This means no changes for subscribers unless they choose to take advantage of new services. Telcos -- ADSL and Fiber-to-the-Curb (FTTC). The telephone industry had a different asset to protect: a network of twisted pairs of copper wire that took more than 100 years and $1,500 per household to construct. This infrastructure required a different approach, as the bandwidth a twisted pair can support and the distance it can transport a high-bandwidth signal are greatly restricted, as compared with coaxial cable. As a result, telco designs focused on sending one video signal at a time from the central office to the subscriber over the twisted pair. An example is Asymmetrical Digital Subscriber Line (ADSL) technology, which uses a technique which trades reduced upstream bandwidth for much greater downstream bandwidth. As the limitations of ADSL became apparent, phone companies focused on carrying fiber optic cable deep into the neighborhood, an architecture called Fiber-to-the-Curb. Each FTTC node serves about 20 subscribers over existing twisted pairs from the curb to the home. Wireless Systems. There are two wireless infrastructures that deliver television. Wireless cable systems (sometimes called MMDS for multichannel, multipoint distribution service) cover about a 35-mile radius and are not likely to become interactive. One reason is that MMDS is typically promoted as a low-cost alternative to wired cable service. In addition, MMDS would need to invest in a cellular-based return path to make two-way communication feasible. By contrast, LMDS (local multipoint distribution service) systems use a cellular approach, where each transmitter reaches a defined area, as small as two or three miles. Nonadjacent cells can carry different content just as with cellular telephony, and it greatly increases the effective deliverable bandwidth to the overall service area. Wireless data return from subscribers to the cell site is also possible, because the return path from the viewer back to the cell is so short that it needs only a four- to six-inch antenna and a little power, using the cell structure already in place. ATM SWITCHING
DIGITAL VIDEO COMPRESSION (DVC)
Several techniques of compression have been developed, including discrete cosine transform (DCT), wavelet, vector quantitization, and fractal schemes. However, the dominant family of standards was developed by the Moving Picture Experts Group (MPEG) based on DCT. OTHER IMPORTANT DEVELOPMENTS
Powerful Single Bus Computers. In the computer industry, a battle has waged for years between harnessing the power of tens to thousands of processors running independently in parallel linked by a communication mesh, versus connecting a limited number of processors running on a single bus. Parallel processors have gained a reputation for being difficult to program. However, the rapid development of ever more powerful single bus processors has allowed them to match the computing capacity of any existing parallel processor during its lifetime. Most successful computer servers are single bus, single processor designs. Some use a single bus, but multiple identical tightly-coupled processors, and are referred to as Symmetric Multiprocessors (SMP). Real-Time Encryption. Protecting intellectual property, when it is represented as a digital data stream, requires that the stream be encrypted in real time so that each instance of the stream can be separately protected. Isochronous digital video streams pose a particular problem due to their high data rate. Error Correction Codes. There are two points during isochronous stream delivery that errors must be anticipated and corrected in advance. The first high-potential point occurs when the data is played from the hard disk arrays. Hard disk drives are electromechanical, so they can be expected to fail more often than electronic equipment. Their failures are critical due to loss of data and/or interruption of service. As a result, a group of standards for increasing storage reliability, collectively known as RAID (Redundant Arrays of Independent Disks), has evolved which allows disks to be grouped so that the failure of a single drive does not affect the output of the array. The penalty for this reliability is typically an increase of 25% in the number of drives required. The second error-prone point occurs during network transport. Anticipating these errors involves sending enough redundant information that almost any errors caused by transient noise or interference can be fixed on the receiving end. This anticipatory error correction is critical for isochronous data, where there isn't enough time to detect an error and ask for a retransmission of the corrupted data. The procedure is called Forward Error Correction (FEC). Digital Modulation. In order to transmit digital information over analog channels (important for HFC and wireless networks), the digital information must be changed in format. Digital information has only two levels representing 0 and 1. Modern techniques increase the number of levels through phase and amplitude modulation of each cycle to achieve bit efficiencies of up to six to eight bits per Hertz. Typical of these is 64 QAM (Quadrature Amplitude Modulation), which uses 64 unique combinations to transmit six bits per Hertz. Network Management. As computer networks have grown more complex, the difficulties of monitoring and controlling the equipment that constitute the network have grown. Software and standards now allow central management of networks that span the globe. Chief among the current standards is the Simple Network Management Protocol (SNMP). Business Support. The last few years have seen the creation of reliable computer software to accommodate various complex forms of billing. Cable television billing typically includes a flat monthly fee, fixed monthly additions for additional services, plus individual billing for special events. Telephone billing includes flat monthly charges, plus billing on the basis of utilization on a minute-by-minute or second-by-second basis, plus billing on behalf of secondary companies (e.g., long distance providers). Telemarketing services require immediate credit verification and real-time interaction with financial and fulfillment (inventory and shipping) systems. Finally, just as neighborhood shopping centers must monitor the sales of associated businesses to enable billing on a percentage of revenue, so must virtual shopping centers on the network track sales. All of these complex billing models will be required in an interactive broadband network. CONVENTIONAL INTERACTIVE BROADBAND SERVERSDEFINING THE TASK
A BRUTE FORCE SERVER MODEL
We assume a population of 500,000 subscribers, 200,000 streams, and a library of 300 movies. Each movie requires about three gigabytes of storage, resulting in about one terabyte of memory to store all the movies. Although it would be too expensive to actually store this content with RAM chips, we will store the movies in RAM, using 16 megabit DRAMs for comparison purposes. This storage will require about 500,000 RAM chips. To sort each resulting four megabits per second (Mb/s) stream to the viewer requesting it, we hypothesize a switch made of 100 input by 100 output crosspoint switch chips. To actually deliver each stream to the requester, we will need a routing switch with 200,000 inputs (the number of streams) and 500,000 outputs (we have to be able to switch any stream to any subscriber). With our 100 x 100 switch chips, we will need 2,000 times 5,000 chips, or 10,000,000 switch chips -- 20 times more than the number of RAM chips! It is clear that the big problem is not storage, but switching. The conclusion that emerges is: The cost of storage increases linearly with capacity, and the cost of switching increases geometrically with capacity. ASSUMPTIONSCompressed digital video streams have data rates varying from 1.2 Mb/s to nine Mb/s, with four Mb/s being a typical choice for an excellent quality video plus audio signal. The definition of excellent quality is that the image is equivalent to Super-VHS or Hi8 video and CD-quality multichannel audio. In the remainder of this article, calculations will be based on four Mb/s constant bit rate MPEG-2 encoding.Assuming that an average movie lasts 100 minutes, it will consume three gigabytes of storage. The estimates of the maximum number of subscribers using interactive digital services simultaneously have ranged from 7% to 40% of total subscribers. We will use a figure of 20% to represent the peak capacity design point for our calculations. We will also assume that the subscriber population for a single server will be between 20,000 and 100,000 subscribers and the typical size is 50,000 households. This results in a server design that must be capable of back-to-front throughput of 16 gigabits per second (Gb/s) to 80 Gb/s; throughput of 40 Gb/s is needed to serve a 50,000 household system. STORAGE AND IMPORTATION OF CONTENT INTO STREAM GENERATOR
REASSEMBLE DATA INTO A STREAM: GENERATION
When more streams are required, these stream generators are grouped. Since each has its own hard disk storage, a title which is popular enough to generate demand for more streams than one unit can provide must be replicated to as many units as necessary to meet the demand. In fact, since many titles can be stored on one stream generator, it must be managed so that a popular title doesn't block access to other titles stored uniquely on that unit by consuming all of its output streams. In a 50,000 subscriber area, the peak server load will be 10,000 streams. This means that, to satisfy peak load requirements, we will need 100 stream generators, each with a 100 stream capacity. First-run movie demand, as indicated by box office receipts, shows that one title can account for as much as 40% of demand. With a peak server load of 10,000 streams, a single title could account for 4,000 of the streams. Assuming that we will allow up to 80% of the streams on one unit to come from a single title, that title will have to be replicated on at least 50 of the stream generators (4,000 streams divided by 80 streams per generator). At three gigabytes per copy, at least 150 gigabytes of storage will have to be devoted on the system to that title. If the operator decides to store it on every server, the storage requirement rises to 300 GB. In a large metropolitan system with 500,000 subscribers, this redundant storage would expand to at least 1.5 terabytes (TB), regardless of whether it was on one large server or 10 smaller ones. ENCRYPTION
STREAM SORTING, ROUTING, AND MULTIPLEXING
The volume of information in video streams, the high transport speed required, and the ability to switch all types of information has caused most server architects to incorporate ATM switching into their designs to sort and route all the streams and to multiplex streams going to the same neighborhood. In addition, ATM makes interconnection with other services using the ATM protocol straightforward (for example, long distance service providers). ENCODING FOR TRANSPORT: MONITOR QUALITY OF SERVICE, FEC, AND MODULATION
Conventional designs usually place FEC and quality monitoring functions within the modulator. As a result, an "intelligent modulator" that accepts standard ATM input and provides sufficient processing power to accomplish the additional information-modifying tasks is needed for each output channel. Typically, the modulator puts 27 Mb/s of data on a six MHz chunk of spectrum (the space allotted for a conventional analog channel). This allocation allows for six 4-Mb/s compressed digital streams plus overhead (four Mb/s, divided into 27 MHz equals 24 MHz, with three MHz capacity left over for overhead). OPERATIONS SUPPORT AND BILLING
Another separate control computer, or sometimes multiple computers arranged in a hierarchy, direct the operation of the stream servers and switch. This system manages content and directs normal operation of the system. Billing information is sent from the control computer to a separate billing computer system. For the sake of reliability, an isolated machine is used to bill for services. This physical separation of the billing data from the content also ensures that access to the content server cannot be used to hack billing data. The billing system is referred to as the Business Support System (BSS). ADDITIONAL CONSIDERATIONS FOR INTERACTIVE BROADBAND SERVERS OF CONVENTIONAL ARCHITECTURELIBRARY STORAGE VERSUS PRIMARY STORAGE
The critical speed for importation is real time (e.g., four Mb/s). If the content can be downloaded at the same or a faster rate than the rate of delivery to the subscriber, then it is possible to begin to deliver the content shortly after beginning the download. However, if the import rate is much slower than real time, then the entire program must be loaded before delivery to the subscriber can begin. For example, a dominant vendor's product takes 3x real-time to download from a single tape drive to the stream generator. This speed means that if a customer requests a 100-minute movie that is not already stored in the stream generator, it will be five hours after loading begins from tape before the movie can even begin to be delivered to the subscriber. A further consequence of the 3x real-time import speed and a single drive is that when the system launches, if it has a capacity of only 200 hours of content (120 movies), it will take 600 hours to load them into the stream generators, or 15 work-weeks of eight-hour work days, or five weeks of 24-hour days! A rapid import rate confers important advantages that consumers like, such as "VCR functionality." If the material is imported from the library at a real-time rate, the subscriber can be allowed VCR functionality (except fast forward). If it is imported at or above the fast forward rate, the subscriber can receive the requested program within seconds of the time that the download begins -- with full VCR functionality. It is not possible to implement fast forward functionality directly from the library by jumping ahead in the material because the read element of storage devices (CD-ROM, DVD, tape drive, or hard disk) addresses only one stream at a time. If the library read element jumped ahead to support a viewer's request for fast forward, there would be a gap in the material stored in the hard disk array. If a second customer were to request the same program, it would not be possible to deliver it until the first viewer was done, as the copy stored on hard disk (which can support multiple viewers) would be incomplete. The nature of the transfer from the library to hard disk storage makes an enormous difference in the nature of the server. If the transfer occurs at a rate equal to or faster than real time and the server supports immediate delivery, then the library becomes the primary storage of the system, and consumers have immediate access to the full content of the library. If the transfer is slower than real time or immediate delivery is not supported, then the only titles which may be offered to subscribers for immediate consumption are those already loaded in hard disk storage. This usually represents the difference between 100 or 200 and thousands of titles. STREAM GENERATOR SIZE
ATM SWITCH LIMITATIONS
Another limitation is the nature of the traffic, which is largely unidirectional because so much of the information is the downstream delivery of high bandwidth video on demand to subscribers. The design of ATM switches assumes that there will be approximately the same amount of traffic in both directions, and each downstream channel is paired with an upstream channel. It would seem logical to reverse some of the upstream channels and use them to support the downstream traffic. Unfortunately, when an ATM switch is wired this way, it triggers SNMP (Simple Network Management Protocol) error messages. Disabling the error messages removes the ability to manage and monitor the switch, which is then operating in the "crossed-fingers" management mode. As a result, upstream and downstream channels must remain paired, and almost half of the I/O capacity of the switch (and in some cases, some of its throughput bandwidth) goes to waste. This results in up to twice as many switches being required. INTELLIGENT MODULATOR SIZE
SCALABILITY
For example, our considerations so far have led us to conclude that the conventional interactive broadband server designed for a 50,000 subscriber area calls for between 25 and 300 racks of equipment for stream generation, and 77 to 425 racks for downstream modulation and associated processing. Switching equipment is relatively small, requiring about 10 to 20 racks of gear. The sum of these requirements amounts to between 112 and 745 racks of equipment. If we allow 12 square feet per rack including service access isles, and 15% for office space, air and power conditioning equipment, then a facility for 50,000 subscribers will occupy from 1,600 to 10,280 square feet. While these areas are less than huge, they are substantial when compared with the customary 600 square feet used by the headend of an average 50,000 subscriber cable system. For a 500,000 subscriber system, the total area would be about an acre, full of electronic equipment. Zoning approval delays and ongoing real estate costs must be factored into the planning for a conventional IBS. Now, let us turn to the problem of power that the conventional design requires. In the design plans for an actual trial-size facility of an interactive broadband server, estimates of power requirements for the stream generators, switches, and modulators added up to 200 watts per video stream. This figure means that a 50,000 subscriber server putting out 10,000 digital streams would require two megawatts. Even assuming there might be some economies that could be employed to reduce the usage to 50 to 100 watts per stream, it would still result in a monthly bill of $70,000 to $140,000 including air conditioning.1
SYSTEM RELIABILITY
In order for the viewer to receive a coherent picture, the delivery must be nearly perfect. What is transported is not digital video, where a single error results in a bad pixel that disrupts a tiny portion of the picture for 1/60th of a second. Rather, the data is a compressed digital data stream, where a modest error rate can affect a large part of the screen for up to several seconds or even disrupt the session altogether. If the disruption results in a complaint from the subscriber, it costs more than $1.00 to handle the phone call. Even without the loss of the revenue from the movie, there is an extremely narrow profit margin because the net revenue from the $0.99 to $4.00 charge is so low. Any disruption of service will offset any possible profit. The server itself contributes little to overall system reliability; however, it affects the new digital services from which operators hope to derive additional revenue. The server must be designed to deliver signals without disrupting service, in spite of the failure of any one of its components. Conventional servers are sometimes designed with RAID technology to accommodate the failure of single hard disks. However, the remainder of the server is susceptible to single point failures. The sheer amount of equipment and the complex interfacing of equipment from multiple suppliers called for by the conventional approach multiplies the probability of failures and the difficulty of correcting them when they occur. SUMMARYThe evolution of the conventional IBS has been essentially linear. Each function has been addressed by adding on a new layer of hardware and software. The design begins logically with storage and stream generation. Then, the switch is added for sorting, routing, and multiplexing. Appended downstream modulators encode the output for transport. Then, the need for end-to-end management results in the overlay of an operational support system. Finally, business requirements demand an additional system to monitor usage, store the data, and charge customers for service.Table 1 recaps traditional approaches to the design of interactive broadband servers:
PREVIEW OF PART IIIn the second part of this article that will appear in the next issue, we present our point of view about how to design an integrated IBS that might perform these functions in more efficient, cost-effective, and reliable ways. We will look at the application of massively parallel processing technology to an IBS and describe a design that looks at the task as a throughput problem rather than as an input/output problem. We will also present innovative ideas for configuring storage. Finally, our suggestions will address the problems of headend design and power consumption. In the process, we will present a metaphor to provide common ground for future discussions.Authors' Note -- Questions or comments can be sent to the authors via e-mail to Joan Van Tassel at dr.joanvt@gte.net and Steve Rose at roses@maui.com. 1 Here is how the numbers play out. Electrical usage takes place whether or not the operator is actually generating streams; that is, the equipment is always on drawing electricity. Each of the 10,000 streams consumes 100 watts of power. Thus, at any instant, the system is using a megawatt of electricity. Over the course of an hour, this consumption becomes a megawatt hour, or 1,000 kilowatt hours for all the streams. At $0.10 per kWh (1,000 x .10 = $100), the systems consumes $100 an hour. Given 24 hours in a day, electrical power alone will cost $2,400 per day, or $72,000 per month. Air conditioning the equipment stored on this many racks will double the electricity needed by the headend. Source:This article originaly appeared in the New Telecom Quarterly: 1996Q1. We also provide a PDF version. [home][author] [subject] [title]
| ||||||||||||||||||||||||