Learn
Best Video Streaming CDN in 2026? 7 Providers Compared With Real Performance Data
Best CDN for Video Streaming in 2026: 7 Providers Compared A single rebuffer event at the two-second mark costs you 8% ...
During the 2026 Super Bowl, Netflix sustained over 18 Tbps of aggregate egress from its Open Connect footprint without a single rebuffer-rate spike visible in public quality-of-experience dashboards. That number matters less for its magnitude than for what it reveals about topology: virtually none of that traffic touched Netflix's origin in AWS. Every byte was served from an Open Connect Appliance sitting inside or adjacent to an ISP network. If you architect delivery at scale—video, software updates, game patches—the Netflix Open Connect model is the most publicly documented example of what happens when you push content to the absolute last mile and then push it further.
This article breaks down how the Open Connect architecture actually works as of Q2 2026, including OCA hardware generations, cache-fill orchestration, the newer home caching node program, and a failure-mode analysis you will not find in Netflix's own documentation.

Netflix Open Connect is not a commercial CDN product. It is a purpose-built content delivery network that Netflix operates and finances, placing cache appliances directly inside ISP facilities or at internet exchange points. As of 2026, Netflix reports partnerships with over 1,000 ISP networks across six continents, with appliance deployments numbering in the tens of thousands.
The control plane runs in AWS. When a subscriber hits play, Netflix's steering service (running on EC2) evaluates the client's resolver IP, current OCA health telemetry, and content popularity to generate a ranked manifest of cache URLs. The client then fetches segments over HTTPS directly from the selected OCA. No traffic hairpins through a Netflix origin during steady-state playback—the origin's role is limited to the initial API call and manifest generation.
OCAs do not pull content on demand from origin. Netflix uses a proactive fill model. Each night during off-peak hours, a central scheduling system analyzes regional viewing predictions and pushes content to appliances before demand materializes. The prediction engine weights historical watch patterns, new-release metadata, promotional schedules, and regional trending signals. Fill traffic flows over dedicated peering or transit links to avoid competing with subscriber sessions during peak.
This off-peak fill pattern is critical: it converts the cache-miss problem from a latency event into a capacity-planning problem. As long as the fill window and available bandwidth exceed the volume of new content scheduled for a region, cache-hit ratios stay above 95%. Netflix has publicly cited hit ratios exceeding 95% for embedded OCAs in well-provisioned ISP deployments.
Netflix has iterated on OCA hardware across several generations. The appliances ISPs receive at no cost are purpose-built storage-dense servers optimized for sequential read throughput, not general computation.
| Spec | Storage OCA (Embedded) | Flash OCA (IXP / High-Density) |
|---|---|---|
| Storage media | 36x HDD (16–20 TB each) | NVMe SSDs, ~500 TB usable |
| Network interfaces | 4x 100 GbE | 4x 100 GbE (or 2x 400 GbE) |
| Typical egress per box | ~90 Gbps sustained | ~160 Gbps sustained |
| OS | FreeBSD (custom Netflix fork) | FreeBSD (custom Netflix fork) |
| TLS termination | In-kernel TLS offload | In-kernel TLS offload |
Netflix's use of FreeBSD and its investment in kernel-level TLS and sendfile optimizations are well documented in their tech blog posts. The appliances run a minimal software stack: no container orchestration, no general-purpose web server. The HTTP server is a custom asynchronous implementation tuned to serve large sequential reads from disk with minimal syscall overhead.
The home caching node concept represents Netflix's most aggressive topology shift. Rather than placing appliances only at ISP headends or IXPs, Netflix has experimented with smaller cache units positioned even closer to clusters of subscribers—think neighborhood-level or building-level deployments. Public details remain limited, but the architecture follows a logical extension of the embedded OCA model.
Home caching nodes serve a subset of the content catalog, focusing exclusively on titles predicted to be popular within a narrow geographic or network radius. Predictive models determine what to stage; the node pulls fills from the nearest upstream OCA rather than from origin. This creates a three-tier cache hierarchy: origin (AWS) → regional/IXP OCA → embedded ISP OCA → home caching node.
For an ISP, every gigabit served from a home caching node is a gigabit that does not traverse backhaul links to the headend. In GPON-based FTTH deployments, where the backhaul between the OLT and the core router is often the most expensive link to upgrade, moving Netflix traffic off that path has direct capex implications. Netflix positions this as a mutual benefit: the ISP saves on backhaul, and Netflix improves quality metrics by reducing the number of congestion-prone hops.
No architecture discussion is complete without understanding how the system degrades. Netflix Open Connect is designed to fail gracefully across several scenarios, and studying these patterns is instructive for anyone building distributed cache systems.
When an individual OCA becomes unresponsive or reports degraded health via its heartbeat to the control plane, the steering service removes it from manifest generation within seconds. Clients that already received a manifest pointing to the failed node will experience a segment fetch timeout; the Netflix client-side player is built to retry against the next URL in the manifest's ranked list. In practice, this means a brief stall (typically under two seconds) rather than a playback failure.
If an entire ISP site goes offline—power failure, fiber cut—the steering service redirects subscribers to the next-closest OCA deployment, which might be at an IXP or a neighboring ISP facility. Latency increases, but playback continues. Netflix's adaptive bitrate algorithm absorbs the additional RTT by stepping down quality temporarily.
A more subtle failure: if the nightly fill cycle fails to complete—because a peering link between Netflix and the ISP was saturated, or the scheduling system had a bug—the OCA enters the next peak period with a stale catalog. New releases that subscribers expect to watch are not cached locally. The OCA must then perform on-demand origin fetches or redirect to another OCA that has the content. This scenario is the primary driver behind Netflix's investment in fill-path redundancy and multi-path scheduling.
Steering depends on mapping client resolver IPs to the correct OCA. If an ISP changes its IP allocation or a large block of subscribers begins using a public DNS resolver that geolocates to a different region, steering sends clients to the wrong OCA cluster. Netflix mitigates this with EDNS Client Subnet (ECS) support and continuous telemetry validation, but mismatches still occur and are a known source of localized quality regressions.
Netflix can afford to build and ship custom hardware to ISPs because its traffic volume justifies the economics. Most organizations cannot. But the architectural principles transfer directly: proactive cache warming beats reactive cache fills, steering intelligence belongs in a centralized control plane rather than distributed via DNS TTL hacks, and cache-hit ratio is a planning metric, not just an operational one.
For organizations delivering video, large binaries, or software updates at significant scale but without the volume to justify an ISP-embedded appliance program, a commercial CDN with flexible edge configuration and aggressive volume pricing closes most of the gap. BlazingCDN's media delivery platform offers stability and fault tolerance on par with Amazon CloudFront at a fraction of the cost—starting at $4 per TB for smaller workloads and scaling down to $2 per TB at 2 PB+ commitments. For enterprises running high-volume delivery, that pricing delta compounds into material savings without sacrificing uptime or configurability.
Open Connect is a private, purpose-built CDN that Netflix operates exclusively for its own traffic. Unlike commercial CDNs that serve arbitrary customer origins, OCAs are single-tenant appliances running a minimal FreeBSD stack optimized for large sequential video segment reads. Netflix bears the hardware and shipping cost; ISPs provide rack space and power.
An OCA is a custom-built server Netflix deploys inside ISP networks or at IXPs. Current-generation storage OCAs contain 36 HDDs (up to 20 TB each) and push approximately 90 Gbps of sustained egress per unit. Flash-based OCAs use NVMe SSDs and can sustain roughly 160 Gbps. Both run Netflix's custom FreeBSD fork with in-kernel TLS offload.
Netflix's fill scheduler uses a prediction engine that combines historical viewing data per region, new-release metadata, promotional calendar data, and real-time trending signals. Content is pre-positioned during off-peak hours so that OCAs are warm before peak viewing begins. The system targets cache-hit ratios above 95% for well-provisioned sites.
Yes. The program is voluntary. Netflix provides appliances at no cost and covers hardware replacement, but ISPs must provide rack space, power, and connectivity. ISPs that decline still receive Netflix traffic over standard peering or transit, typically at higher cost and lower quality for end users.
The steering control plane detects the failure via health-check telemetry and removes the OCA from manifest generation within seconds. Clients already mid-stream retry against the next-ranked URL in their manifest. The player typically recovers in under two seconds, potentially at a reduced bitrate until a healthy OCA is fully serving the session.
As of Q2 2026, home caching nodes are in limited deployment and not broadly available. Netflix has discussed the concept publicly but has not opened a general enrollment program. ISPs interested in the standard embedded OCA program can apply through the Open Connect partner portal.
If you operate a multi-tier cache hierarchy—whether you built it or you are paying a CDN vendor for one—run this exercise this week: measure your cache-hit ratio at each tier during your peak hour, then calculate the origin-fetch bandwidth cost for every miss. Multiply by 30 days. That number is your monthly cache-miss tax. Compare it against the cost of proactive fill during off-peak windows. Netflix solved this problem by shipping hardware to ISPs. You can solve a version of it by instrumenting your fill pipeline and treating cache warmth as an SLO, not a side effect. If the numbers surprise you, your architecture is telling you something worth hearing.
Learn
Best CDN for Video Streaming in 2026: 7 Providers Compared A single rebuffer event at the two-second mark costs you 8% ...
Learn
Video CDN Providers Compared: BlazingCDN vs Cloudflare vs Akamai for OTT If you are choosing a video CDN for an OTT ...
Learn
Video CDN Pricing Explained: How to Stop Overpaying for Streaming Bandwidth Video already accounts for 38% of total ...