Internet2 NetFlow: Weekly Reports: Week of 20051219

  1. Introduction
  2. Bulk TCP
  3. Full Data Set

Introduction

You are looking at the weekly Abilene network usage report for the week of 20051219 produced from NetFlow records. The view of the whole network as a single traffic-relaying unit is presented. More formally, data from all interior circuits (those connecting two Abilene routers) were discarded while all the rest of the data were merged to create this view.

During this week, there were no missing data days.

The data are split into two sections: bulk TCP data and the full data set. A "bulk TCP" flow is defined as a TCP flow that transferred more than 10MB of data. The first section only concerns these data. The second section studies the overall traffic composition.

All the numbers in this report are hyperlinked to plots that show their history (e.g., clicking on the percentage of octets of NNTP traffic will bring up a time-series plot that shows the history of this parameter).

Bulk TCP

During this week, bulk TCP traffic comprised 51.63% of octets and 0.66% of packets of the full data set traffic.

The distribution of bulk TCP throughputs is the most important piece of data in this report. Cumulative distribution function plots (1-CDF vs. throughput in bits/second) in semi-log and log-log scales are as follows:
[Bulk TCP throughputs (semi-log scale).] [Bulk TCP throughputs (log-log scale).]

Distribution of the amount of data transferred (in semi-log and log-log scale, 1-CDF vs. total trasfer size in octets) is presented below. It should be recognized that NetFlow collection mechanism is always configured so that flows (in the accounting sense) cannot last longer than a certain period of time. Therefore, the distribution of transfer sizes is to a certain extent skewed in the upper part.
[Bulk TCP transfer sizes (semi-log scale)] [Bulk TCP transfer sizes (log-log scale).]

The distribution of durations of bulk TCP flows (in seconds) is as follows (you may notice the cut-off phenomenon mentioned above):

[Bulk TCP durations distribution.]

The following table shows actual values from the above distribution plots that correspond to characteristic values (such as median, 90%, max, etc.).

Table 1. Selected Points from Distribution Graphs (Bulk TCPs)

Percentile Throughput (b/s) Durations (s) Size (octets)
1 1.384M 6 10.05M
5 1.489M 14 10.46M
10 1.617M 21 10.95M
50 3.128M 58 17.18M
90 9.246M 59 45.60M
95 13.78M 59 63.38M
99 48.18M 59 161.3M
99.9 1.016G 119 3.692G
99.99 1.121G 120 3.807G
99.999 2.411G 125 5.226G
100 280.8G 4294967 339.4G

We compute average packet size of each flow by dividing the number of octets in a flow by the number of packets. Distribution of average sizes of packets belonging to bulk TCP flows is as follows:

Table 2. Packet Sizes (Bulk TCP)

Packet Size Packets
Small (<100B)2.84% 2.938G
Medium (100-1400B)5.09% 5.257G
Large (1401-1500B)82.53% 85.27G
Jumbo (>1500B)9.54% 9.854G
Total100.00% 103.3G

We show what applications transfer large amounts of data in the following table. Note that this is bulk TCP traffic only; full data set usage is presented in the next section.

Table 3. Aggregated Application Types (Bulk TCP)

Traffic Type OctetsPacketsFlows
Measurement41.58% 91.52T 13.74% 14.19G 1.61% 87.35k
Data Transfers23.75% 52.27T 34.70% 35.85G 41.59% 2.254M
Advanced Apps8.24% 18.13T 11.99% 12.39G 18.29% 991.4k
Encrypted Traffic7.70% 16.94T 11.16% 11.52G 9.58% 519.4k
File Sharing2.07% 4.555T 3.08% 3.182G 2.21% 120.0k
Misc0.14% 299.9G 0.52% 539.0M 0.32% 17.22k
Games0.13% 282.2G 0.19% 198.1M 0.25% 13.48k
Audio/Video0.11% 235.7G 0.16% 163.7M 0.27% 14.61k
Unidentified16.28% 35.84T 24.46% 25.27G 25.88% 1.403M
Total100.00% 220.0T 100.00% 103.3G 100.00% 5.421M

The following are the fastest 10 measurement flows with unique source and destination AS numbers (i.e., for any given pair of source and destination AS numbers, no more than one fastest flow is shown).

Table 4. Fastest Bulk TCP Measurement Flows with Unique AS Source and Destination

Throughput (b/s)Packet size (bytes)Duration (s)Src ASDest ASApplication type
1.168G900010Abilene [11537]Abilene [11537]Iperf
963.1M150060Unknown [17934]MREN [22335]Iperf
759.8M896552APNIC [7575]Abilene [11537]Iperf
376.9M150010SLAC [3671]U Florida [6356]Iperf
324.2M150012Brookhaven National Lab [43]U Florida [6356]Iperf
130.6M150010CERN [513]U Florida [6356]Iperf
87.41M140215NASA-GSFC [1749]UT-Austin [18]Iperf
74.57M142020SLAC [3671]UTDALL [20162]Iperf
74.03M150016NASA-ESDIS-NET [22767]UT-Austin [18]Iperf
67.06M150012CERN [513]CalTech [31]Iperf

The following are the fastest 10 non-measurement flows with unique source and destination AS numbers (i.e., for any given pair of source and destination AS numbers, no more than one fastest flow is shown). When unable to determine the application type, we give the source and destination port numbers.

Table 5. Fastest Bulk TCP Non-measurement Flows with Unique AS Source and Destination

Throughput (b/s)Packet size (bytes)Duration (s)Src ASDest ASApplication type
267.3M150019NASA-ESDIS-NET [22767]NCSA [1224]36176 -> 8889
260.1M147830NASA-HPCC-ESS [7847]UCAR [194]Hotline
216.1M150022Indiana [87]Network for Education and Research in Oregon [3701]Rsync
201.7M150020NASA-ESDIS-NET [22767]Pennsylvania State U [3999]Hotline
192.9M150020Network for Education and Research in Oregon [3701]Indiana [87]Rsync
170.3M142028NOAA [6629]NIST-BOULDER [2648]SSH
163.1M150021UCAR [194]NASA-HPCC-ESS [7847]47780 -> 5102
156.9M150013NASA-ESDIS-NET [22767]EROS Data Center - USGS [5663]Hotline
134.0M150020UCAR [194]PSC [1207]SSH
126.0M150011NASA-ESDIS-NET [22767]UCAR [194]Hotline

We also compute the average concurrency of bulk TCP flows for the week (by adding durations of all captured flows and dividing the result by the by the duration of the week). This week's average number of concurrent bulk TCP flows: 442.0.

Full Data Set

In addition to bulk TCP flows data, we provide statistics that characterize the overall composition of the complete data set (everything that transited the Abilene network this week).

The following table describes what kinds of traffic went through the network (multiple applications are aggregated into classes):

Table 6. Aggregated Application Types (Full Data Set)

Type OctetsPackets
Measurement26.90% 114.6T 0.16% 25.09G
Data Transfers25.32% 107.9T 0.79% 124.5G
Advanced Apps8.87% 37.82T 0.28% 44.00G
Encrypted Traffic5.38% 22.94T 0.18% 27.84G
File Sharing5.28% 22.50T 0.23% 36.18G
Audio/Video1.65% 7.024T 0.90% 141.3G
Misc1.51% 6.446T 0.95% 149.2G
Games0.34% 1.435T 0.02% 2.548G
Unidentified24.75% 105.5T 96.50% 15.19T
Total100.00% 426.3T 100.00% 15.74T

This table is available additionally in the following more verbose version (no applications are aggregated into classes, but class composition is shown):

Table 7. Detailed Application Types (Full Data Set)

Traffic type OctetsPackets
Measurement
Iperf
ICMP
IPMP
---
26.84%
0.06%
0.01%
---
114.4T
235.1G
21.82G
---
0.14%
0.02%
0.00%
---
22.00G
2.783G
303.0M
Data Transfers
NNTP
HTTP
FTP
Rsync
---
10.01%
9.86%
2.95%
2.50%
---
42.67T
42.03T
12.56T
10.67T
---
0.27%
0.35%
0.10%
0.07%
---
42.30G
55.46G
15.89G
10.90G
Advanced Apps
IBP
UNIDATA LDM
BBCP
McIDAS
GsiFTP
BBFTP
---
4.42%
4.12%
0.25%
0.09%
0.00%
0.00%
---
18.82T
17.54T
1.065T
362.6G
14.11G
7.905G
---
0.12%
0.15%
0.01%
0.00%
0.00%
0.00%
---
18.64G
23.67G
1.211G
380.3M
25.76M
61.91M
Encrypted Traffic
SSH
HTTPS
IPsec ESP
IPsec AH
IPsec IKE
---
4.72%
0.55%
0.10%
0.01%
0.00%
---
20.12T
2.361T
406.7G
44.73G
1.105G
---
0.15%
0.03%
0.00%
0.00%
0.00%
---
22.86G
4.186G
723.7M
66.26M
5.267M
File Sharing
Shoutcast
BitTorrent
Hotline
Audiogalaxy
eDonkey2000
Gnutella
FastTrack
Carracho
WinMX
Neo-Modus
Freenet
Blubster
Direct Connect++
---
2.85%
0.66%
0.65%
0.59%
0.43%
0.09%
0.01%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
12.13T
2.816T
2.777T
2.502T
1.840T
398.1G
21.37G
4.437G
2.737G
1.995G
1.899G
565.7M
60.40M
---
0.15%
0.02%
0.02%
0.02%
0.02%
0.01%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
23.17G
3.816G
2.826G
3.041G
2.417G
850.5M
32.08M
8.902M
5.101M
2.261M
2.524M
7.967M
77.20k
Audio/Video
Any-Source Multicast
Real Player
Windows Media
Backbone Radio
H.323 Signaling
StreamWorks
Subset of VoIP
Camarades webcams
Single-Source Multicast
---
1.11%
0.49%
0.03%
0.01%
0.01%
0.00%
0.00%
0.00%
0.00%
---
4.739T
2.088T
113.4G
43.00G
25.08G
9.494G
4.906G
749.9M
0.000
---
0.88%
0.02%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
138.6G
2.477G
147.5M
55.23M
40.09M
15.90M
8.301M
3.774M
0.000
Misc
Mail
Squid
Port 0
DNS
X11
MS Windows
AFS
NFS
IRC
Telnet
NTP
AOL AIM
SOCKS
IDENT
SNMP
RPC Portmapper
RTIP
---
0.62%
0.28%
0.22%
0.13%
0.13%
0.04%
0.03%
0.02%
0.01%
0.01%
0.01%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
2.627T
1.211T
924.9G
563.6G
536.1G
189.5G
145.6G
79.19G
43.93G
43.00G
30.64G
20.11G
17.38G
9.549G
4.087G
646.6M
78.90M
---
0.03%
0.01%
0.00%
0.03%
0.01%
0.85%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
4.741G
2.046G
735.8M
4.943G
1.023G
133.8G
390.1M
475.5M
217.0M
258.3M
401.3M
31.37M
30.18M
34.01M
33.87M
10.98M
204.4k
Games
DirectX
Battlenet
Half-Life
Quake
Asheron
Starsiege Tribes
Spy Arcade
---
0.26%
0.04%
0.01%
0.01%
0.00%
0.00%
0.00%
---
1.116T
185.4G
57.74G
48.69G
20.62G
5.029G
1.528G
---
0.01%
0.00%
0.00%
0.00%
0.00%
0.00%
0.00%
---
1.649G
364.4M
405.7M
70.37M
43.43M
10.76M
3.744M
Unidentified
Unidentified
---
24.75%
---
105.5T
---
96.50%
---
15.19T
Total
Total
---
100.00%
---
426.3T
---
100.00%
---
15.74T

The following table summarizes use of most popular IPv4 protocols:

Table 8. IP Protocols Distribution (Full Data set)

Protocols OctetsPackets
ICMP[1]0.06% 235.1G 0.02% 2.783G
IGMP[2]0.00% 384.0M 0.00% 11.69M
IP-ENCAP[4]0.00% 1.795G 0.00% 9.975M
TCP[6]85.53% 364.6T 2.27% 356.9G
UDP[17]11.22% 47.85T 0.39% 61.96G
IPv6[41]0.00% 7.886G 0.00% 25.06M
GRE[47]0.09% 364.4G 0.00% 775.3M
ESP[50]0.10% 406.7G 0.00% 723.7M
AX.25[93]0.01% 27.29G 0.43% 67.42G
PIM[103]0.00% 5.206G 0.00% 40.47M
IPMP[169]0.01% 21.82G 0.00% 303.0M
Other3.00% 12.78T 96.88% 15.25T
Total100.00% 426.3T 100.00% 15.74T

We compute average packet size of each flow by dividing the number of octets in a flow by the number of packets. Distribution of (average) packet sizes is as follows:

Table 9. Packet Sizes (Full Data Set)

Packet Size Packets
Small (<100B)98.36% 15.49T
Medium (100-1400B)0.57% 90.19G
Large (1401-1500B)0.99% 155.8G
Jumbo (>1500B)0.08% 12.46G
Total100.00% 15.74T

We only track DSCP values for which special treatment was defined by Internet2 QoS working group (and the default of DSCP=0):

Table 10. Important DSCP Values (Full Data Set)

Type OctetsPackets
Best effort [DSCP=0]91.48% 390.0T 50.20% 7.906T
Scavenger [DSCP=8]0.21% 900.4G 0.01% 1.085G
EF [DSCP=46]0.00% 2.375G 0.00% 9.666M
Other8.31% 35.41T 49.79% 7.841T
Total100.00% 426.3T 100.00% 15.74T

We collect statistics about ECN-capable traffic:

Table 11. ECN-Capable Traffic

Type OctetsPackets
ECN-Capable1.90% 8.111T 44.04% 6.936T

To facilitate detection of emerging applications, we present statistics about frequently encountered unidentified port numbers (no distinction is made in this table between TCP and UDP):

Table 12. Frequent Unidentified Ports

Port OctetsPackets
202240.74% 3.166T 22.50% 3.543T
400000.73% 3.122T 0.01% 2.330G
80900.72% 3.052T 0.02% 3.211G
790.53% 2.276T 16.46% 2.593T
191010.40% 1.715T 0.01% 1.908G