Survey Data Mining:   Home | FAQ | Archive | Glossary
Free Reports
  


You are viewing an outdated report. The latest version of this report was published on October 1st, 2024


    
Weighed Web Server Share
March 1st, 2002

Report Description
How popular is a web server? Is it based on installed base, web pages served out, or other means? This weighed web server share report provides a market share analysis of web servers by weighing a web server based on how many sites link to it (see methodology).

As a result, some web servers that may not rank very high on our regular survey, may rank highly here, if they are used by very well known, highly referenced web sites. As the number of websites change (top 50, top 250 and top 1000 websites from our survey), the results are also very interesting and different.

Weighed Web Server Share - Top 50 Sites
Weighed Web Server Share - Top 50 Sites Web Server Historical Share - Top 50 Sites

Web ServerShare
Apache47.10%
Microsoft-IIS18.06%
Not Known12.10%
Netscape-Enterprise 9.22%
Stronghold 3.75%
GWS 2.73%
0W 1.64%
Roxen 1.45%
AV 1.40%
AOLserver 1.30%
DCLK-HttpSvr 1.25%

Weighed Web Server Share - Top 250 Sites
Weighed Web Server Share - Top 250 Sites Web Server Historical Share - Top 250 Sites

Web ServerShare
Apache47.50%
Microsoft-IIS13.90%
Netscape-Enterprise10.24%
Zeus 9.81%
Not Known 8.16%
Stronghold 2.08%
GWS 1.38%
AOLserver 1.13%
AV 0.97%
DCLK-HttpSvr 0.83%
0W 0.82%
AkamaiGHost 0.76%
Roxen 0.73%
IBM_HTTP_Server 0.67%
Resin 0.25%
Y.G.Apache-SSLv3 0.23%
Hello! 0.19%
ConcentricHost-Naram 0.18%
Netscape-Commerce 0.18%

Weighed Web Server Share - Top 1000 Sites
Weighed Web Server Share - Top 1000 Sites
Web Server Historical Share - Top 1000 Sites

Web ServerShare
Apache54.23%
Microsoft-IIS13.38%
Netscape-Enterprise 9.73%
Zeus 7.63%
Not Known 5.94%
Stronghold 1.89%
GWS 0.92%
AkamaiGHost 0.79%
AV 0.70%
AOLserver 0.66%
Roxen 0.48%
IBM_HTTP_Server 0.46%
DCLK-HttpSvr 0.45%
0W 0.45%
thttpd 0.25%
Resin 0.23%
MIT 0.14%
Y.G.Apache-SSLv3 0.13%
swcd 0.12%
Hello! 0.11%

Methodology

Our goal with this report is to get closer to answering the question of how much traffic is served up on the internet by various servers than a typical web server count would reveal. We want to distinguish between a highly trafficed portal and a site that has only a couple visitors per month.

Since we don't have access to the actual traffic logs from all the different web servers in question, we focused on a different technique to determine the popularity of servers. The technique measures how authoritative a web server appears to be on the internet by how often it is referenced by other sites. The idea is that a server that is referenced by hundreds or thousands of sites wil in general have much more traffic than a web server that is referenced only a hand full of times, or not at all.

Our survey each month reports on the most popularly referred sites, which we call the popular referral report. This gives us a measure of how authoritative a site is. By combining this information with our existing knowledge of what server is running at the site in the report, we build a ranking of web servers based on referral popularity.

The scoring algorithm goes as follows:

  • Look up the next host in the referral report
  • Determine how many hosts referred to this web site
  • Determine the server signature (e.g. Apache, Netscape)
  • Add to this server type the number of hosts referring to the site
  • Go back to step 1 and look up the next host.

This is process is done 3 times, once for the top 50 sites, once for the top 250 sites, and once for the top 1000 sites. The percentages shown assume that the sites examined - 50, 250 or 1000 respectively - carried all internet traffic. While this is clearly not the case, the percentages are at least accurate relative to each other.

Finally, remember that the referral report (and thus how authoritative a site is) is based on a sample of the web. There is an inherent limitation in this report, which causes some site's referral count to be higher than warranted. Usually, this is a result of a suite of portals, all operated by the same group, each that refers to all the others. The effect of this is to lend a higher than justified weight to some sites. This results in a spiky historical trend line in the weighed graphs.

This problem is exasperated by the fact that our general crawler, up to and including the August 2000 survey, crawled our known list of sites alphabetically. This caused several of these portal "suites" to be clustered together due to their names. As of September 2000, the crawler has been randomized to limit the impact of this problem.

Because of the above problems, we recommend that the graphs be interpreted by paying limited attention to spikes in the graph, and more attention to general trends, such as server X appears to have more share than server "Y" when examined over the course of an 3-4 months.



© 1998-2024 E-Soft Inc. All rights reserved.