gemini://://gemini.bortzmeyer.org/software/lupa/stats.gmi
View on Gemini
#

Statistics on the Gemini space

This page presents some statistics on the current state of the Gemini space. It has been updated on 2024-05-18 03:04:00Z.

It cannot claim to represent the entire space. The real number of URIs is certainly higher. There are several reasons why many URIs are not in the database:

  • the capsule may forbid retrieval, through robots.txt,
  • we do not know all the URIs and some cannot be found from the ones we know,
  • Lupa has a maximum number of URIs per capsule, to save resources (currently 10000).

On this page, "working" means there was a successful connection recently. "recently" means "less than 31 days". "Dead" URLs and capsules are removed after 46 days and no longer appear in any statistics.

Currently, our database includes 614,296 URIs, 501,464 of them having been checked successfully (status code 20) and recently. Among the recently accessed, 384,183 URIs serve a Gemini content.

##

Resources

The average size of the resources is 60,059 bytes.

###

Quantiles

  • 10% of the resources are 251 bytes or less,
  • 20% of the resources are 500 bytes or less,
  • 30% of the resources are 834 bytes or less,
  • 40% of the resources are 1,376 bytes or less,
  • 50% of the resources are 2,501 bytes or less, MEDIAN
  • 60% of the resources are 4,539 bytes or less,
  • 70% of the resources are 6,841 bytes or less,
  • 80% of the resources are 15,235 bytes or less,
  • 90% of the resources are 81,810 bytes or less,
  • 100% of the resources are 4,156,230 bytes or less.

###

# Quantiles only for Gemini pages

  • 10% of the resources are 225 bytes or less,
  • 20% of the resources are 387 bytes or less,
  • 30% of the resources are 685 bytes or less,
  • 40% of the resources are 953 bytes or less,
  • 50% of the resources are 1,543 bytes or less, MEDIAN
  • 60% of the resources are 2,629 bytes or less,
  • 70% of the resources are 4,515 bytes or less,
  • 80% of the resources are 6,223 bytes or less,
  • 90% of the resources are 10,863 bytes or less,
  • 100% of the resources are 4,156,230 bytes or less.

###

Ranges

  • Less than 10 bytes: 3297 URLs (0.66 %)
  • 10 to 100 bytes: 13165 URLs (2.6 %)
  • 100 to 1000 bytes: 154066 URLs (30.7 %)
  • 1 to 10 kbytes: 211658 URLs (42.2 %)
  • 10 to 100 kbytes: 72844 URLs (14.5 %)
  • 100 to 1000 kbytes: 28497 URLs (5.7 %)
  • More than 1000 kbytes: 17937 URLs (3.58 %)

###

Most common media (MIME) types

  • text/gemini: 384,183 URLs
  • image/jpeg: 24,231 URLs
  • text/plain: 23,508 URLs
  • image/png: 22,871 URLs
  • application/octet-stream: 15,008 URLs
  • application/pdf: 11,972 URLs
  • application/zip: 2,701 URLs
  • image/svg+xml: 2,672 URLs
  • image/gif: 1,884 URLs
  • audio/mpeg: 1,407 URLs
  • text/xml: 1,209 URLs
  • text/x-diff: 873 URLs
  • text/x-go: 858 URLs
  • text/html: 752 URLs
  • application/json: 682 URLs
  • text/markdown: 680 URLs
  • application/atom+xml: 661 URLs
  • application/javascript: 573 URLs
  • image/webp: 481 URLs
  • audio/ogg: 338 URLs

###

Most common languages

  • Unspecified: 374,479 URLs
  • en: 93,048 URLs
  • de: 11,783 URLs
  • it: 7,216 URLs
  • fr: 6,689 URLs
  • es: 2,701 URLs
  • es_ar: 1,170 URLs
  • fa: 1,020 URLs
  • arb: 562 URLs
  • ja: 562 URLs
  • ru: 560 URLs
  • en_gb: 362 URLs
  • en_us: 211 URLs
  • en_au: 207 URLs
  • grc: 203 URLs
  • he: 142 URLs
  • pl: 98 URLs
  • eo: 84 URLs
  • sv: 72 URLs
  • gl: 52 URLs

###

Most common language tags

  • Unspecified: 374,433 URLs
  • en: 38,972 URLs
  • en-us: 30,841 URLs
  • en-gb: 22,271 URLs
  • de: 11,725 URLs
  • it: 7,216 URLs
  • fr: 5,870 URLs
  • es-es: 1,579 URLs
  • es_ar: 1,170 URLs
  • es: 1,091 URLs
  • fa: 1,020 URLs
  • fr-fr: 819 URLs
  • ja: 562 URLs
  • arb: 562 URLs
  • en-ie: 464 URLs
  • ru: 425 URLs
  • en_gb: 362 URLs
  • en-ca: 322 URLs
  • en_us: 211 URLs
  • en_au: 207 URLs

###

Most common encodings ("charsets") for all files

(Remember there exists testing capsules, with very exotic encodings, so don't be surprised by some strange ones.)

  • Unspecified: 425,501 URLs
  • utf-8: 75,666 URLs
  • binary: 205 URLs
  • us-ascii: 82 URLs
  • gzip: 5 URLs
  • xz: 2 URLs
  • bzip2: 2 URLs
  • iso-8859-1: 1 URLs

###

Most common encodings for gemtext files only

  • Unspecified: 318,975 URLs
  • utf-8: 65,207 URLs
  • iso-8859-1: 1 URLs

By the way, 1,616 of recently tested URLs (0.274 %) have a wrong encoding (it does not match the actual content).

###

Status codes

(Remember there are test capsules with funny status codes, to exercice Gemini clients.)

  • 20 (Success): 501,464 occurrences (91.09 %)
  • 51 (Not found): 19,734 occurrences (3.58 %)
  • 10 (Input request): 12,501 occurrences (2.27 %)
  • 60 (Client certificate request): 5,804 occurrences (1.05 %)
  • 40 (Temporary failure): 5,065 occurrences (0.92 %)
  • 30 (Temporary redirect): 4,362 occurrences (0.79 %)
  • 42 (CGI error): 898 occurrences (0.16 %)
  • 31 (Permanent redirect): 297 occurrences (0.05 %)
  • 44 (Slow down): 134 occurrences (0.02 %)
  • 59 (Bad request): 90 occurrences (0.02 %)
  • 50 (Permanent failure): 74 occurrences (0.01 %)
  • 53 (Proxy request refused): 68 occurrences (0.01 %)

##

Links

(We count only backlinks from external capsules, and at most one link per capsule. Also, we exclude links from capsules like search engines or directories.)

Maximum number of incoming links: 302

Average number of incoming links: 0.26

##

Capsules

There are 3749 capsules. We successfully connected recently to 2754 of them.

###

Most common capsules by number of working URLs

We have a limit of 10000 URLs per capsule.

  • musicbrainz.uploadedlobster.com: 9976 URLs
  • 1436.ninja: 9965 URLs
  • scholasticdiversity.us.to: 9910 URLs
  • tjp.lol: 9844 URLs
  • gemini.omarpolo.com: 9745 URLs
  • gemini.conman.org: 9744 URLs
  • taz.de: 9741 URLs
  • oracular.space: 9739 URLs
  • gemini.techrights.org: 9718 URLs
  • kvazar.duckdns.org: 9706 URLs
  • gemini.tuxmachines.org: 9686 URLs
  • gemlog.stargrave.org: 9678 URLs
  • caiofior.pollux.casa: 9673 URLs
  • gemini.autonomy.earth: 9666 URLs
  • gmi.noulin.net: 9601 URLs
  • bbs.geminispace.org: 9579 URLs
  • hoagie.space: 9576 URLs
  • gemini.knusbaum.com: 9486 URLs
  • mirrors.apple2.org.za: 9398 URLs
  • midnight.pub: 9397 URLs

###

Most common capsules by number of bytes in working URLs

We have a limit of bytes per URL.

  • 1436.ninja: 9569.6 megabytes
  • mirrors.apple2.org.za: 2642.8 megabytes
  • nytpu.com: 1347.9 megabytes
  • uscoffings.net: 899.0 megabytes
  • gem.librehacker.com: 869.5 megabytes
  • gael.mooo.com: 729.7 megabytes
  • librehacker.com: 723.0 megabytes
  • jpfox.fr: 609.3 megabytes
  • yam655.com: 579.9 megabytes
  • dfdn.info: 554.7 megabytes
  • hoagie.space: 480.6 megabytes
  • gemini.omarpolo.com: 376.7 megabytes
  • mikelynch.org: 365.0 megabytes
  • si3t.ch: 335.3 megabytes
  • library.inu.red: 300.0 megabytes
  • ecs.d2evs.net: 294.8 megabytes
  • tweek.zyxxyz.eu: 286.0 megabytes
  • canary.city: 239.2 megabytes
  • gemi.dev: 209.0 megabytes
  • going-flying.com: 187.2 megabytes
  • shit.cx: 178.9 megabytes

All working capsules:

###

Certificates

2489 (90.4 %) capsules are self-signed, 210 (7.6 %) use the Certificate Authority Let's Encrypt, 55 (2.0 %) are signed by another CA (may be not a trusted one).

74 capsules (2.70 %) have an expired certificate.

Algorithms:

  • ecdsa-with-SHA256: 1736 capsules
  • sha256WithRSAEncryption: 990 capsules
  • ED25519: 18 capsules
  • sha512WithRSAEncryption: 5 capsules
  • ecdsa-with-SHA512: 3 capsules
  • ecdsa-with-SHA384: 1 capsules
  • sha384WithRSAEncryption: 1 capsules

Key types:

  • ECDSA: 1788 capsules
  • RSA: 948 capsules
  • ED25519: 18 capsules

Key sizes for RSA:

  • 2048: 667 capsules
  • 4096: 271 capsules
  • 3072: 7 capsules
  • 1024: 3 capsules

Key sizes for ECDSA:

  • 256: 1724 capsules
  • 384: 63 capsules
  • 521: 1 capsules

###

TLS

98 % of the capsules use TLS 1.3, 2 % use TLS 1.2.

###

robots.txt

272 (10 %) the capsules have a robots.txt exclusion file.

###

Ports

18 working capsules (0.7 %) use an alternative port

###

Addresses

1186 IP addresses used. 18 % are IPv6.

###

# Addresses with most virtual hosts

  • 173.230.145.243: 866 vhosts
  • 68.133.1.71: 411 vhosts
  • 213.219.38.200: 252 vhosts
  • 46.23.81.157: 115 vhosts
  • 2a03:6000:1813:1337::157: 89 vhosts
  • 90.65.170.44: 31 vhosts
  • 109.237.26.252: 31 vhosts
  • 45.56.93.217: 19 vhosts
  • 128.140.115.191: 11 vhosts
  • 51.222.161.16: 9 vhosts
  • 212.71.248.87: 9 vhosts
  • 2a03:6000:6f67:624::99: 8 vhosts
  • 81.187.234.86: 8 vhosts
  • 2a01:4f8:c17:20f1::42: 8 vhosts
  • 23.88.35.144: 8 vhosts
  • 46.23.94.99: 8 vhosts
  • 174.138.124.169: 8 vhosts
  • 85.208.51.149: 7 vhosts
  • 64.226.23.78: 6 vhosts
  • 66.175.211.51: 6 vhosts

##

TLDs

There are 252 TLDs in the capsule's names, and 1822 registered domains.

###

Most common TLDs

###

# By number of registered domains

  • com: 288 domains
  • net: 154 domains
  • org: 151 domains
  • xyz: 116 domains
  • space: 88 domains
  • site: 59 domains
  • de: 54 domains
  • dev: 51 domains
  • me: 45 domains
  • eu: 34 domains
  • fr: 29 domains
  • uk: 27 domains
  • info: 27 domains
  • io: 26 domains
  • club: 22 domains
  • ca: 16 domains
  • online: 16 domains
  • se: 15 domains
  • ru: 15 domains
  • cc: 15 domains

###

# By number of capsules

(There's a strong bias towards TLDs which have hosting services such as flounder.online, which has many capsules in subdomains. See before the TLDs per registered domains, which are probably more useful.)

  • online: 874 capsules
  • org: 602 capsules
  • com: 348 capsules
  • pub: 264 capsules
  • net: 181 capsules
  • xyz: 131 capsules
  • space: 102 capsules
  • site: 61 capsules
  • de: 59 capsules
  • dev: 54 capsules
  • club: 48 capsules
  • me: 45 capsules
  • eu: 42 capsules
  • casa: 38 capsules
  • io: 33 capsules
  • info: 32 capsules
  • fr: 31 capsules
  • uk: 30 capsules
  • ca: 20 capsules
  • cc: 20 capsules

##

Other statistics on the geminispace

##

Contact

Maintained by Stéphane Bortzmeyer (email <stephane+gemini@bortzmeyer.org>). Comments and criticisms are welcome.