Inspired by this post commentary by Kev Quirk, I decided to look into my own logfiles of the past months and see how much bot visits I’m getting.
Some findings:
- No, bots are not eating my blog, two thirds of all requests seem human.
- But boy, there are a lot of bots around nowadays!
- Surprisingly, Googlebot isn’t the hungriest bot. It’s OpenAI/ChatGPT!
- Strong contender: Ahrefsbot – they’re really nosy! I can see why you would pay for their service, though, given that browsers generally don’t send referer headers anymore.
- Bytespider, operated by TikTok’s parent company ByteDance, also seems really into Amiga, demoscene, and Teletext content.
- Of course, all of this can only take bots into account that play nice and identify themselves as bots.
The top user agents of the past months are:
| # | Hits | Percentage | Identified as |
|---|---|---|---|
| 1 | 316,944 | 66.3% | User agent looking like a human-operated browser |
| 2 | 22,622 | 4.7% | openai.com |
| 3 | 21,574 | 4.5% | GoogleBot |
| 4 | 15,075 | 3.2% | AhrefsBot |
| 5 | 11,407 | 2.4% | Scrapy |
| 6 | 9,140 | 1.9% | Bytespider |
| 7 | 6,474 | 1.4% | bingbot |
| 8 | 5,815 | 1.2% | babbar.tech |
| 9 | 5,105 | 1.1% | ClaudeBot |
| 10 | 4,895 | 1.0% | Amazonbot |
| 11 | 4,298 | 0.9% | facebook.com |
| 12 | 4,185 | 0.9% | Applebot |
| 13 | 3,666 | 0.8% | PetalBot |
| 14 | 3,532 | 0.7% | SemrushBot |
| 15 | 3,449 | 0.7% | ImagesiftBot |
| 16 | 2,564 | 0.5% | MJ12bot |
| 17 | 2,401 | 0.5% | DuckDuckBot |
| 18 | 2,134 | 0.4% | BLEXBot |
| 19 | 1,961 | 0.4% | Empty (“-”) |
| 20 | 1,428 | 0.3% | YandexBot |
| 21 | 1,118 | 0.2% | curl |
| 22 | 1,100 | 0.2% | Dalvik |
| 23 | 1,058 | 0.2% | VelenPublicWebCrawler |
| 24 | 1,038 | 0.2% | Timpibot |
| 25 | 1,001 | 0.2% | serpstatbot |
| 26 | 970 | 0.2% | AwarioBot |
| 27 | 967 | 0.2% | SeznamBot |
| 28 | 963 | 0.2% | DotBot |
| 29 | 881 | 0.2% | DataForSeoBot |
| 30 | 870 | 0.2% | coccocbot |
The bot-vs-human ratio roughly ends up like this:
| Percentage of all requests | Category |
|---|---|
| 66.3% | Probably human |
| 33.7% | Bot |
So… go humans, I guess? That is, unless we filter out all file requests (CSS, images, other media). Then we get:
| Percentage of page requests | Category |
|---|---|
| 50.8% | Bot |
| 49.2% | Probably human |
In any case it was lovely to see some Amiga users visiting with IBrowse on AmigaOS 3.2 and 3.9! And I guess the “PSP (PlayStation Portable)” user agent is a nerdy joke? :)