internethistoryguy.com
robots.txt

Robots Exclusion Standard data for internethistoryguy.com

Resource Scan

Scan Details

Site Domain internethistoryguy.com
Base Domain internethistoryguy.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-15T02:45:10+00:00
Next Scan 2026-01-13T02:45:10+00:00

Last Successful Scan

Scanned2024-06-22T22:37:05+00:00
URL https://internethistoryguy.com/robots.txt
Domain IPs 185.230.63.171
Response IP 185.230.63.171
Found Yes
Hash e906be2d992343012d4ea8584810af57ced0796dbe87876a7ea8cf60bc05f012
SimHash 48d6ca42d716

Groups

*

Rule Path
Allow /

googlebot

Rule Path
Disallow *?lightbox=

adsbot-google-mobile
adsbot-google

Rule Path
Disallow /_api/*
Disallow /_partials*
Disallow /pro-gallery-webapp/v1/galleries/*

petalbot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.internethistoryguy.com/sitemap.xml

Comments

  • Optimization for Google Ads Bot
  • Block PetalBot
  • Crawl delay for overly enthusiastic bots
  • Auto generated, go to SEO Tools > Robots.txt Editor to change this