billgath.com
robots.txt

Robots Exclusion Standard data for billgath.com

Resource Scan

Scan Details

Site Domain billgath.com
Base Domain billgath.com
Scan Status Ok
Last Scan2025-12-05T04:24:58+00:00
Next Scan 2026-01-04T04:24:58+00:00

Last Scan

Scanned2025-12-05T04:24:58+00:00
URL https://billgath.com/robots.txt
Domain IPs 104.21.56.120, 172.67.150.213, 2606:4700:3032::ac43:96d5, 2606:4700:3034::6815:3878
Response IP 104.21.56.120
Found Yes
Hash 161843e27b6961eff9d4e7dbf43a8e664669a3a2539b854224d5d2b0ee4020e9
SimHash 463e4733ced8

Groups

*

Rule Path
Disallow /api/
Disallow /cli/
Disallow /lts/
Disallow /mgmt/
Disallow /parentClasses/
Disallow /scruffy/cli/
Disallow /scruffy/logs/

Other Records

Field Value
crawl-delay 5

baiduspider

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

iisbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

Warnings

  • 2 invalid lines.