stephen-gose.com
robots.txt

Robots Exclusion Standard data for stephen-gose.com

Resource Scan

Scan Details

Site Domain stephen-gose.com
Base Domain stephen-gose.com
Scan Status Ok
Last Scan2024-11-09T10:03:51+00:00
Next Scan 2024-11-16T10:03:51+00:00

Last Scan

Scanned2024-11-09T10:03:51+00:00
URL https://stephen-gose.com/robots.txt
Domain IPs 162.210.96.129
Response IP 162.210.96.129
Found Yes
Hash 6f45be0fb4df09ee2b78f6d53653223fe3f12a03bc45b4814d3a302f36bf961f
SimHash 2a566eb189b2

Groups

*

Rule Path
Disallow /admin/
Disallow /backups/*
Disallow /data/*
Disallow /plugins/*
Disallow /slm/*

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.www.stephen-gose.com/sitemap.xml

Comments

  • Unknown robot
  • Unknown
  • Java/1.7.0_21
  • Java/1.6.0_04
  • robot
  • spider
  • robot.txt
  • bot*
  • .bot
  • None
  • legs
  • crawler
  • crawl

Warnings

  • 39 invalid lines.
  • `host` is not a known field.