blog.theropod.tk
robots.txt

Robots Exclusion Standard data for blog.theropod.tk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	blog.theropod.tk
Base Domain	theropod.tk
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-05-18T18:37:18+00:00
Next Scan	2025-08-16T18:37:18+00:00

Last Successful Scan

Scanned	2023-04-07T17:51:59+00:00
URL	https://blog.theropod.tk/robots.txt
Domain IPs	104.21.21.160, 172.67.199.90, 2606:4700:3031::6815:15a0, 2606:4700:3036::ac43:c75a
Response IP	172.67.199.90
Found	Yes
Hash	931621fa3a2ef060d005b97cbc519845542ad48140ddff47b65dc6601a32c166
SimHash	584a9c402113

Groups

*

Rule	Path
Disallow	/images/
Disallow	/js/
Disallow	/css/

Rule

Path

Disallow

/images/

Disallow

/js/

Disallow

/css/

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

sistrix

Rule	Path
Disallow	/

Rule

Path

Disallow

uptimerobot/2.0

Rule	Path
Disallow	/

Rule

Path

Disallow

ezooms robot

Rule	Path
Disallow	/

Rule

Path

Disallow

perl lwp

Rule	Path
Disallow	/

Rule

Path

Disallow

netestate ne crawler (+http://www.website-datenbank.de/)

Rule	Path
Disallow	/

Rule

Path

Disallow

wiseguys robot

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitin robot

Rule	Path
Disallow	/

Rule

Path

Disallow

heritrix

Rule	Path
Disallow	/

Rule

Path

Disallow

pimonster

Rule	Path
Disallow	/

Rule

Path

Disallow

surdotlybot

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://blog.theropod.tk/sitemap.xml

Field

Value

sitemap

https://blog.theropod.tk/sitemap.xml

Comments

Block SISTRIX
Block Uptime robot
Block Ezooms Robot
Block Perl LWP
Block netEstate NE Crawler (+http://www.website-datenbank.de/)
Block WiseGuys Robot
Block Turnitin Robot
Block Heritrix
Block pricepi

Warnings

4 invalid lines.

blog.theropod.tkrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

mj12bot

ahrefsbot

blexbot

sistrix crawler

sistrix

uptimerobot/2.0

ezooms robot

perl lwp

netestate ne crawler (+http://www.website-datenbank.de/)

wiseguys robot

turnitin robot

heritrix

pimonster

surdotlybot

zoominfobot

Other Records

Comments

Warnings

blog.theropod.tk
robots.txt