techiedelight.com
robots.txt

Robots Exclusion Standard data for techiedelight.com

Resource Scan

Scan Details

Site Domain techiedelight.com
Base Domain techiedelight.com
Scan Status Ok
Last Scan 2024-06-26T15:56:02+00:00
Next Scan 2024-07-03T15:56:02+00:00

Last Scan

Scanned 2024-06-26T15:56:02+00:00
URL https://techiedelight.com/robots.txt
Domain IPs 104.21.10.123, 172.67.190.36, 2606:4700:3030::ac43:be24, 2606:4700:3036::6815:a7b
Response IP 104.21.10.123
Found Yes
Hash 71217beb19c9c949bb10a305946d0797ae48b27040aeb4acdb9ab7c05432f329
SimHash 181ec9088fbb

Groups

*

Rule Path
Disallow /code/
Disallow /snippet/
Disallow /wp-admin/
Disallow /?blackhole
Disallow /practice/template/*
Allow /wp-admin/admin-ajax.php

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

httrack

Rule Path
Disallow /

wget

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /
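
How the groups above combine for a given crawler and path can be sketched in a few lines of Python. This is a minimal illustration, not the scanner's or any crawler's actual matcher. It assumes the longest-match rule described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), and it assumes a simplified exact, case-insensitive comparison of the user-agent name. The names DEFAULT_RULES, BLOCKED_AGENTS, pattern_to_regex and is_allowed are hypothetical helpers introduced here; the rule data is copied from the groups listed in this scan.

```python
import re

# Rules from the "*" group of the scan (order as listed).
DEFAULT_RULES = [
    ("disallow", "/code/"),
    ("disallow", "/snippet/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/?blackhole"),
    ("disallow", "/practice/template/*"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

# Agents whose group consists of the single rule "Disallow /".
BLOCKED_AGENTS = {
    "ccbot", "google-extended", "anthropic-ai", "claude-web", "httrack",
    "wget", "mj12bot", "seznambot", "dotbot", "blexbot",
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent, path):
    """Return True if `agent` may fetch `path` under the scanned rules.

    Assumption: agent names are compared exactly (case-insensitive);
    real crawlers usually match on a product token inside a longer string.
    """
    if agent.lower() in BLOCKED_AGENTS:
        rules = [("disallow", "/")]
    else:
        rules = DEFAULT_RULES

    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for agent, path in [
        ("Googlebot", "/wp-admin/admin-ajax.php"),
        ("Googlebot", "/wp-admin/options.php"),
        ("CCBot", "/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a crawler falling into the "*" group may fetch /wp-admin/admin-ajax.php, because the Allow pattern is longer than the Disallow /wp-admin/ pattern, while the rest of /wp-admin/ stays blocked. Any agent named in a "Disallow /" group is refused every path.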