dtruyen.net
robots.txt

Robots Exclusion Standard data for dtruyen.net

Resource Scan

Scan Details

Site Domain dtruyen.net
Base Domain dtruyen.net
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-01-04T05:30:46+00:00
Next Scan 2025-04-04T05:30:46+00:00

Last Successful Scan

Scanned2024-06-09T03:23:43+00:00
URL https://dtruyen.net/robots.txt
Domain IPs 91.195.240.94
Response IP 91.195.240.94
Found Yes
Hash ec0067fb0925e79d7a0158e78266ecc507f27b30733f13fad8df5958bfad6110
SimHash 4814d6008792

Groups

googlebot

Rule Path
Disallow /info/
Disallow /search/

mediapartners-google

Rule Path
Disallow /info/
Disallow /search/

yahoo! slurp

Rule Path
Allow /$
Disallow /

bingbot

Rule Path
Allow /$
Disallow /

yandex

Rule Path
Allow /$
Disallow /

baiduspider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow

ips-agent

Rule Path
Disallow /parking.php4

blexbot

Rule Path
Disallow /

pandalytics

Rule Path
Disallow /info/
Disallow /search/

ioncrawl

Rule Path
Disallow /info/
Disallow /search/

*

Rule Path
Disallow /