izmitharunyakarsa.site
robots.txt

Robots Exclusion Standard data for izmitharunyakarsa.site

Resource Scan

Scan Details

Site Domain izmitharunyakarsa.site
Base Domain izmitharunyakarsa.site
Scan Status Ok
Last Scan 2025-09-12T07:26:03+00:00
Next Scan 2025-10-12T07:26:03+00:00

Last Scan

Scanned 2025-09-12T07:26:03+00:00
URL https://izmitharunyakarsa.site/robots.txt
Domain IPs 104.21.9.176, 172.67.161.36, 2606:4700:3032::6815:9b0, 2606:4700:3034::ac43:a124
Response IP 104.21.9.176
Found Yes
Hash 04a476bf44d7177a4f5419ef8be24c544980dc9e71cc2560ef93b99b331cb85e
SimHash 0b557cda0b78

Groups

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

*

Rule Path
Disallow /*.html$
Disallow /*.shtml$
Disallow /*.xhtml$
Disallow /*.asp$
Disallow /*.php$
Disallow /*.cache$
Disallow /*.cgi$
Disallow /profile/
Disallow /*%3A*
Disallow /*?*
Disallow /?SI*
Disallow /*%21*
Disallow /*_*
Disallow /*%*
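
Example: checking a path against these groups

The groups above use the two common robots.txt wildcards: `*` for any run of characters and a trailing `$` to anchor the end of the path. The following is a minimal Python sketch of how a crawler might evaluate them. The names `GROUPS`, `rule_to_regex`, `rules_for`, and `is_disallowed` are hypothetical and not part of the scan data. The sketch assumes exact, case-insensitive user-agent matching and skips percent-encoding normalization. Allow/Disallow precedence is not handled because none of the groups listed here contain an Allow rule.

```python
import re

# Disallow rules copied from the groups listed in the scan above.
GROUPS = {
    "ia_archiver": ["/"],
    "archive.org_bot": ["/"],
    "*": [
        "/*.html$", "/*.shtml$", "/*.xhtml$", "/*.asp$", "/*.php$",
        "/*.cache$", "/*.cgi$", "/profile/", "/*%3A*", "/*?*",
        "/?SI*", "/*%21*", "/*_*", "/*%*",
    ],
}


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the path. Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def rules_for(user_agent):
    """Pick the group for a user agent, falling back to '*' (assumed logic)."""
    for name, rules in GROUPS.items():
        if name != "*" and name.lower() == user_agent.lower():
            return rules
    return GROUPS["*"]


def is_disallowed(path, user_agent="*"):
    """Return True if any Disallow rule in the selected group matches."""
    return any(rule_to_regex(rule).match(path) for rule in rules_for(user_agent))


if __name__ == "__main__":
    for path in ["/", "/about", "/index.html", "/profile/alice", "/search?q=1"]:
        print(path, is_disallowed(path))
```

Under these assumptions, a generic crawler may still fetch `/` and extensionless paths such as `/about`, while `ia_archiver` and `archive.org_bot` are blocked from every path.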