pendikguveninx.site
robots.txt

Robots Exclusion Standard data for pendikguveninx.site

Resource Scan

Scan Details

Site Domain pendikguveninx.site
Base Domain pendikguveninx.site
Scan Status Ok
Last Scan 2025-09-13T09:23:38+00:00
Next Scan 2025-10-13T09:23:38+00:00

Last Scan

Scanned 2025-09-13T09:23:38+00:00
URL https://pendikguveninx.site/robots.txt
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.96.1
Found Yes
Hash 04a476bf44d7177a4f5419ef8be24c544980dc9e71cc2560ef93b99b331cb85e
SimHash 0b557cda0b78

Groups

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

*

Rule Path
Disallow /*.html$
Disallow /*.shtml$
Disallow /*.xhtml$
Disallow /*.asp$
Disallow /*.php$
Disallow /*.cache$
Disallow /*.cgi$
Disallow /profile/
Disallow /*%3A*
Disallow /*?*
Disallow /?SI*
Disallow /*%21*
Disallow /*_*
Disallow /*%*
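
The wildcard rules in the `*` group can be hard to read at a glance, so here is a minimal Python sketch of how a crawler might evaluate them. It assumes the common convention (as described in RFC 9309) that `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `rule_to_regex`, `is_disallowed` and `DISALLOW_ALL_AGENTS` are hypothetical helpers for illustration, not part of the scan tool, and the sketch skips percent-encoding normalization that a real crawler would perform.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW_ALL_AGENTS = [
    "/*.html$", "/*.shtml$", "/*.xhtml$", "/*.asp$", "/*.php$",
    "/*.cache$", "/*.cgi$", "/profile/", "/*%3A*", "/*?*", "/?SI*",
    "/*%21*", "/*_*", "/*%*",
]


def rule_to_regex(path: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: `*` matches any sequence of characters and a
    trailing `$` pins the match to the end of the URL path. Without `$`
    the pattern is a prefix match.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path: str, rules=DISALLOW_ALL_AGENTS) -> bool:
    """Return True if any Disallow rule matches from the start of the path.

    The group has no Allow rules, so a single match is enough here.
    """
    return any(rule_to_regex(rule).match(url_path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/", "/about", "/index.html", "/profile/42", "/my_page"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, paths ending in one of the listed extensions, anything under `/profile/`, and any path containing `?`, `_` or `%` are blocked for generic crawlers, while plain paths such as `/about` remain crawlable. The `ia_archiver` and `archive.org_bot` groups need no matcher, since `Disallow /` blocks every path.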