Robots Exclusion Standard data for izmitharunyakarsa.site
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | izmitharunyakarsa.site |
Base Domain | izmitharunyakarsa.site |
Scan Status | Ok |
Last Scan | 2025-09-12T07:26:03+00:00 |
Next Scan | 2025-10-12T07:26:03+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-12T07:26:03+00:00 |
URL | https://izmitharunyakarsa.site/robots.txt |
Domain IPs | 104.21.9.176, 172.67.161.36, 2606:4700:3032::6815:9b0, 2606:4700:3034::ac43:a124 |
Response IP | 104.21.9.176 |
Found | Yes |
Hash | 04a476bf44d7177a4f5419ef8be24c544980dc9e71cc2560ef93b99b331cb85e |
SimHash | 0b557cda0b78 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /*.html$ |
Disallow | /*.shtml$ |
Disallow | /*.xhtml$ |
Disallow | /*.asp$ |
Disallow | /*.php$ |
Disallow | /*.cache$ |
Disallow | /*.cgi$ |
Disallow | /profile/ |
Disallow | /*%3A* |
Disallow | /*?* |
Disallow | /?SI* |
Disallow | /*%21* |
Disallow | /*_* |
Disallow | /*%* |
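
To see how the rules above apply to concrete URL paths, here is a minimal sketch in Python. It assumes the common wildcard semantics for robots.txt patterns (`*` matches any run of characters, a trailing `$` anchors the end of the path, and every pattern is matched from the start of the path). The helper names `rule_to_regex`, `matching_rules`, and `is_disallowed` are hypothetical and not part of the scan data; individual crawlers may differ in details such as percent-encoding normalization.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/*.html$", "/*.shtml$", "/*.xhtml$", "/*.asp$", "/*.php$",
    "/*.cache$", "/*.cgi$", "/profile/", "/*%3A*", "/*?*",
    "/?SI*", "/*%21*", "/*_*", "/*%*",
]

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        body += r"\Z"
    return re.compile(body)

def matching_rules(path: str) -> list[str]:
    """Return every Disallow pattern that matches the given path."""
    # re.match anchors at the start of the string, giving prefix matching.
    return [rule for rule in DISALLOW if rule_to_regex(rule).match(path)]

def is_disallowed(path: str) -> bool:
    """The group has no Allow rules, so any matching Disallow blocks the path."""
    return bool(matching_rules(path))

if __name__ == "__main__":
    for path in ["/", "/about", "/index.html", "/profile/alice",
                 "/search?q=x", "/my_page", "/a%3Ab"]:
        print(path, is_disallowed(path), matching_rules(path))
```

Under these assumptions, `/` and `/about` remain crawlable, while `/index.html`, `/profile/alice`, `/search?q=x`, and `/my_page` are blocked. The sketch also shows that `/*%*` overlaps with the narrower `/*%3A*` and `/*%21*` rules: any path containing `%3A` or `%21` already contains `%`.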