interestingearth.com
robots.txt
Robots Exclusion Standard data for interestingearth.com
Resource Scan
Scan Details
Site Domain | interestingearth.com |
Base Domain | interestingearth.com |
Scan Status | Ok |
Last Scan | 2024-09-28T08:00:11+00:00 |
Next Scan | 2024-10-05T08:00:11+00:00 |
Last Scan
Scanned | 2024-09-28T08:00:11+00:00 |
URL | https://interestingearth.com/robots.txt |
Domain IPs | 2406:da18:9d0:143f:2124:4e9c:36a9:d9de, 52.221.42.138 |
Response IP | 52.221.42.138 |
Found | Yes |
Hash | 4d2f39a072ec04634233824992a6cd73bf55d834c8ad356d58ff4402efdefd01 |
SimHash | 4004c9544153 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /dev/ |
Disallow | /adds/ |
Disallow | /js/ |
Disallow | /css/ |
Disallow | /sablonok/ |
Other Records
Field | Value |
---|---|
sitemap | http://www.interestingearth.com/sitemap.xml |