iloveindia.com
robots.txt
Robots Exclusion Standard data for iloveindia.com
Resource Scan
Scan Details
Site Domain | iloveindia.com |
Base Domain | iloveindia.com |
Scan Status | Ok |
Last Scan | 2024-11-08T14:05:57+00:00 |
Next Scan | 2024-11-15T14:05:57+00:00 |
Last Scan
Scanned | 2024-11-08T14:05:57+00:00 |
URL | https://iloveindia.com/robots.txt |
Domain IPs | 104.21.13.206, 172.67.133.25, 2606:4700:3032::ac43:8519, 2606:4700:3033::6815:dce |
Response IP | 172.67.133.25 |
Found | Yes |
Hash | 23a0e9181d5bf34f9c6cdf91e965fbfca10dedbc660cf9a2e8d226d9dad757b1 |
SimHash | b914cf646dd0 |
Groups
*
Rule | Path |
---|---|
Disallow | /adnetwork/ |
Disallow | /directory/ |
Disallow | /hotelsinindia/query/ |
Disallow | /index_new.html |
Disallow | /includes/ |
Disallow | /verify |