environmentportal.in
robots.txt
Robots Exclusion Standard data for environmentportal.in
Resource Scan
Scan Details
Site Domain | environmentportal.in |
Base Domain | environmentportal.in |
Scan Status | Ok |
Last Scan | 2024-09-03T16:29:30+00:00 |
Next Scan | 2024-10-03T16:29:30+00:00 |
Last Scan
Scanned | 2024-09-03T16:29:30+00:00 |
URL | https://environmentportal.in/robots.txt |
Redirect | https://www.environmentportal.in/robots.txt |
Redirect Domain | www.environmentportal.in |
Redirect Base | environmentportal.in |
Domain IPs | 104.21.93.173, 172.67.213.69, 2606:4700:3030::6815:5dad, 2606:4700:3031::ac43:d545 |
Redirect IPs | 104.21.93.173, 172.67.213.69, 2606:4700:3030::6815:5dad, 2606:4700:3031::ac43:d545 |
Response IP | 104.21.93.173 |
Found | Yes |
Hash | 3f435e509fd3c55c5a2ca0770f26fc452327b4ba4b08d511ad79364776b0272e |
SimHash | ee37c430bd90 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-json/ |
Disallow | /?s=* |
Disallow | /search/* |
Disallow | /cdn-cgi/bm/cv/ |
Disallow | /cdn-cgi/challenge-platform/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.environmentportal.in/sitemap_index.xml |
Comments