grazia.co.in
robots.txt
Robots Exclusion Standard data for grazia.co.in
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | grazia.co.in |
Base Domain | grazia.co.in |
Scan Status | Ok |
Last Scan | 2024-05-25T03:56:30+00:00 |
Next Scan | 2024-06-01T03:56:30+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-25T03:56:30+00:00 |
URL | https://grazia.co.in/robots.txt |
Redirect | https://www.grazia.co.in/robots.txt |
Redirect Domain | www.grazia.co.in |
Redirect Base | grazia.co.in |
Domain IPs | 184.50.85.132, 2600:1417:3f::b81c:eb2b, 2600:1417:3f::b81c:eb40, 96.17.180.24 |
Redirect IPs | 184.50.85.132, 2600:1413:a000::1734:2872, 2600:1413:a000::1734:2873, 96.17.180.24 |
Response IP | 184.50.85.132 |
Found | Yes |
Hash | 324c55969b1f011ed688dccd46ded6f137c3c18b98f60d1b9608f2e3866c8dba |
SimHash | 0f1e59605133 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /7176/* |
Disallow | /27489895/* |
Disallow | /temp/ |
Disallow | /2db/ |
Disallow | /static_pages/ |
Disallow | /tpl/ |
Disallow | /gateway/ |
Disallow | /common/ |
Disallow | /google_plus/ |
Disallow | /fb/ |
Disallow | /twitter_oauth/ |
Disallow | /crons/ |
Disallow | /SolrApi/ |
Disallow | /config/ |
Disallow | /api/ |
Disallow | /classes/ |
Disallow | /testEsi/ |
Disallow | /HTML/ |
Disallow | /captcha/ |
Disallow | /ssonew/ |
Disallow | /sso/ |
Disallow | /search/* |
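Because the group lists `Allow: /` ahead of the more specific `Disallow` rows, the result for a given path depends on how a crawler resolves overlapping rules. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and `Allow` wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers, only a subset of the rows is included for brevity, and example paths such as `/fashion/article` are illustrative rather than taken from the scan.

```python
import re

# Subset of the "*" group from the table above; remaining Disallow rows
# follow the same shape and could be appended unchanged.
RULES = [
    ("allow", "/"),
    ("disallow", "/7176/*"),
    ("disallow", "/temp/"),
    ("disallow", "/api/"),
    ("disallow", "/sso/"),
    ("disallow", "/search/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            length = len(pattern)
            # Longer pattern wins; on equal length, prefer Allow.
            if length > best_len or (length == best_len and allow):
                best_len = length
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/fashion/article", "/search/shoes", "/api/v1", "/temporary"]:
        print(p, is_allowed(p))
```

Under this reading, `/search/shoes` and `/api/v1` are blocked while `/` and `/temporary` remain crawlable. A parser that instead applies the first matching rule would allow every path here, because `Allow: /` is listed first.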
Other Records
Field | Value |
---|---|
sitemap | https://www.grazia.co.in/sitemap.xml |
sitemap | https://www.grazia.co.in/gImageSiteMap.xml |
sitemap | https://www.grazia.co.in/gNewsSiteMap.xml |