grazia.co.in
robots.txt

Robots Exclusion Standard data for grazia.co.in

Resource Scan

Scan Details

Site Domain grazia.co.in
Base Domain grazia.co.in
Scan Status Ok
Last Scan2024-05-25T03:56:30+00:00
Next Scan 2024-06-01T03:56:30+00:00

Last Scan

Scanned2024-05-25T03:56:30+00:00
URL https://grazia.co.in/robots.txt
Redirect https://www.grazia.co.in/robots.txt
Redirect Domain www.grazia.co.in
Redirect Base grazia.co.in
Domain IPs 184.50.85.132, 2600:1417:3f::b81c:eb2b, 2600:1417:3f::b81c:eb40, 96.17.180.24
Redirect IPs 184.50.85.132, 2600:1413:a000::1734:2872, 2600:1413:a000::1734:2873, 96.17.180.24
Response IP 184.50.85.132
Found Yes
Hash 324c55969b1f011ed688dccd46ded6f137c3c18b98f60d1b9608f2e3866c8dba
SimHash 0f1e59605133

Groups

*

Rule Path
Allow /
Disallow /7176/*
Disallow /27489895/*
Disallow /temp/
Disallow /2db/
Disallow /static_pages/
Disallow /tpl/
Disallow /gateway/
Disallow /common/
Disallow /google_plus/
Disallow /fb/
Disallow /twitter_oauth/
Disallow /crons/
Disallow /SolrApi/
Disallow /config/
Disallow /api/
Disallow /classes/
Disallow /testEsi/
Disallow /HTML/
Disallow /captcha/
Disallow /ssonew/
Disallow /sso/
Disallow /search/*

screaming frog seo spider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.grazia.co.in/sitemap.xml
sitemap https://www.grazia.co.in/gImageSiteMap.xml
sitemap https://www.grazia.co.in/gNewsSiteMap.xml