indianretailer.com
robots.txt

Robots Exclusion Standard data for indianretailer.com

Resource Scan

Scan Details

Site Domain indianretailer.com
Base Domain indianretailer.com
Scan Status Ok
Last Scan 2024-06-23T15:58:49+00:00
Next Scan 2024-06-30T15:58:49+00:00

Last Scan

Scanned 2024-06-23T15:58:49+00:00
URL https://indianretailer.com/robots.txt
Redirect https://www.indianretailer.com/robots.txt
Redirect Domain www.indianretailer.com
Redirect Base indianretailer.com
Domain IPs 104.21.83.134, 172.67.176.156, 2606:4700:3031::ac43:b09c, 2606:4700:3035::6815:5386
Redirect IPs 104.21.83.134, 172.67.176.156, 2606:4700:3031::ac43:b09c, 2606:4700:3035::6815:5386
Response IP 172.67.176.156
Found Yes
Hash 3cdfd90dc1514d166bf632a2ed9b0a72589bbfa252960280f64b0f55ffe2bb8b
SimHash 295c3c44e2b2

Groups

*

Rule Path
Disallow
Disallow /profiles/
Disallow /public/
Disallow /vendor/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /article/*/*/*?page=
Disallow /article/*/*/*?page=*
Disallow /news/*?page=
Disallow /news/*?page=*
Disallow /glossary/*?page=
Disallow /glossary/*?page=*
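
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how a crawler might test a URL path against this group, assuming the matching semantics of RFC 9309 (rules are prefix matches, `*` matches any run of characters, a trailing `$` anchors the end). The names `DISALLOW`, `rule_to_regex`, and `is_disallowed` are illustrative and not part of the scan data; the rule with an empty path is left out of the list because an empty Disallow value blocks nothing.

```python
import re

# Disallow paths copied from the "*" group above.
# The rule with an empty path is omitted: it does not block any URL.
DISALLOW = [
    "/profiles/", "/public/", "/vendor/", "/README.txt", "/web.config",
    "/admin/", "/comment/reply/", "/filter/tips", "/node/add/", "/search/",
    "/user/register", "/user/password", "/user/login", "/user/logout",
    "/article/*/*/*?page=", "/article/*/*/*?page=*",
    "/news/*?page=", "/news/*?page=*",
    "/glossary/*?page=", "/glossary/*?page=*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex (hypothetical helper)."""
    # A trailing '$' anchors the rule to the end of the path.
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal text and let each '*' match any run of characters.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    # re.match anchors at the start only, which gives prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/admin/config", "/news/retail?page=2", "/news/retail"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, each `?page=*` rule matches exactly the same paths as its `?page=` counterpart, because a prefix match already accepts anything after `?page=`. The sketch only covers Disallow rules; a full implementation would also handle Allow rules and longest-match precedence.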

Other Records

Field Value
sitemap https://www.indianretailer.com/sitemap.xml
sitemap https://www.indianretailer.com/articles/sitemap.xml
sitemap https://www.indianretailer.com/sitemap_news.xml
sitemap https://www.indianretailer.com/sitemap_news_page.xml

Comments

  • Directories
  • Files
  • Paths (clean URLs)