indianretailer.com
robots.txt

Robots Exclusion Standard data for indianretailer.com

Resource Scan

Scan Details

Site Domain indianretailer.com
Base Domain indianretailer.com
Scan Status Ok
Last Scan 2024-11-11T02:55:06+00:00
Next Scan 2024-11-18T02:55:06+00:00

Last Scan

Scanned 2024-11-11T02:55:06+00:00
URL https://indianretailer.com/robots.txt
Redirect https://www.indianretailer.com/robots.txt
Redirect Domain www.indianretailer.com
Redirect Base indianretailer.com
Domain IPs 104.21.83.134, 172.67.176.156, 2606:4700:3031::ac43:b09c, 2606:4700:3035::6815:5386
Redirect IPs 104.21.83.134, 172.67.176.156, 2606:4700:3031::ac43:b09c, 2606:4700:3035::6815:5386
Response IP 104.21.83.134
Found Yes
Hash d7612cbbc8cb224567205195273235fe8a6cc623f68d28797c592f58485adfd6
SimHash 2d5c3c4462b0

Groups

*

Rule Path
Disallow
Disallow /profiles/
Disallow /public/
Disallow /vendor/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /article/*/*/*?page=
Disallow /article/*/*/*?page=*
Disallow /news/*?page=
Disallow /news/*?page=*
Disallow /glossary/*?page=
Disallow /glossary/*?page=*
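
The wildcard rules above can be checked against sample paths with a small matcher. This is a minimal sketch, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, a rule matches as a prefix, and an empty Disallow restricts nothing. The function names and the sample paths are hypothetical, and only a subset of the listed rules is included. Under these assumptions a rule ending in `?page=*` matches the same paths as the one ending in `?page=`, so each such pair in the table looks redundant.

```python
import re

# Subset of the Disallow paths listed above (the empty string is the empty rule).
DISALLOW_RULES = [
    "",
    "/admin/",
    "/user/login",
    "/article/*/*/*?page=",
    "/news/*?page=",
    "/glossary/*?page=",
]


def rule_to_regex(rule):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape literal pieces and join them with ".*" in place of each "*".
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path, rules=DISALLOW_RULES):
    """Return True if any non-empty rule matches from the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule and rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for sample in ["/admin/settings", "/news/retail?page=2", "/news/retail"]:
        print(sample, is_disallowed(sample))
```

Note that this sketch only evaluates Disallow rules; a full evaluator would also weigh Allow rules and pick the longest match.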

Other Records

Field Value
sitemap https://www.indianretailer.com/sitemap.xml
sitemap https://www.indianretailer.com/articles/sitemap.xml
sitemap https://www.indianretailer.com/sitemap_news.xml
sitemap https://www.indianretailer.com/sitemap_news_page.xml
sitemap https://www.indianretailer.com/sitemap_1.xml
sitemap https://www.indianretailer.com/sitemap_2.xml
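
The sitemap records above can be read back out of a robots.txt body with Python's standard `urllib.robotparser`. This is a sketch under stated assumptions: the `ROBOTS_TXT` text is reconstructed from the records in this report and may not match the layout of the real file, and `site_maps()` requires Python 3.8 or later. No network request is made.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the records above; the real file's layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/

Sitemap: https://www.indianretailer.com/sitemap.xml
Sitemap: https://www.indianretailer.com/articles/sitemap.xml
Sitemap: https://www.indianretailer.com/sitemap_news.xml
Sitemap: https://www.indianretailer.com/sitemap_news_page.xml
Sitemap: https://www.indianretailer.com/sitemap_1.xml
Sitemap: https://www.indianretailer.com/sitemap_2.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse in-memory text, no fetch

# site_maps() returns a list of URLs, or None if no Sitemap lines were found.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```

The standard-library parser does not interpret `*` wildcards inside paths, so it is used here only to collect the sitemap URLs.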

Comments

  • Directories
  • Files
  • Paths (clean URLs)