wrangler.in
robots.txt

Robots Exclusion Standard data for wrangler.in

Resource Scan

Scan Details

Site Domain wrangler.in
Base Domain wrangler.in
Scan Status Ok
Last Scan2024-06-24T05:38:25+00:00
Next Scan 2024-07-24T05:38:25+00:00

Last Scan

Scanned2024-06-24T05:38:25+00:00
URL https://wrangler.in/robots.txt
Redirect https://www.wrangler.in:443/robots.txt
Redirect Domain www.wrangler.in
Redirect Base wrangler.in
Domain IPs 35.244.129.183
Redirect IPs 108.156.133.119, 108.156.133.24, 108.156.133.49, 108.156.133.55, 2600:9000:2755:1000:4:2746:ca80:93a1, 2600:9000:2755:2000:4:2746:ca80:93a1, 2600:9000:2755:400:4:2746:ca80:93a1, 2600:9000:2755:5200:4:2746:ca80:93a1, 2600:9000:2755:5400:4:2746:ca80:93a1, 2600:9000:2755:6c00:4:2746:ca80:93a1, 2600:9000:2755:7000:4:2746:ca80:93a1, 2600:9000:2755:7a00:4:2746:ca80:93a1
Response IP 108.156.133.119
Found Yes
Hash 57e0a6319e5afae1b5f425cdf1705aabe057fb2c400f6ed1db8ba6b22a7de03f
SimHash 2814dc42c783

Groups

teoma

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

psbot

Rule Path
Disallow /

*

Rule Path
Disallow
Disallow /catalogsearch/
Disallow /api/v1/trk
Disallow /media/catalog/

Other Records

Field Value
crawl-delay 60

Other Records

Field Value
sitemap https://www.wrangler.in/sitemap.xml