into-led.com
robots.txt

Robots Exclusion Standard data for into-led.com

Resource Scan

Scan Details

Site Domain into-led.com
Base Domain into-led.com
Scan Status Ok
Last Scan2025-09-17T16:40:34+00:00
Next Scan 2025-10-17T16:40:34+00:00

Last Scan

Scanned2025-09-17T16:40:34+00:00
URL https://into-led.com/robots.txt
Redirect https://www.into-led.com/robots.txt
Redirect Domain www.into-led.com
Redirect Base into-led.com
Domain IPs 216.150.1.1
Redirect IPs 216.150.1.1, 216.150.16.1
Response IP 216.150.1.65
Found Yes
Hash d647f9f5a946a62fb44a6bbfef561f7070ab87bf3810b10e3f8c5ff868127270
SimHash 6b57d91bc011

Groups

*

Rule Path
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D%27
Disallow */search/?*
Disallow */account/*
Disallow */cart/*
Disallow */checkout/*

googlebot-image

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

baiduspider
cazoodlebot
fasterfox
teracent-feed-processing
jyxobot
mj12bot
shopwiki

Rule Path
Disallow /

ingrid
msnbot
slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.into-led.com/sitemap.xml