lightbulbs.com
robots.txt

Robots Exclusion Standard data for lightbulbs.com

Resource Scan

Scan Details

Site Domain lightbulbs.com
Base Domain lightbulbs.com
Scan Status Ok
Last Scan2024-06-20T05:13:23+00:00
Next Scan 2024-07-20T05:13:23+00:00

Last Scan

Scanned2024-06-20T05:13:23+00:00
URL https://lightbulbs.com/robots.txt
Redirect https://www.lightbulbs.com/robots.txt
Redirect Domain www.lightbulbs.com
Redirect Base lightbulbs.com
Domain IPs 3.141.131.123
Redirect IPs 3.141.131.123
Response IP 3.141.131.123
Found Yes
Hash 4eb06e019958ee8915c52371175226defaa08d9a48dcf761e87cb5fd28590d02
SimHash 093cebd8a4d3

Groups

*

Rule Path
Disallow /search/
Disallow /category/*%26filter_manufacturer*

googlebot

Rule Path
Disallow /*source%3Dshoppingcse*
Disallow /*source%3DNexTagCSE*
Disallow /brand/philips-light-bulbs/?source=GooglePPC-Philips

ccbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

exabot/3.0

Rule Path
Disallow /

fatbot

Rule Path
Disallow /

fatbot 2.0

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

sogou web spider/4.0

Rule Path
Disallow /

sqlmap/1.0-dev

Rule Path
Disallow /

waypart-bot/2.1

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

Warnings

  • 2 invalid lines.