lighting.com
robots.txt

Robots Exclusion Standard data for lighting.com

Resource Scan

Scan Details

Site Domain lighting.com
Base Domain lighting.com
Scan Status Ok
Last Scan2024-08-27T18:38:36+00:00
Next Scan 2024-09-26T18:38:36+00:00

Last Scan

Scanned2024-08-27T18:38:36+00:00
URL https://lighting.com/robots.txt
Redirect https://www.lighting.philips.com/robots.txt
Redirect Domain www.lighting.philips.com
Redirect Base philips.com
Domain IPs 54.74.135.134
Redirect IPs 96.17.96.16, 96.17.96.22
Response IP 23.50.232.241
Found Yes
Hash ae82197b5190c5f8ca6ac0a8e93b8fbbb4f665df2d59816ac70720907a2e4475
SimHash 38447a3045b3

Groups

*

Rule Path
Disallow /*.eps$
Disallow /*.zip$
Disallow /*.doc$
Disallow /*.docx$
Disallow /*.ULD$
Disallow /*.IES$
Disallow /content/B2B_LI*
Disallow /*?product*
Disallow /*Energy_Efficiency_Label*

elastic-crawler

Rule Path
Allow /

iss_crawler_v1

Rule Path
Allow /consumer
Disallow /prof
Disallow /content/B2B_LI

Other Records

Field Value
sitemap https://www.lighting.philips.com/sitemap-b2b-philips-lighting-aa-index-sitemapindex_en_AA.xml

Comments

  • 3-2024 change date
  • extensions
  • content paths
  • Allow Elastic Crawler;
  • Allow ISS_CRAWLER_V1