www.lighting.philips.com
robots.txt

Robots Exclusion Standard data for www.lighting.philips.com

Resource Scan

Scan Details

Site Domain www.lighting.philips.com
Base Domain philips.com
Scan Status Ok
Last Scan2024-05-25T01:18:12+00:00
Next Scan 2024-06-24T01:18:12+00:00

Last Scan

Scanned2024-05-25T01:18:12+00:00
URL https://www.lighting.philips.com/robots.txt
Domain IPs 23.59.168.130
Response IP 23.59.168.130
Found Yes
Hash 9f895e52d3ab36dc788d0b8f1fe218ea54f8da32d5bf31a6e443b2aa1b1bfa14
SimHash 38447a3045b3

Groups

*

Rule Path
Disallow /*.eps$
Disallow /*.zip$
Disallow /*.doc$
Disallow /*.docx$
Disallow /*.ULD$
Disallow /*.IES$
Disallow /content/B2B_LI*
Disallow /*?product*
Disallow /*Energy_Efficiency_Label*

elastic-crawler

Rule Path
Disallow /consumer
Allow /

iss_crawler_v1

Rule Path
Allow /consumer
Disallow /prof
Disallow /content/B2B_LI

Other Records

Field Value
sitemap https://www.lighting.philips.com/sitemap-b2b-philips-lighting-aa-index-sitemapindex_en_AA.xml

Comments

  • 3-2024 change date
  • extensions
  • content paths
  • Allow Elastic Crawler;
  • Allow ISS_CRAWLER_V1