luxtimes.lu
robots.txt

Robots Exclusion Standard data for luxtimes.lu

Resource Scan

Scan Details

Site Domain luxtimes.lu
Base Domain luxtimes.lu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-07-26T21:47:04+00:00
Next Scan 2025-10-24T21:47:04+00:00

Last Successful Scan

Scanned2024-09-30T12:27:09+00:00
URL https://luxtimes.lu/robots.txt
Redirect https://www.luxtimes.lu/robots.txt
Redirect Domain www.luxtimes.lu
Redirect Base luxtimes.lu
Domain IPs 104.18.40.129, 172.64.147.127
Redirect IPs 104.18.40.129, 172.64.147.127
Response IP 172.64.147.127
Found Yes
Hash c30f777b06de550bed534b5eb30082b5f19a320ac0b9c0b69dfbd89506dd7578
SimHash ea38b75186a5

Groups

*

Rule Path
Allow /
Allow /tags
Disallow /search/

googlebot-news

Rule Path
Disallow /sponsoredcontent/

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.luxtimes.lu/sitemap.xml
sitemap https://www.luxtimes.lu/sitemap-image.xml
sitemap https://www.luxtimes.lu/sitemap-news.xml
sitemap https://www.luxtimes.lu/sitemap-video.xml

Comments

  • All copyrights, neighbouring rights and database rights in the content and layout of this website/app are explicitly reserved and are for personal, non-commercial use only.
  • In accordance with Article 4 of the Directive on Copyright in the Digital Single Market (CDSM) and its transposition into the law of the applicable Member State,
  • all content of this website on which it is made available is not to be used for the purposes of text and data mining, extraction, scraping and/or the use of programs or robots
  • for automatic data collection and/or extraction of digital data, whether for machine learning or artificial intelligence purposes or otherwise.
  • See also the Terms and Conditions of this website.
  • robots.txt prod luxtimes
  • Disallow Internal Search
  • Disallow Sponsored Articles for Google News
  • Disallow Large Language Models
  • list sitemaps