luxtimes.lu
robots.txt

Robots Exclusion Standard data for luxtimes.lu

Archived Snapshots

Resource Scan

Scan Details

Site Domain	luxtimes.lu
Base Domain	luxtimes.lu
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-07-26T21:47:04+00:00
Next Scan	2025-10-24T21:47:04+00:00

Last Successful Scan

Scanned	2024-09-30T12:27:09+00:00
URL	https://luxtimes.lu/robots.txt
Redirect	https://www.luxtimes.lu/robots.txt
Redirect Domain	www.luxtimes.lu
Redirect Base	luxtimes.lu
Domain IPs	104.18.40.129, 172.64.147.127
Redirect IPs	104.18.40.129, 172.64.147.127
Response IP	172.64.147.127
Found	Yes
Hash	c30f777b06de550bed534b5eb30082b5f19a320ac0b9c0b69dfbd89506dd7578
SimHash	ea38b75186a5

Groups

*

Rule	Path
Allow	/
Allow	/tags
Disallow	/search/

Rule

Path

Allow

/tags

Disallow

/search/

googlebot-news

Rule	Path
Disallow	/sponsoredcontent/

Rule

Path

Disallow

/sponsoredcontent/

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

cohere-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

diffbot

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

omgili

Rule	Path
Disallow	/

Rule

Path

Disallow

omgilibot

Rule	Path
Disallow	/

Rule

Path

Disallow

perplexitybot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.luxtimes.lu/sitemap.xml
sitemap	https://www.luxtimes.lu/sitemap-image.xml
sitemap	https://www.luxtimes.lu/sitemap-news.xml
sitemap	https://www.luxtimes.lu/sitemap-video.xml

Field

Value

sitemap

https://www.luxtimes.lu/sitemap.xml

sitemap

https://www.luxtimes.lu/sitemap-image.xml

sitemap

https://www.luxtimes.lu/sitemap-news.xml

sitemap

https://www.luxtimes.lu/sitemap-video.xml

Comments

All copyrights, neighbouring rights and database rights in the content and layout of this website/app are explicitly reserved and are for personal, non-commercial use only.
In accordance with Article 4 of the Directive on Copyright in the Digital Single Market (CDSM) and its transposition into the law of the applicable Member State,
all content of this website on which it is made available is not to be used for the purposes of text and data mining, extraction, scraping and/or the use of programs or robots
for automatic data collection and/or extraction of digital data, whether for machine learning or artificial intelligence purposes or otherwise.
See also the Terms and Conditions of this website.
robots.txt prod luxtimes
Disallow Internal Search
Disallow Sponsored Articles for Google News
Disallow Large Language Models
list sitemaps

luxtimes.lurobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

googlebot-news

amazonbot

anthropic-ai

bytespider

ccbot

chatgpt-user

claudebot

claude-web

cohere-ai

diffbot

facebookbot

google-extended

gptbot

magpie-crawler

omgili

omgilibot

perplexitybot

Other Records

Comments

luxtimes.lu
robots.txt