wort.lu
robots.txt

Robots Exclusion Standard data for wort.lu

Resource Scan

Scan Details

Site Domain wort.lu
Base Domain wort.lu
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-27T10:44:45+00:00
Next Scan 2024-11-26T10:44:45+00:00

Last Successful Scan

Scanned2024-09-28T10:43:04+00:00
URL https://wort.lu/robots.txt
Redirect https://www.wort.lu/robots.txt
Redirect Domain www.wort.lu
Redirect Base wort.lu
Domain IPs 104.18.40.193, 172.64.147.63
Redirect IPs 104.18.40.193, 172.64.147.63
Response IP 104.18.40.193
Found Yes
Hash 7f9b9c38a1b1b5e2c1d2f6d62984c24f44b8d2088b5ae686723c470d357e8b84
SimHash ea7897518fe4

Groups

*

Rule Path
Allow /
Allow /tags
Disallow /Suche/

googlebot-news

Rule Path
Disallow /sponsoredcontent/

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.wort.lu/sitemap.xml
sitemap https://www.wort.lu/sitemap-image.xml
sitemap https://www.wort.lu/sitemap-news.xml
sitemap https://www.wort.lu/sitemap-video.xml

Comments

  • All copyrights, neighbouring rights and database rights in the content and layout of this website/app are explicitly reserved and are for personal, non-commercial use only.
  • In accordance with Article 4 of the Directive on Copyright in the Digital Single Market (CDSM) and its transposition into the law of the applicable Member State,
  • all content of this website on which it is made available is not to be used for the purposes of text and data mining, extraction, scraping and/or the use of programs or robots
  • for automatic data collection and/or extraction of digital data, whether for machine learning or artificial intelligence purposes or otherwise.
  • See also the Terms and Conditions of this website.
  • robots.txt prod Wort
  • Disallow Internal Search
  • Disallow Sponsored Articles for Google News
  • Disallow Large Language Models
  • list sitemaps