ixtheo.de
robots.txt

Robots Exclusion Standard data for ixtheo.de

Resource Scan

Scan Details

Site Domain ixtheo.de
Base Domain ixtheo.de
Scan Status Ok
Last Scan 2024-09-28T16:22:17+00:00
Next Scan 2024-10-28T16:22:17+00:00

Last Scan

Scanned 2024-09-28T16:22:17+00:00
URL https://ixtheo.de/robots.txt
Domain IPs 134.2.5.111
Response IP 134.2.5.111
Found Yes
Hash fd01c92093d7c658b42e186e4ba4423ad36327602d2aec3ec9ceab2b64b74b2d
SimHash dd0ebc51e791

Groups

adscanner/
ahrefsbot
blexbot
bytespider
dotbot
imagesiftbot
mail.ru_bot/2.0
megaindex.ru
megaindex.com
semrushbot
semrushbot/2~bl
semrushbot-sa
seokicks-robot

Rule Path
Disallow /
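
To illustrate what this group means for a crawler, the following minimal sketch feeds a reconstructed fragment to Python's standard `urllib.robotparser`. The fragment text is an assumption (the report lists the parsed group, not the raw file), it includes only two of the listed agents, and the `/Record/12345` URL is a hypothetical path.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the group above; the raw file
# layout is not shown in the report.
ROBOTS_FRAGMENT = """\
User-agent: ahrefsbot
User-agent: semrushbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# Hypothetical record URL, used only for illustration.
url = "https://ixtheo.de/Record/12345"
blocked_ahrefs = not parser.can_fetch("AhrefsBot", url)
blocked_semrush = not parser.can_fetch("SemrushBot", url)
print(blocked_ahrefs, blocked_semrush)
```

Because the single rule is `Disallow /`, every path on the site is off limits to the agents in this group.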

claudebot
gptbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
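
A group that carries only a crawl-delay leaves all paths open but asks the named agents to pause between requests. As a sketch under the assumption that the raw file expresses this group roughly as shown, `urllib.robotparser` can read the delay back (the `/Search/Results` URL is hypothetical):

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the claudebot/gptbot group.
DELAY_FRAGMENT = """\
User-agent: claudebot
User-agent: gptbot
Crawl-delay: 1
"""

delay_parser = RobotFileParser()
delay_parser.parse(DELAY_FRAGMENT.splitlines())

# Seconds the agent is asked to wait between requests.
delay = delay_parser.crawl_delay("GPTBot")
# No Allow/Disallow rules in the group, so any path is permitted.
allowed = delay_parser.can_fetch("GPTBot", "https://ixtheo.de/Search/Results")
print(delay, allowed)
```

Note that `crawl-delay` is a widely used extension rather than part of the core Robots Exclusion Protocol, so whether a given crawler honors it depends on that crawler.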

*

Rule Path
Allow /AuthorityRecord/
Allow /Record/
Disallow /AJAX/
Disallow /Alphabrowse/
Disallow /Author/
Disallow /Authority/
Disallow /Browse/
Disallow /Combined/
Disallow /Feedback/
Disallow /FindbuchProxy/
Disallow /fulltextsnippetproxy/
Disallow /index.php/
Disallow /Keywordchainsearch/
Disallow /MyResearch/
Disallow /pdaproxy/
Disallow /proxy/
Disallow /RssFeed/
Disallow /Search/
Disallow /search/
Disallow /Search2/
Disallow /search2/
Disallow /Summon/
Disallow /SummonRecord/
Disallow /WikidataProxy/
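
The wildcard group applies to any agent that has no group of its own. The sketch below checks a few paths against a subset of these rules using `urllib.robotparser`; the fragment is an assumed reconstruction, `ExampleBot` is a made-up agent name, and the concrete URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of a subset of the wildcard group.
WILDCARD_FRAGMENT = """\
User-agent: *
Allow: /AuthorityRecord/
Allow: /Record/
Disallow: /MyResearch/
Disallow: /Search/
Disallow: /search/
"""

wildcard_parser = RobotFileParser()
wildcard_parser.parse(WILDCARD_FRAGMENT.splitlines())

base = "https://ixtheo.de"
checks = {
    path: wildcard_parser.can_fetch("ExampleBot", base + path)
    for path in ("/Record/12345", "/Search/Results", "/MyResearch/Home", "/")
}
for path, ok in checks.items():
    print(path, "allowed" if ok else "disallowed")
```

Paths are matched case-sensitively as prefixes, which is presumably why both `/Search/` and `/search/` appear in the list. A path that matches no rule, such as `/`, is allowed.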

Other Records

Field Value
sitemap https://ixtheo.de/sitemaps/ixtheo/sitemapIndex.xml
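
The sitemap record is independent of any user-agent group. As a small sketch, `urllib.robotparser` (Python 3.8 or later) exposes such records through `site_maps()`; the one-line fragment here is an assumed reconstruction of the record above.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the sitemap record.
SITEMAP_FRAGMENT = "Sitemap: https://ixtheo.de/sitemaps/ixtheo/sitemapIndex.xml\n"

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_FRAGMENT.splitlines())

# Returns a list of sitemap URLs, or None when the file declares none.
sitemaps = sitemap_parser.site_maps()
print(sitemaps)
```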