de-academic.com
robots.txt

Robots Exclusion Standard data for de-academic.com

Resource Scan

Scan Details

Site Domain de-academic.com
Base Domain de-academic.com
Scan Status Ok
Last Scan2024-11-12T16:21:09+00:00
Next Scan 2024-11-19T16:21:09+00:00

Last Scan

Scanned2024-11-12T16:21:09+00:00
URL https://de-academic.com/robots.txt
Domain IPs 2a01:4f9:c01e:78::1, 95.217.170.197
Response IP 95.217.170.197
Found Yes
Hash 4e91be44e39877dd454621ad216087fa1b59ccbf43275e8884e64c9eaf15ad11
SimHash 527ccc62eb88

Groups

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 0.1

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

detectify

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

knowledge ai

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow *

Other Records

Field Value
sitemap https://de-academic.com/sitemaps/de-academic.com/sitemaps_index.xml