anemoon.org
robots.txt

Robots Exclusion Standard data for anemoon.org

Resource Scan

Scan Details

Site Domain anemoon.org
Base Domain anemoon.org
Scan Status Ok
Last Scan2025-11-27T23:37:28+00:00
Next Scan 2025-12-27T23:37:28+00:00

Last Scan

Scanned2025-11-27T23:37:28+00:00
URL https://anemoon.org/robots.txt
Domain IPs 2a01:7c8:aabc:42e::1, 37.97.133.112
Response IP 37.97.133.112
Found Yes
Hash 60e781d5b8490203ac92f6b6f196c63685da40eb5a3f0bd8852ca5ef68b938e4
SimHash 831d0179cf85

Groups

*

Rule Path
Disallow /Flora-en-Fauna/Soortenoverzicht/
Disallow /beheer/
Disallow /App_Browsers/
Disallow /App_Code/
Disallow /App_Data/
Disallow /App_GlobalResources/
Disallow /bin/
Disallow /Components/
Disallow /Config/
Disallow /contest/
Disallow /controls/
Disallow /DesktopModules/
Disallow /Documentation/
Disallow /HttpModules/
Disallow /Install/
Disallow /Portals/
Disallow /Providers/
Disallow /Resources/ContentRotator/
Disallow /Resources/ControlPanel/
Disallow /Resources/Dashboard/
Disallow /Resources/FeedBrowser/
Disallow /Resources/OpenForceAd/
Disallow /Resources/Search/
Disallow /Resources/Shared/
Disallow /Resources/SkinWidgets/
Disallow /Resources/TabStrip/
Disallow /Resources/Widgets/
Disallow /User-Profile/
Disallow /beheer/
Disallow /publicaties/spuisluis/
Disallow /anm/

googlebot

Rule Path
Allow /*.js*
Allow /*.css*

amazonbot

Rule Path
Disallow /

amazon adbot

Rule Path
Disallow /