activeplanet.com
robots.txt

Robots Exclusion Standard data for activeplanet.com

Resource Scan

Scan Details

Site Domain activeplanet.com
Base Domain activeplanet.com
Scan Status Ok
Last Scan2025-09-21T19:56:16+00:00
Next Scan 2025-10-21T19:56:16+00:00

Last Scan

Scanned2025-09-21T19:56:16+00:00
URL https://activeplanet.com/robots.txt
Domain IPs 104.21.51.224, 172.67.189.2, 2606:4700:3030::6815:33e0, 2606:4700:3037::ac43:bd02
Response IP 172.67.189.2
Found Yes
Hash 201e0ea212aecedf5783c8f7a8813c27e6dc1b7a40cbf21cb212f202b7929834
SimHash 481905464697

Groups

*

Rule Path
Disallow /assets/components/
Allow /assets/components/pdotools/
Allow /assets/components/minifyx/
Allow /assets/components/ajaxform/
Disallow /assets/console_script/
Disallow /assets/active_cyprus/elements/
Disallow /core/
Disallow /connectors/
Disallow /manager/
Disallow /index.php
Disallow /index
Disallow /?
Disallow /*?
Disallow /kalendar-sobyitij/
Disallow /kalendar-sobyitij/*
Disallow /mediczina/
Disallow /mediczina/*
Disallow /en/calendar-of-events/
Disallow /en/calendar-of-events/*
Disallow /en/medicine/
Disallow /en/medicine/*

yanga

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

dolphin

Rule Path
Disallow /

Other Records

Field Value
sitemap https://activeplanet.com/sitemap.xml
sitemap https://activeplanet.com/en/sitemap.xml
sitemap https://activeplanet.com/sitemap-v1.xml

Warnings

  • `host` is not a known field.