corsibarman.org
robots.txt

Robots Exclusion Standard data for corsibarman.org

Resource Scan

Scan Details

Site Domain corsibarman.org
Base Domain corsibarman.org
Scan Status Ok
Last Scan2026-03-05T22:06:51+00:00
Next Scan 2026-04-04T22:06:51+00:00

Last Scan

Scanned2026-03-05T22:06:51+00:00
URL https://corsibarman.org/robots.txt
Domain IPs 31.11.35.183
Response IP 31.11.35.183
Found Yes
Hash 217b829ff9a98ecc55d7ff08048c272712c6a475e23238625a3e9ed86d536246
SimHash 6a45dd10c551

Groups

*

Rule Path
Allow /

googlebot
googlebot per smartphone

Rule Path
Allow /
Allow googlebot

*

Rule Path
Disallow /
Allow /calendar/$
Allow /calendar/about/
Allow /calendar$

*

Rule Path
Allow /calendar/$
Allow /calendar/about/
Allow /calendar$

*

Rule Path
Allow /
Allow googlebot

ia_archiver
teoma
msnbot
slurp
abachobot
fireball
voilabot
yandex

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.corsibarman.org/sitemap.xml
sitemap https://www.google.com/calendar/about/corsibarman-org.xml
sitemap https://www.google.com/calendar/about/corsibarman-org.xml

Comments

  • Google
  • Allow all
  • Alexa
  • Ask
  • MSN
  • Yahoo!
  • Abacho
  • Fireball
  • Voila.fr
  • Yandex
  • Others

Warnings

  • 4 invalid lines.