forum.clubalfa.it
robots.txt

Robots Exclusion Standard data for forum.clubalfa.it

Resource Scan

Scan Details

Site Domain forum.clubalfa.it
Base Domain clubalfa.it
Scan Status Ok
Last Scan2024-11-05T03:34:19+00:00
Next Scan 2024-11-19T03:34:19+00:00

Last Scan

Scanned2024-11-05T03:34:19+00:00
URL https://forum.clubalfa.it/robots.txt
Domain IPs 176.9.25.79, 2a01:4f8:150:230a::4
Response IP 176.9.25.79
Found Yes
Hash d9cc9517a1f893d162da9c765467276157d6e59b84a69fd8f7d08dd226f5d684
SimHash a03179e8c6b3

Groups

mediapartners-google

Rule Path
Disallow

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

msnbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

msnbot

Rule Path
Disallow

bingbot

Rule Path
Disallow

proximic

Rule Path
Disallow /wp-admin/

grapeshot

Rule Path
Disallow

npbot

Rule Path
Disallow /

doc

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

zao

Rule Path
Disallow /

fetch

Rule Path
Disallow /

httrack

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linko

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

*

Rule Path
Disallow /login/
Disallow /account/
Disallow /admin.php
Disallow /weeklydigest/manage/
Disallow /events/birthdays/
Disallow /events/monthly
Disallow /events/weekly
Disallow /thanks.php
Disallow /search/
Disallow /members/
Disallow /*?simplePage=1
Disallow /uix/
Disallow /attachments?simplePage
Disallow /reply
Disallow */reactions

Other Records

Field Value
sitemap https://forum.clubalfa.it/sitemap.php

Comments

  • Google AdSense
  • Google AdS-bot
  • Google Image
  • Google mobile
  • Google News
  • Hits many times per second, not acceptable.
  • http://www.nameprotect.com/botinfo.html
  • A list of misbehaving crawlers.
  • Some bots are known to be trouble, particularly those designed to copy entire sites.