puntoblog.it
robots.txt

Robots Exclusion Standard data for puntoblog.it

Resource Scan

Scan Details

Site Domain puntoblog.it
Base Domain puntoblog.it
Scan Status Ok
Last Scan2024-11-15T23:58:20+00:00
Next Scan 2024-11-22T23:58:20+00:00

Last Scan

Scanned2024-11-15T23:58:20+00:00
URL https://puntoblog.it/robots.txt
Domain IPs 37.59.148.211
Response IP 37.59.148.211
Found Yes
Hash c707c8b75508fb162a2cdafe611c1c0937b1d3d4701f286aed87dc3e672cec80
SimHash ac59488ac6b7

Groups

*

Rule Path
Disallow /wp-
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-content/
Allow /wp-content/uploads
Disallow /tag/
Disallow /author
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow */trackback/
Allow /*?*
Allow /*?

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

teleport

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.puntoblog.it/sitemap_index.xml