blogdeculturilla.com
robots.txt

Robots Exclusion Standard data for blogdeculturilla.com

Resource Scan

Scan Details

Site Domain blogdeculturilla.com
Base Domain blogdeculturilla.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-10-07T20:40:04+00:00
Next Scan 2024-12-06T20:40:04+00:00

Last Successful Scan

Scanned2024-08-09T20:39:01+00:00
URL https://blogdeculturilla.com/robots.txt
Domain IPs 149.202.147.247
Response IP 149.202.147.247
Found Yes
Hash 883e9a23d51b375f1c40bc8bf3580c5e6583c1f05f54dbe6bfb1f14fab0aa1ab
SimHash c85a5b808ff3

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /author/
Allow /wp-admin/admin-ajax.php

unisterbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /

Other Records

Field Value
sitemap https://blogdeculturilla.com/sitemap.xml

Warnings

  • 2 invalid lines.