conmuchagula.com
robots.txt

Robots Exclusion Standard data for conmuchagula.com

Resource Scan

Scan Details

Site Domain conmuchagula.com
Base Domain conmuchagula.com
Scan Status Ok
Last Scan2024-11-16T19:48:37+00:00
Next Scan 2024-11-23T19:48:37+00:00

Last Scan

Scanned2024-11-16T19:48:37+00:00
URL https://conmuchagula.com/robots.txt
Domain IPs 35.214.195.209
Response IP 35.214.195.209
Found Yes
Hash 608b44f5a6e59546eb015c6794377df2801aa142cd6f5c67e25c7c65b2ff46e3
SimHash a1d95883cea3

Groups

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

gsa-crawler

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /wp-json/complianz/*
Disallow /trackback
Disallow */trackback
Disallow */comments
Disallow /*/comment-page-*
Disallow */imagenes/*
Disallow /?s=*
Disallow /category/*/*
Disallow */hemeroteca/*

Other Records

Field Value
sitemap https://www.conmuchagula.com/sitemap.xml