catolicus.com
robots.txt

Robots Exclusion Standard data for catolicus.com

Resource Scan

Scan Details

Site Domain catolicus.com
Base Domain catolicus.com
Scan Status Ok
Last Scan 2024-09-28T21:08:29+00:00
Next Scan 2024-10-05T21:08:29+00:00
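
The two timestamps above imply a fixed rescan interval. As a small worked check, the sketch below parses both values with Python's standard library and subtracts them. The variable names are illustrative only, and nothing here is claimed about how the scanner actually schedules its runs.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-09-28T21:08:29+00:00")
next_scan = datetime.fromisoformat("2024-10-05T21:08:29+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval)  # 7 days, 0:00:00
```

Both values carry an explicit +00:00 offset, so the subtraction is done on timezone-aware datetimes and the result is exactly one week.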

Last Scan

Scanned 2024-09-28T21:08:29+00:00
URL https://catolicus.com/robots.txt
Domain IPs 23.108.55.81
Response IP 23.108.55.81
Found Yes
Hash e271f0c53ff58b6ee02697bfdcaa3911b9b3a5a98ee081df5cbfe26dbdcbf757
SimHash 7b568250c830

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
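
To illustrate how the two rules in the * group interact, the sketch below rebuilds that group as robots.txt lines and queries it with Python's urllib.robotparser. The lines are reconstructed from the table above, so the live file may differ in ordering or whitespace. "ExampleBot" is a hypothetical crawler name, and build_parser is a helper introduced only for this example.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table; not a verbatim copy of the live file.
WILDCARD_GROUP = [
    "User-agent: *",
    "Allow: /wp-admin/admin-ajax.php",
    "Disallow: /wp-admin/",
]

def build_parser(lines):
    """Parse robots.txt lines without making a network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser

BASE = "https://catolicus.com"
parser = build_parser(WILDCARD_GROUP)

# "ExampleBot" is hypothetical; it falls through to the * group.
ajax_ok = parser.can_fetch("ExampleBot", BASE + "/wp-admin/admin-ajax.php")
admin_ok = parser.can_fetch("ExampleBot", BASE + "/wp-admin/options.php")
home_ok = parser.can_fetch("ExampleBot", BASE + "/")
print(ajax_ok, admin_ok, home_ok)  # True False True
```

Python's parser applies the first matching rule, whereas RFC 9309 prefers the longest matching path. Here both approaches agree, because the Allow rule is listed first and is also the longer match.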

compspybot
curious george
cybeye.com
docomo
exb language crawler
ezooms
flamingo_searchengine
genieo
genio
lwnutch
lexxebot
openwebindex
rediffnewsbot
seoengworldbot
scanmine
screaming frog seo spider
shopwiki
showyoubot
sosospider
wocbot
yeti
youdaobot
daumoa
gsa-crawler
libcrawl
linkdex
magpie-crawler
repparser
rogerbot
sindice-site-manager
sogou spider
sogou
woriobot
yacybot
yolinkbot

Rule Path
Disallow /
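
The group above shares a single Disallow / rule across many user agents, which bars each of them from the whole site. The sketch below shows that effect with urllib.robotparser, using only a small sample of the listed agents. The robots.txt lines are reconstructed from the scan rather than copied from the live file, and "ExampleBot" is a hypothetical crawler that appears in no group.

```python
from urllib.robotparser import RobotFileParser

# A sample of the agents listed in the group; the full list is longer.
BLOCKED_SAMPLE = ["rogerbot", "yeti", "sogou spider", "screaming frog seo spider"]

# Reconstructed layout: the * group, a blank line, then the blocked group.
lines = [
    "User-agent: *",
    "Allow: /wp-admin/admin-ajax.php",
    "Disallow: /wp-admin/",
    "",
]
lines += [f"User-agent: {name}" for name in BLOCKED_SAMPLE]
lines += ["Disallow: /"]

parser = RobotFileParser()
parser.parse(lines)

HOME = "https://catolicus.com/"

# Each sampled agent is refused even the home page.
blocked = {name: not parser.can_fetch(name, HOME) for name in BLOCKED_SAMPLE}

# A crawler not named in the group falls back to the * rules.
other_ok = parser.can_fetch("ExampleBot", HOME)
print(blocked, other_ok)
```

Consecutive User-agent lines followed by one rule set form a single group, which is why one Disallow line is enough for every agent listed.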

Other Records

Field Value
sitemap https://catolicus.com/sitemap_index.xml
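
The sitemap record applies to the whole file rather than to a user-agent group. As a minimal sketch, it can be read back with RobotFileParser.site_maps(), which is available in Python 3.8 and later. The single input line is reconstructed from the table above.

```python
from urllib.robotparser import RobotFileParser

parser = RobotFileParser()
# Reconstructed record; the live file also contains the rule groups.
parser.parse(["Sitemap: https://catolicus.com/sitemap_index.xml"])

# site_maps() returns None when no Sitemap record is present.
sitemaps = parser.site_maps() or []
print(sitemaps)  # ['https://catolicus.com/sitemap_index.xml']
```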