criticalrace.org
robots.txt

Robots Exclusion Standard data for criticalrace.org

Resource Scan

Scan Details

Site Domain criticalrace.org
Base Domain criticalrace.org
Scan Status Ok
Last Scan 2024-09-20T08:34:54+00:00
Next Scan 2024-10-20T08:34:54+00:00

Last Scan

Scanned 2024-09-20T08:34:54+00:00
URL https://criticalrace.org/robots.txt
Domain IPs 209.133.206.132, 2604:4500:0:135::8451:8cc6
Response IP 209.133.206.132
Found Yes
Hash 03d19c6839d0475abca503406ca5af9b857ecb6d5536f742de3c60d77ece658a
SimHash 6b1cd072c698

Groups

*

Rule Path
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php

*

Rule Path
Disallow /wp/wp-comments-post.php
Disallow /app/plugins/
Disallow */archive/*
Disallow */feed/*
Disallow /search/
Disallow /account/*
Disallow /members/*
Disallow /groups/*
Disallow /profile/*
Disallow /forum/
Disallow /checkout/*
Disallow /checkouts/*
Disallow /cart/*

Other Records

Field Value
crawl-delay 1
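
How the Allow and Disallow rules in the two `*` groups interact can be sketched in a few lines of Python. This is a minimal illustration, assuming the longest-match precedence and `*` wildcard handling described in RFC 9309, and assuming the two `*` groups are merged into one rule set; individual crawlers may behave differently. The names `RULES`, `_pattern_to_regex`, and `is_allowed` are hypothetical helpers, not part of the scanned site or of any library.

```python
import re

# Rules copied from the two "*" groups in this scan, merged into one list
# (merging groups for the same user agent is an assumption based on RFC 9309).
RULES = [
    ("disallow", "/wp/wp-admin/"),
    ("allow", "/wp/wp-admin/admin-ajax.php"),
    ("disallow", "/wp/wp-comments-post.php"),
    ("disallow", "/app/plugins/"),
    ("disallow", "*/archive/*"),
    ("disallow", "*/feed/*"),
    ("disallow", "/search/"),
    ("disallow", "/account/*"),
    ("disallow", "/members/*"),
    ("disallow", "/groups/*"),
    ("disallow", "/profile/*"),
    ("disallow", "/forum/"),
    ("disallow", "/checkout/*"),
    ("disallow", "/checkouts/*"),
    ("disallow", "/cart/*"),
]


def _pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Without "$", the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the longest-match rule."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if _pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer patterns win; on a tie, prefer "allow".
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `/wp/wp-admin/admin-ajax.php` stays crawlable because the Allow pattern is longer than the Disallow pattern for `/wp/wp-admin/`, while any other path under `/wp/wp-admin/` is blocked.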

yandexbot
yandeximages
yandeximageresizer
ahrefsbot
seznambot
zoombot
seekrbot
the knowledge ai
blexbot
mojeekbot
megaindex.ru/2.0
seekportbot
seokicks
barkrowler
claudebot
python/3.8 aiohttp/3.9.5
python/3.9 aiohttp/3.9.4
amazonbot
mediatoolkitbot
yacybot
baiduspider
dataforseobot
paqlebot
trendictionbot
semrushbot
bytedance
bytespider
repolookoutbot
sogou web spider
censysinspect

Rule Path
Disallow /
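
The effect of the `Disallow /` group on the listed agents can be checked with Python's standard `urllib.robotparser`. The sketch below is illustrative only: `RECONSTRUCTED` is a partial, assumed reconstruction containing three of the agents listed above and one rule from the `*` group, not the site's actual file, and `may_fetch` and the agent token `examplebot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction (an assumption): only three of the listed agents and
# one "*" rule are included. The real file's layout is not known from the scan.
RECONSTRUCTED = """\
User-agent: claudebot
User-agent: bytespider
User-agent: semrushbot
Disallow: /

User-agent: *
Disallow: /search/
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED.splitlines())


def may_fetch(agent_token, path):
    """Return True if the given product token may fetch `path`."""
    return parser.can_fetch(agent_token, path)


if __name__ == "__main__":
    for agent in ("ClaudeBot", "Bytespider", "examplebot"):
        print(agent, may_fetch(agent, "/about/"))
```

Pass the crawler's product token (for example `ClaudeBot`) rather than a full User-Agent header, since the standard-library parser compares only the part before the first `/`. It also applies rules in file order and does not expand `*` inside paths, so its answers can differ from a longest-match crawler on the wildcard rules in this scan.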

Other Records

Field Value
sitemap https://criticalrace.org/sitemap.xml

Comments

  • Last updated: September 20, 2024 at 4:34am ET