patterico.com
robots.txt

Robots Exclusion Standard data for patterico.com

Resource Scan

Scan Details

Site Domain patterico.com
Base Domain patterico.com
Scan Status Ok
Last Scan2024-05-25T02:49:52+00:00
Next Scan 2024-06-24T02:49:52+00:00

Last Scan

Scanned2024-05-25T02:49:52+00:00
URL https://patterico.com/robots.txt
Domain IPs 209.133.206.134, 2604:4500:0:135::1d
Response IP 209.133.206.134
Found Yes
Hash dedd49f3e808839bcf3a0f2baec2a9b69803f92666dc1b2e7573d04c89a34b0b
SimHash 691dd8704619

Groups

*

Rule Path
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php

*

Rule Path
Disallow /wp/wp-comments-post.php
Disallow /app/plugins/
Disallow */page/*
Disallow */archive/*
Disallow */feed/*
Disallow /search/
Disallow /account/
Disallow /members/
Disallow /groups
Disallow /profile/
Disallow /forum/
Disallow /checkout/
Disallow /checkouts/
Disallow /cart/

Other Records

Field Value
crawl-delay 1

yandexbot
yandeximages
yandeximageresizer
ahrefsbot
seznambot
zoombot
seekrbot
the knowledge ai
blexbot
mojeekbot
megaindex.ru/2.0
seekportbot
seokicks
barkrowler
claudebot
python/3.8 aiohttp/3.9.5
python/3.9 aiohttp/3.9.4
amazonbot
mediatoolkitbot
yacybot
baiduspider
dataforseobot
paqlebot
trendictionbot
semrushbot
bytedance
bytespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://patterico.com/sitemap.xml

Comments

  • Last updated: May 24, 2024 at 7:33pm ET