pjharvey.net
robots.txt

Robots Exclusion Standard data for pjharvey.net

Resource Scan

Scan Details

Site Domain pjharvey.net
Base Domain pjharvey.net
Scan Status Ok
Last Scan 2024-05-29T05:37:56+00:00
Next Scan 2024-06-28T05:37:56+00:00

Last Scan

Scanned 2024-05-29T05:37:56+00:00
URL https://pjharvey.net/robots.txt
Domain IPs 104.21.17.70, 172.67.223.56, 2606:4700:3030::6815:1146, 2606:4700:3036::ac43:df38
Response IP 104.21.17.70
Found Yes
Hash bf896595be6d7457842025bfe4febd3690e931e493b28fb532542fd52714f17c
SimHash 7018db40a2a0
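
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say whether the body was normalized before hashing. Under that assumption, a minimal sketch for comparing a locally fetched copy of the file against the reported value could look like this. The helper names sha256_hex and matches_reported_hash are hypothetical, and fetching the file is left to the caller.

```python
import hashlib

# Value copied from the "Hash" row above.
# ASSUMPTION: it is a SHA-256 hex digest of the raw response body.
REPORTED_HASH = "bf896595be6d7457842025bfe4febd3690e931e493b28fb532542fd52714f17c"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare the digest of a fetched body with the reported hash (hypothetical helper)."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would not by itself prove the file changed; it could also mean the scanner hashes a normalized form of the file or uses a different algorithm.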

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/

adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
googleother
gptbot
imagesiftbot
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot

Rule Path
Disallow /
Allow /wp-admin/admin-ajax.php
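
To illustrate how the two groups above combine, here is a minimal sketch of the longest-match precedence described in RFC 9309: the most specific matching pattern wins, and Allow wins a tie. It is not the scanner's own logic. The GROUPS table, pattern_to_regex, and is_allowed are hypothetical names, and user-agent selection is simplified to an exact lowercase lookup rather than full product-token matching.

```python
import re

# The 25 user agents that share the second group above.
BLOCKED_AGENTS = """
adsbot-google amazonbot anthropic-ai applebot awariorssbot awariosmartbot
bytespider ccbot chatgpt-user claudebot claude-web cohere-ai dataforseobot
facebookbot google-extended googleother gptbot imagesiftbot magpie-crawler
omgili omgilibot peer39_crawler peer39_crawler/1.0 perplexitybot youbot
""".split()

DEFAULT_RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/?s="),
    ("disallow", "/page/*/?s="),
    ("disallow", "/search/"),
]
BLOCKED_RULES = [
    ("disallow", "/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

GROUPS = {"*": DEFAULT_RULES}
GROUPS.update({agent: BLOCKED_RULES for agent in BLOCKED_AGENTS})


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, optional trailing '$') to a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent, path):
    """Return True if `agent` may fetch `path` (path plus query string)."""
    # SIMPLIFICATION: exact lowercase lookup, falling back to the '*' group.
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed.
    return True if best is None else best[1]
```

Under this sketch, is_allowed("gptbot", "/") is False while is_allowed("gptbot", "/wp-admin/admin-ajax.php") is True, because the longer Allow pattern overrides Disallow /. An agent not in the list falls back to the * group, where only /wp-admin/, /search/ and the search query patterns are blocked. Parsers that apply the first matching rule instead of the longest one can give a different answer for the admin-ajax.php path.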

Other Records

Field Value
sitemap https://pjharvey.net/sitemap.xml