rmjohnson.com
robots.txt

Robots Exclusion Standard data for rmjohnson.com

Resource Scan

Scan Details

Site Domain rmjohnson.com
Base Domain rmjohnson.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-09-25T11:46:43+00:00
Next Scan 2025-12-24T11:46:43+00:00

Last Successful Scan

Scanned 2025-02-28T03:41:35+00:00
URL https://rmjohnson.com/robots.txt
Redirect https://www.rmjohnson.com/robots.txt
Redirect Domain www.rmjohnson.com
Redirect Base rmjohnson.com
Domain IPs 45.33.7.253
Redirect IPs 45.33.7.253
Response IP 45.33.7.253
Found Yes
Hash d0c345bd77bd827a2c1570eeaf51762070810eeafeecad52a233337f5657f88f
SimHash 6a14d04e5697

Groups

aa-site-audit-crawler

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

*

Rule Path
Disallow /account
Disallow /cart
Disallow /checkout
Disallow /jewelrybox
Disallow /newsletter
Disallow /search

Other Records

Field Value
crawl-delay 10
sitemap https://www.rmjohnson.com/sitemap.xml

Comments

  • Slow down bots
  • Disable certain pages
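
The groups and records listed above can be checked with Python's standard urllib.robotparser. The following is a minimal sketch, not the site's actual file: the robots.txt text is rebuilt from the scan data, only a subset of the blocked agents is included for brevity, and placing crawl-delay inside the "*" group is an assumption because the report lists it separately under Other Records. "ExampleBot" and the /products/rings path are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Subset of the agents that the scan lists with a blanket "Disallow /".
BLOCKED_AGENTS = ["gptbot", "claudebot", "ccbot", "bytespider", "mj12bot"]

# Paths the scan lists under the "*" group.
DEFAULT_DISALLOWED = [
    "/account", "/cart", "/checkout", "/jewelrybox", "/newsletter", "/search",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt; the real file's layout is unknown."""
    lines = []
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # Assumption: crawl-delay belongs to the "*" group.
    lines += ["User-agent: *", "Crawl-delay: 10"]
    lines += [f"Disallow: {path}" for path in DEFAULT_DISALLOWED]
    lines += ["", "Sitemap: https://www.rmjohnson.com/sitemap.xml"]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

BASE = "https://www.rmjohnson.com"

# A fully blocked agent may not fetch anything.
gptbot_home = parser.can_fetch("GPTBot", BASE + "/")

# "ExampleBot" is a hypothetical crawler that falls through to the "*" group.
other_cart = parser.can_fetch("ExampleBot", BASE + "/cart")
other_products = parser.can_fetch("ExampleBot", BASE + "/products/rings")

print("GPTBot /:", gptbot_home)
print("ExampleBot /cart:", other_cart)
print("ExampleBot /products/rings:", other_products)
print("ExampleBot crawl delay:", parser.crawl_delay("ExampleBot"))
print("Sitemaps:", parser.site_maps())  # site_maps() needs Python 3.8+
```

Note that urllib.robotparser matches rule paths by prefix and agent names by case-insensitive substring, so other parsers may interpret edge cases differently.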