getrichslowly.org
robots.txt

Robots Exclusion Standard data for getrichslowly.org

Resource Scan

Scan Details

Site Domain getrichslowly.org
Base Domain getrichslowly.org
Scan Status Ok
Last Scan2024-11-15T21:52:58+00:00
Next Scan 2024-11-22T21:52:58+00:00

Last Scan

Scanned2024-11-15T21:52:58+00:00
URL https://getrichslowly.org/robots.txt
Domain IPs 104.21.12.56, 172.67.193.177, 2606:4700:3034::ac43:c1b1, 2606:4700:3036::6815:c38
Response IP 172.67.193.177
Found Yes
Hash a4f5abd0d51b8d8cdc2576917d24d985c41bcc5df8ec97b50e47a7a3211a36bd
SimHash 7620191183a2

Groups

*

Rule Path
Disallow /xmlrpc.php
Disallow /go/
Disallow /amazon/
Disallow /link-not-found/
Allow /wp-admin/admin-ajax.php

ai2bot
ai2bot-dolma
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
dataforseobot
diffbot
duckassistbot
facebookbot
facebookexternalhit
friendlycrawler
google-cloudvertexbot
google-extended
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
magpie-crawler
meta-externalagent
meta-externalfetcher
newsnow
news-please
oai-searchbot
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
petalbot
quora-bot
scrapy
sidetrade indexer bot
timpibot
turnitinbot
velenpublicwebcrawler
webzio-extended
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.getrichslowly.org/sitemap_index.xml

Comments

  • Directory and File Rules
  • Disallow Bots
  • Sitemap