themoneypages.com
robots.txt

Robots Exclusion Standard data for themoneypages.com

Resource Scan

Scan Details

Site Domain themoneypages.com
Base Domain themoneypages.com
Scan Status Ok
Last Scan 2024-10-26T12:07:00+00:00
Next Scan 2024-11-02T12:07:00+00:00

Last Scan

Scanned 2024-10-26T12:07:00+00:00
URL https://themoneypages.com/robots.txt
Redirect https://www.themoneypages.com/robots.txt
Redirect Domain www.themoneypages.com
Redirect Base themoneypages.com
Domain IPs 52.31.232.116, 52.50.7.251
Redirect IPs 52.31.232.116, 52.50.7.251
Response IP 52.50.7.251
Found Yes
Hash 8035fe0f001300290c4a8bf533173056986e38f340e8c4f13c74349aa33ab566
SimHash 5354c170ed91
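
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes it is the SHA-256 of the raw robots.txt response body. Under that assumption, a fetched copy could be compared against the report roughly as follows; `sha256_hex` and `matches_report` are hypothetical helper names, not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "8035fe0f001300290c4a8bf533173056986e38f340e8c4f13c74349aa33ab566"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Check a fetched robots.txt body against the reported hash.

    Assumption: the report hashes the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch would not necessarily mean the file changed; the scanner may normalize the body before hashing, which this sketch does not attempt to reproduce.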

Groups

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

neticlebot

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

livelapbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin
Disallow /account/
Disallow /?orderby*
Disallow /?s=*
Disallow /news/email-template/*
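
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below is only an illustration: the robots.txt text is reconstructed from a few of the listed groups (the real file may differ in casing, order, and extra directives), the wildcard rules are left out because the standard-library parser matches plain path prefixes, and `is_allowed` and the `ExampleBot` agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from a subset of the groups in the report (an assumption,
# not a copy of the live file). Wildcard rules are intentionally omitted.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /wp-admin
Disallow: /account/
"""

BASE = "https://www.themoneypages.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if the reconstructed rules let user_agent fetch path."""
    return parser.can_fetch(user_agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot", "/"),
        ("ExampleBot", "/"),
        ("ExampleBot", "/wp-admin/options.php"),
        ("ExampleBot", "/account/login"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Named crawlers such as gptbot are blocked from the whole site, while any crawler without its own group falls through to the `*` group and is only kept out of the listed paths.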