magicmaman.com
robots.txt

Robots Exclusion Standard data for magicmaman.com

Resource Scan

Scan Details

Site Domain magicmaman.com
Base Domain magicmaman.com
Scan Status Ok
Last Scan 2024-05-02T13:07:01+00:00
Next Scan 2024-05-09T13:07:01+00:00

Last Scan

Scanned 2024-05-02T13:07:01+00:00
URL https://magicmaman.com/robots.txt
Redirect https://www.magicmaman.com/robots.txt
Redirect Domain www.magicmaman.com
Redirect Base magicmaman.com
Domain IPs 195.200.116.195
Redirect IPs 195.200.116.192
Response IP 195.200.116.192
Found Yes
Hash 54ab5919a8bfe7a038d27d1ea7a1e425909814cf5a93ab21a91fff7c459e9b22
SimHash 0836d840e313
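
The Hash field above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so the sketch below assumes SHA-256 over the raw response body. `body_digest` and `matches_reported_hash` are hypothetical helper names, not part of any scanner API.

```python
import hashlib

# Value copied from the "Hash" field of the scan above.
REPORTED_HASH = "54ab5919a8bfe7a038d27d1ea7a1e425909814cf5a93ab21a91fff7c459e9b22"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes) -> bool:
    # Assumption: the scanner hashes the unmodified response bytes with SHA-256.
    return body_digest(body) == REPORTED_HASH
```

If the file has changed since the scan, or the scanner normalizes the body before hashing, the comparison would simply return False.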

Groups

*

Rule Path
Disallow /*?firstId=
Disallow /ope/
Disallow /adsite-under/
Disallow /recherche
Disallow /direct/
Disallow /photo/*/*/*
Disallow /search?
Disallow /search/
Disallow /sondage
Disallow /print/article/*
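
Several of the paths in the `*` group use the `*` wildcard. A minimal sketch of how such rules could be matched, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a prefix match. `rule_to_regex` and `is_disallowed` are hypothetical helpers, and no Allow rules are handled because the group lists none.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/*?firstId=",
    "/ope/",
    "/adsite-under/",
    "/recherche",
    "/direct/",
    "/photo/*/*/*",
    "/search?",
    "/search/",
    "/sondage",
    "/print/article/*",
]


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    # re.match anchors at the start, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW)
```

Under these assumptions, `is_disallowed("/photo/a/b/c")` is true while `is_disallowed("/photo/a/b")` is false, because the pattern requires three path segments after `/photo/`.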

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
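
The groups above can be checked with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text below is a reconstruction from this listing, not the file as served, so casing, ordering, and comments may differ from the original. `allowed` is a hypothetical helper. Note that `urllib.robotparser` does prefix matching only and does not interpret `*` inside paths, so the wildcard rules are left out of the examples.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed above (assumption: one
# "User-agent" line per group, one "Disallow" line per rule).
ROBOTS_TXT = """\
User-agent: *
Disallow: /*?firstId=
Disallow: /ope/
Disallow: /adsite-under/
Disallow: /recherche
Disallow: /direct/
Disallow: /photo/*/*/*
Disallow: /search?
Disallow: /search/
Disallow: /sondage
Disallow: /print/article/*

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://www.magicmaman.com" + path)
```

With this reconstruction, `allowed("GPTBot", "/")` is false because the `gptbot` group disallows everything, while a crawler with no dedicated group falls back to the `*` group and is only blocked on the listed paths.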