manyfoto.com
robots.txt

Robots Exclusion Standard data for manyfoto.com

Resource Scan

Scan Details

Site Domain manyfoto.com
Base Domain manyfoto.com
Scan Status Ok
Last Scan2025-05-25T06:15:26+00:00
Next Scan 2025-06-24T06:15:26+00:00

Last Scan

Scanned2025-05-25T06:15:26+00:00
URL https://manyfoto.com/robots.txt
Domain IPs 104.21.27.218, 172.67.169.198, 2606:4700:3031::ac43:a9c6, 2606:4700:3036::6815:1bda
Response IP 104.21.27.218
Found Yes
Hash f366434985268ab85ac836743bd41f6a492fe5c6a1719a34b49ae2b4f2340d23
SimHash 564541f0c783

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

npbot-1/2.0

Rule Path
Disallow /

npbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

linespider

Rule Path
Disallow /

infotigerbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

admantx

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

amazon-kendra-web-crawler-*

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

*

Rule Path
Disallow /pub/
Disallow /inc/
Disallow /js/
Disallow /policies/