4media.com
robots.txt

Robots Exclusion Standard data for 4media.com

Resource Scan

Scan Details

Site Domain 4media.com
Base Domain 4media.com
Scan Status Ok
Last Scan2024-06-29T12:29:23+00:00
Next Scan 2024-07-06T12:29:23+00:00

Last Scan

Scanned2024-06-29T12:29:23+00:00
URL https://4media.com/robots.txt
Redirect https://www.4media.com/robots.txt
Redirect Domain www.4media.com
Redirect Base 4media.com
Domain IPs 104.21.73.34, 172.67.157.113, 2606:4700:3031::ac43:9d71, 2606:4700:3035::6815:4922
Redirect IPs 104.21.73.34, 172.67.157.113, 2606:4700:3031::ac43:9d71, 2606:4700:3035::6815:4922
Response IP 172.67.157.113
Found Yes
Hash 4e15acf6929d2dab0ad6a1753ff96ad86a0f673993d1e5dc9ae31557f8029790
SimHash 2c9108d2a993

Groups

*

Rule Path
Disallow /companies/
Disallow /signin/
Disallow /firmy/
Disallow /wydarzenia/
Disallow /events/
Disallow /ogloszenia/
Disallow /classifieds/
Disallow /nekrologi/
Disallow /obituaries/
Disallow /konto/
Disallow /account/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

copilot

Rule Path
Disallow /