mazilla.ph
robots.txt

Robots Exclusion Standard data for mazilla.ph

Resource Scan

Scan Details

Site Domain mazilla.ph
Base Domain mazilla.ph
Scan Status Ok
Last Scan5/20/2025, 12:48:47 PM
Next Scan 5/27/2025, 12:48:47 PM

Last Scan

Scanned5/20/2025, 12:48:47 PM
URL https://mazilla.ph/robots.txt
Domain IPs 104.21.13.189, 172.67.133.16, 2606:4700:3033::6815:dbd, 2606:4700:3033::ac43:8510
Response IP 104.21.13.189
Found Yes
Hash 6cb778a79366327a98f3ead93920e00c2ded3d1524b7fae8a4fa8599b8d168f9
SimHash 491d5668cf00

Groups

*

Rule Path
Disallow /admin/
Disallow /signup/
Disallow /landing/
Disallow /account/
Disallow /payments/
Disallow /o/
Disallow /api/
Disallow /offerwall/

*

Rule Path
Disallow /admin/
Disallow /signup/
Disallow /landing/
Disallow /account/
Disallow /payments/
Disallow /o/
Disallow /api/
Disallow /offerwall/
Disallow /*?*

yandex

Rule Path
Disallow /admin/
Disallow /signup/
Disallow /landing/
Disallow /account/
Disallow /payments/
Disallow /o/
Disallow /api/
Disallow /offerwall/
Disallow /*?*
Disallow /*gclid
Disallow /*splash

googlebot

Rule Path
Disallow /admin/
Disallow /signup/
Disallow /landing/
Disallow /account/
Disallow /payments/
Disallow /o/
Disallow /api/
Disallow /offerwall/
Disallow /*?*
Disallow /*gclid
Disallow /*splash

twitterbot

Rule Path
Disallow *
Allow /*utm_
Allow /*source%3D

seekportbot
comparserbot
trendictionbot
blexbot
baiduspider
dataforseobot
emailcollector
emailsiphon
linkpadbot
mj12bot
msiecrawler
npbot
npbot-1/2.0
nutch
offline explorer
seekportbot
semrushbot
sitesnagger
teleport
teleportpro
turnitinbot
webcopier
webstripper
dotbot
larbin
linkdexbot
moget
psbot
serpstatbot
sogou spider
trovitbot
gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://mazilla.ph/sitemap.xml

Warnings

  • `clean-param` is not a known field.