pupha.net
robots.txt

Robots Exclusion Standard data for pupha.net

Resource Scan

Scan Details

Site Domain pupha.net
Base Domain pupha.net
Scan Status Ok
Last Scan2025-11-23T15:02:15+00:00
Next Scan 2025-11-30T15:02:15+00:00

Last Scan

Scanned2025-11-23T15:02:15+00:00
URL https://pupha.net/robots.txt
Domain IPs 104.21.69.106, 172.67.207.109, 2606:4700:3032::ac43:cf6d, 2606:4700:3034::6815:456a
Response IP 104.21.69.106
Found Yes
Hash f9dea5c1246d225f1b3a2c761bdb8798bc298c902544a9f3a53e41fb6fea9c0e
SimHash 49361840c5b4

Groups

bingbot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /

googlebot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /

mediapartners-google

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /

adsbot-google

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /

slurp

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /

facebot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /

twitterbot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /

ccbot

Rule Path
Disallow /

megalodon

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pupha.net/sitemap.xml

Comments

  • MJ12bot measures