wphoto.tw
robots.txt

Robots Exclusion Standard data for wphoto.tw

Resource Scan

Scan Details

Site Domain wphoto.tw
Base Domain wphoto.tw
Scan Status Ok
Last Scan 2024-09-21T11:20:30+00:00
Next Scan 2024-09-28T11:20:30+00:00

Last Scan

Scanned 2024-09-21T11:20:30+00:00
URL https://wphoto.tw/robots.txt
Domain IPs 104.21.66.151, 172.67.205.54, 2606:4700:3033::6815:4297, 2606:4700:3035::ac43:cd36
Response IP 104.21.66.151
Found Yes
Hash 5c26c33e1188c9d6c2771f9051ffbeef760385ccb8a0e82e864e875ecb9c3218
SimHash 097c88006b12

Groups

googlebot

Rule Path
Allow *.css
Allow *.js

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /
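
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. It assumes a robots.txt reconstructed from the listed groups; the actual file's ordering, spacing, and comment lines may differ. The names `ROBOTS_LINES` and `allowed` are hypothetical helpers introduced for illustration. Note that Python's parser does plain prefix matching and does not interpret wildcard patterns such as `*.css`, so results for the googlebot group may differ from how Google's own crawler reads those rules.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the groups listed in this report;
# not a verbatim copy of https://wphoto.tw/robots.txt.
ROBOTS_LINES = """\
User-agent: googlebot
Allow: *.css
Allow: *.js

User-agent: *
Disallow: /wp-admin/
Disallow: /wp-includes/

User-agent: mj12bot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: baiduspider
Disallow: /

User-agent: ezooms
Disallow: /

User-agent: yandexbot
Disallow: /

User-agent: sogou web spider
Disallow: /
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)  # parse in memory, no network request


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for agent, path in [
        ("mj12bot", "/"),
        ("sogou web spider", "/"),
        ("SomeOtherBot", "/wp-admin/"),
        ("SomeOtherBot", "/photos/"),
    ]:
        print(f"{agent!r} -> {path}: {allowed(agent, path)}")
```

A crawler that is not named in any group falls back to the `*` group, so only `/wp-admin/` and `/wp-includes/` are off limits to it, while the named crawlers in the remaining groups are blocked from the whole site.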

Comments

  • Googlebot
  • Other bot spider
  • Block malicious crawlers