physislaboratory.com
robots.txt

Robots Exclusion Standard data for physislaboratory.com

Resource Scan

Scan Details

Site Domain physislaboratory.com
Base Domain physislaboratory.com
Scan Status Ok
Last Scan2025-08-17T07:34:38+00:00
Next Scan 2025-09-16T07:34:38+00:00

Last Scan

Scanned2025-08-17T07:34:38+00:00
URL https://physislaboratory.com/robots.txt
Domain IPs 104.21.94.56, 172.67.220.40, 2606:4700:3031::ac43:dc28, 2606:4700:3037::6815:5e38
Response IP 104.21.94.56
Found Yes
Hash 38288158fe304dc09f2d1aa2aeb0d94b11d9a111d6ed796a1ba75936748d69b1
SimHash e016415ae693

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /wp-admin/
Disallow /wp-includes/
Allow /wp-includes/js/
Allow /wp-includes/css/
Disallow /?s=*
Disallow /*.sql$
Disallow /*.log$
Allow /index.php
Disallow /*.php$

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-news

Rule Path
Disallow

googlebot-video

Rule Path
Disallow

bingbot

Rule Path
Disallow /

duckduckbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /
Disallow /*?utm_*
Disallow /*%26utm_*
Disallow /*?fbclid=*

Other Records

Field Value
sitemap https://physislaboratory.com/sitemap_index.xml

Comments

  • robots.txt for physislaboratory.com
  • Allow all legitimate crawlers except specific ones you want to block
  • Allow essential assets within wp-includes (e.g., styles and scripts)
  • Block unnecessary or sensitive file types
  • Allow critical PHP pages, block others
  • Allow Google crawlers (main)
  • Block Bingbot
  • Block DuckDuckBot
  • Block Slurp (Yahoo)
  • Block unwanted bots you had already listed
  • Optional: Block UTM and tracking parameters (explained below)
  • Sitemap location