maria-johnsen.com
robots.txt

Robots Exclusion Standard data for maria-johnsen.com

Resource Scan

Scan Details

Site Domain maria-johnsen.com
Base Domain maria-johnsen.com
Scan Status Ok
Last Scan2026-01-26T10:59:50+00:00
Next Scan 2026-02-02T10:59:50+00:00

Last Scan

Scanned2026-01-26T10:59:50+00:00
URL https://maria-johnsen.com/robots.txt
Domain IPs 104.21.34.250, 172.67.166.221, 2606:4700:3031::ac43:a6dd, 2606:4700:3034::6815:22fa
Response IP 172.67.166.221
Found Yes
Hash 796002b36cf973fae954779737c860a906b4991da11dfbdb4c7d54a8f98804fb
SimHash f91d9203a7b0

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /wp-admin/
Disallow /login/
Disallow /dashboard/
Disallow /wp-content/uploads/private/
Disallow /media/private/
Disallow /user/
Disallow /search/
Disallow /under-construction/
Disallow /staging/
Disallow /test/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.zip$

googlebot-image

Rule Path
Allow /wp-content/uploads/
Allow /media/
Disallow /wp-content/uploads/private/

googlebot-video

Rule Path
Allow /wp-content/uploads/
Allow /media/
Disallow /wp-content/uploads/private/

Other Records

Field Value
sitemap https://www.maria-johnsen.com/sitemap.xml
sitemap https://www.maria-johnsen.com/multilingualSEO-blog/sitemap.xml
sitemap https://www.maria-johnsen.com/video-sitemap.xml
sitemap https://www.maria-johnsen.com/million-dollar-blog/sitemap.xml
sitemap https://www.maria-johnsen.com/deutschblog/sitemap.xml
sitemap https://www.maria-johnsen.com/lesarticles/sitemap.xml
sitemap https://www.maria-johnsen.com/danskblog/sitemap.xml

Comments

  • robots.txt for https://www.maria-johnsen.com/
  • Optimized for Google, Bing, and other major crawlers
  • Disallow sensitive or duplicate areas
  • Block file types that don't need indexing
  • Allow Googlebot-Image and Googlebot-Video access to public media
  • Sitemap locations