media-weber.com
robots.txt

Robots Exclusion Standard data for media-weber.com

Resource Scan

Scan Details

Site Domain media-weber.com
Base Domain media-weber.com
Scan Status Ok
Last Scan3/22/2025, 4:08:29 PM
Next Scan 4/21/2025, 4:08:29 PM

Last Scan

Scanned3/22/2025, 4:08:29 PM
URL https://media-weber.com/robots.txt
Domain IPs 104.21.77.69, 172.67.205.38, 2606:4700:3030::6815:4d45, 2606:4700:3033::ac43:cd26
Response IP 172.67.205.38
Found Yes
Hash c984266430917c70b203f2ed82db2b8fef14ab28f0f7c2e7b0aeb26dabc058f0
SimHash 7d35ab50d5b0

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /wp/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Allow */wp-*/*ajax*.php
Allow */wp-sitemap
Allow */uploads
Allow */wp-*/*.js
Allow */wp-*/*.css
Allow */wp-*/*.png
Allow */wp-*/*.jpg
Allow */wp-*/*.jpeg
Allow */wp-*/*.gif
Allow */wp-*/*.svg
Allow */wp-*/*.webp
Allow */wp-*/*.swf
Allow */wp-*/*.pdf

Other Records

Field Value
sitemap /sitemap.xml