theworldmaterial.com
robots.txt

Robots Exclusion Standard data for theworldmaterial.com

Resource Scan

Scan Details

Site Domain theworldmaterial.com
Base Domain theworldmaterial.com
Scan Status Ok
Last Scan2025-04-20T10:54:15+00:00
Next Scan 2025-04-27T10:54:15+00:00

Last Scan

Scanned2025-04-20T10:54:15+00:00
URL https://theworldmaterial.com/robots.txt
Domain IPs 104.21.34.176, 172.67.163.142, 2606:4700:3033::6815:22b0, 2606:4700:3033::ac43:a38e
Response IP 104.21.34.176
Found Yes
Hash a794ab356a97d05d287d2c675ec23e6b162c4aacd5d4c968b9e883a6bbf08d48
SimHash d856d8c0e017

Groups

*

Rule Path
Disallow /?s=
Disallow /?
Disallow /%21
Disallow /*%21
Disallow /*?
Disallow /page/*/?s=
Disallow /search/
Disallow /wp-json/
Disallow /?rest_route=
Disallow /*/feed$
Disallow /*/1000$

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.theworldmaterial.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK