enterprise.wikimedia.com
robots.txt

Robots Exclusion Standard data for enterprise.wikimedia.com

Resource Scan

Scan Details

Site Domain enterprise.wikimedia.com
Base Domain wikimedia.com
Scan Status Ok
Last Scan 2024-11-15T03:10:25+00:00
Next Scan 2024-12-15T03:10:25+00:00

Last Scan

Scanned 2024-11-15T03:10:25+00:00
URL https://enterprise.wikimedia.com/robots.txt
Domain IPs 13.33.88.110, 13.33.88.25, 13.33.88.30, 13.33.88.87
Response IP 13.33.88.25
Found Yes
Hash 67b45826996a88a76f27288fd9121a8196405cb085426c61648770a8bf3a81fe
SimHash 83045902a5d2
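
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. Assuming SHA-256 over the raw response body, a freshly fetched copy of the file could be compared against the recorded value roughly as follows. The names `sha256_hex` and `matches_reported_hash` are hypothetical helpers, not part of any scanner API.

```python
import hashlib

# Hash value as recorded in the scan details above.
REPORTED_HASH = "67b45826996a88a76f27288fd9121a8196405cb085426c61648770a8bf3a81fe"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Check a robots.txt body against the recorded hash.

    Assumption: the scanner hashed the raw, unmodified response body
    with SHA-256. If it used another algorithm or normalized the text
    first, this comparison will fail even for identical content.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch therefore means either that the file changed since the scan or that the assumption about the algorithm is wrong.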

Groups

*

Rule Path
Disallow /*.php$
Disallow /comment-page-*
Disallow /?s=*
Disallow /search/*
Disallow /trackback/
Disallow /wp-json/
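
To illustrate how the rules above apply to concrete URLs, here is a minimal sketch of a matcher. It assumes the wildcard semantics described in RFC 9309: `*` matches any sequence of characters, a trailing `$` anchors the end of the URL, and every other rule matches as a prefix. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and rule precedence because this group contains only Disallow lines.

```python
import re

# Disallow rules copied from the "*" group listed above.
DISALLOW_RULES = [
    "/*.php$",
    "/comment-page-*",
    "/?s=*",
    "/search/*",
    "/trackback/",
    "/wp-json/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*', a trailing '$' anchors the end of the URL,
    and all other characters are matched literally.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if the path (including any query string) matches a rule.

    re.match anchors at the start of the string, which gives the
    prefix-matching behavior robots.txt rules rely on.
    """
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

Under these assumptions, `is_disallowed("/index.php")` and `is_disallowed("/?s=wikipedia")` are true, while `is_disallowed("/index.php?x=1")` is false because `$` requires the URL to end in `.php`.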

Other Records

Field Value
sitemap https://enterprise.wikimedia.com/sitemap.xml

Comments

  • robots.txt for wordpress
  • https://gist.github.com/chuckreynolds/135728