wordpress.org
robots.txt

Robots Exclusion Standard data for wordpress.org

Resource Scan

Scan Details

Site Domain wordpress.org
Base Domain wordpress.org
Scan Status Ok
Last Scan 2024-11-04T16:09:39+00:00
Next Scan 2024-11-18T16:09:39+00:00

Last Scan

Scanned 2024-11-04T16:09:39+00:00
URL https://wordpress.org/robots.txt
Domain IPs 198.143.164.252
Response IP 198.143.164.252
Found Yes
Hash 70818390dfa21d322b2510c3e8e1554c270fedb106194c6ed8c2bfbc532c41ab
SimHash 6b05f628eb2a

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /wp-admin/load-scripts.php
Allow /wp-admin/load-styles.php

*

Rule Path
Disallow /search
Disallow /?s=

*

Rule Path
Disallow /plugins/search/
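
The three `*` groups above can be read as one combined rule set, since RFC 9309 says groups naming the same user agent are merged. The sketch below is a minimal illustration of how a crawler might evaluate those rules under the RFC's longest-match convention, where the most specific matching path wins and Allow wins a tie. The names `RULES` and `is_allowed` are hypothetical and not part of the scan data. Only plain prefix matching is implemented, because none of the listed paths use the `*` or `$` wildcards.

```python
# Rules copied from the three "*" groups in the scan, merged into one list.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-admin/load-scripts.php"),
    ("allow", "/wp-admin/load-styles.php"),
    ("disallow", "/search"),
    ("disallow", "/?s="),
    ("disallow", "/plugins/search/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match semantics.

    Assumption: plain prefix matching only (no "*" or "$" handling).
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        # The longer prefix wins; on equal length, Allow beats Disallow.
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/wp-admin/", "/wp-admin/admin-ajax.php", "/?s=themes", "/plugins/"):
        print(p, is_allowed(p))
```

Under this reading, `/wp-admin/admin-ajax.php` is crawlable even though it sits under the disallowed `/wp-admin/` prefix, because the Allow path is longer. Parsers that apply the first matching rule instead of the longest one may reach a different answer for that path.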

Other Records

Field Value
sitemap https://wordpress.org/sitemap.xml
sitemap https://wordpress.org/news-sitemap.xml
sitemap https://wordpress.org/themes/sitemap.xml
sitemap https://wordpress.org/plugins/sitemap.xml
sitemap https://wordpress.org/news/sitemap.xml
sitemap https://wordpress.org/showcase/sitemap.xml
sitemap https://wordpress.org/documentation/sitemap.xml
sitemap https://wordpress.org/patterns/sitemap.xml
sitemap https://wordpress.org/photos/sitemap.xml
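
The sitemap records above sit outside any user-agent group, so a consumer can collect them without evaluating the rules. The sketch below shows one way to pull such records out of raw robots.txt text. The function name `sitemap_urls` and the `SAMPLE` text are hypothetical: the sample is a short illustrative fragment that reuses two URLs from the list, not the file's actual contents.

```python
# Illustrative fragment only; the real file's layout is not shown in the scan.
SAMPLE = """\
User-agent: *
Disallow: /search

Sitemap: https://wordpress.org/sitemap.xml
sitemap: https://wordpress.org/plugins/sitemap.xml
"""


def sitemap_urls(robots_txt):
    """Collect the values of all Sitemap records, in file order."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments; assumes the URLs contain no "#" fragment.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the "https:" in the value survives.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


if __name__ == "__main__":
    for url in sitemap_urls(SAMPLE):
        print(url)
```

Field names are compared case-insensitively here, which is why both `Sitemap:` and `sitemap:` lines are picked up.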