thecontentma.wordpress.com
robots.txt
Robots Exclusion Standard data for thecontentma.wordpress.com
Resource Scan
Scan Details
| Site Domain | thecontentma.wordpress.com |
| Base Domain | wordpress.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2026-03-05T23:18:07+00:00 |
| Next Scan | 2026-06-03T23:18:07+00:00 |
Last Successful Scan
| Scanned | 2025-01-17T11:39:20+00:00 |
| URL | https://thecontentma.wordpress.com/robots.txt |
| Domain IPs | 192.0.78.12, 192.0.78.13 |
| Response IP | 192.0.78.13 |
| Found | Yes |
| Hash | 1ac6fdfd46030e11d2e522654a428d56a5abbc549c4e4aa55618ec45f99fb380 |
| SimHash | b3179a0c2ef7 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
| Disallow | /wp-login.php |
| Disallow | /wp-signup.php |
| Disallow | /press-this.php |
| Disallow | /remote-login.php |
| Disallow | /activate/ |
| Disallow | /cgi-bin/ |
| Disallow | /mshots/v1/ |
| Disallow | /next/ |
| Disallow | /public.api/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://thecontentma.wordpress.com/sitemap.xml |
| sitemap | https://thecontentma.wordpress.com/news-sitemap.xml |
Comments