theabusites.com
robots.txt
Robots Exclusion Standard data for theabusites.com
Resource Scan
Scan Details
| Site Domain | theabusites.com |
| Base Domain | theabusites.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2025-10-27T15:02:37+00:00 |
| Next Scan | 2025-11-26T15:02:37+00:00 |
Last Successful Scan
| Scanned | 2025-09-28T05:48:56+00:00 |
| URL | https://theabusites.com/robots.txt |
| Redirect | https://www.theabusites.com/robots.txt |
| Redirect Domain | www.theabusites.com |
| Redirect Base | theabusites.com |
| Domain IPs | 68.168.220.124 |
| Redirect IPs | 68.168.220.124 |
| Response IP | 68.168.220.124 |
| Found | Yes |
| Hash | 5103cb065bceb0d623b62ba21774e52d0353496f6b4cd2f36f5b8d67c33ef769 |
| SimHash | 21b199e01c70 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /cgi-bin |
| Disallow | /wp- |
| Disallow | /?s= |
| Disallow | *%26s%3D |
| Disallow | /search |
| Disallow | /author/ |
| Disallow | *?attachment_id= |
| Disallow | */feed |
| Disallow | */rss |
| Disallow | */embed |
| Allow | /wp-content/uploads/ |
| Allow | /wp-content/themes/ |
| Allow | /*/*.js |
| Allow | /*/*.css |
| Allow | /wp-*.png |
| Allow | /wp-*.jpg |
| Allow | /wp-*.jpeg |
| Allow | /wp-*.gif |
| Allow | /wp-*.svg |
| Allow | /wp-*.pdf |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.theabusites.com/sitemap.xml |