rethe106.com
robots.txt

Robots Exclusion Standard data for rethe106.com

Resource Scan

Scan Details

Site Domain rethe106.com
Base Domain rethe106.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-11-02T01:22:04+00:00
Next Scan 2025-01-31T01:22:04+00:00

Last Successful Scan

Scanned2024-07-06T01:21:03+00:00
URL https://rethe106.com/robots.txt
Domain IPs 192.0.78.190, 192.0.78.221
Response IP 192.0.78.190
Found Yes
Hash 1db2a329aba4092bbf8a68573182bb10768258d961c3df5f94b80f7a66f9e8a0
SimHash 04918b601c3a

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

Other Records

Field Value
sitemap https://rethe106.com/sitemap.xml
sitemap https://rethe106.com/news-sitemap.xml
sitemap https://rethe106.com/sitemap_index.xml