thenewsindependent.com
robots.txt

Robots Exclusion Standard data for thenewsindependent.com

Resource Scan

Scan Details

Site Domain thenewsindependent.com
Base Domain thenewsindependent.com
Scan Status Ok
Last Scan2025-03-08T10:20:16+00:00
Next Scan 2025-04-07T10:20:16+00:00

Last Scan

Scanned2025-03-08T10:20:16+00:00
URL https://thenewsindependent.com/robots.txt
Domain IPs 104.21.41.26, 172.67.159.18, 2606:4700:3032::ac43:9f12, 2606:4700:3035::6815:291a
Response IP 172.67.159.18
Found Yes
Hash 9e4a67d798020137165a2755e3c79922f81b1f44780855616ec3c4b693e7b394
SimHash 209509e01af8

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /feedback
Disallow /privacy-policy
Disallow *%26s%3D
Disallow /terms-and-conditions
Disallow /author/
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

Other Records

Field Value
sitemap http://thenewsindependent.com/wp-sitemap.xml