newsaffinity.com
robots.txt

Robots Exclusion Standard data for newsaffinity.com

Resource Scan

Scan Details

Site Domain newsaffinity.com
Base Domain newsaffinity.com
Scan Status Ok
Last Scan 2025-10-18T10:24:47+00:00
Next Scan 2025-10-25T10:24:47+00:00

Last Scan

Scanned 2025-10-18T10:24:47+00:00
URL https://newsaffinity.com/robots.txt
Domain IPs 104.21.75.209, 172.67.181.254, 2606:4700:3031::6815:4bd1, 2606:4700:3036::ac43:b5fe
Response IP 104.21.75.209
Found Yes
Hash 59020ec608ad6ecd78c7182ab0bfd9bbdd074407382f1d7d73ad27545821b834
SimHash 7534d94a56b4

Groups

User-agent *

Rule Path
Disallow /cgi-bin/
Disallow /page/
Disallow /blog/page/*
Disallow /amp/page/*
Disallow /dgd_scrollbox/
Disallow /?s=*
Disallow /go/
Disallow /recommended/
Disallow /comments/feed/
Disallow /category/uncategorized/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search?
Disallow /?p=*
Disallow *?replytocom
Disallow */trackback
Disallow */feed
Disallow */comments
Allow /tag/
Disallow /author/
Disallow /author/*
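
The rule table above can be checked against concrete paths. What follows is a minimal sketch, not the scanner's own logic: it assumes the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), treats `*` as "any sequence of characters" and a trailing `$` as an end anchor. The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical, and patterns that start with `*` are simply treated as wildcards here, which some crawlers may handle differently.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/cgi-bin/"),
    ("disallow", "/page/"),
    ("disallow", "/blog/page/*"),
    ("disallow", "/amp/page/*"),
    ("disallow", "/dgd_scrollbox/"),
    ("disallow", "/?s=*"),
    ("disallow", "/go/"),
    ("disallow", "/recommended/"),
    ("disallow", "/comments/feed/"),
    ("disallow", "/category/uncategorized/"),
    ("disallow", "/trackback/"),
    ("disallow", "/index.php"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/search?"),
    ("disallow", "/?p=*"),
    ("disallow", "*?replytocom"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/comments"),
    ("allow", "/tag/"),
    ("disallow", "/author/"),
    ("disallow", "/author/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where the pattern had "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed semantics."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow beats Disallow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ["/tag/politics/", "/tag/politics/feed", "/author/jane", "/some-article/"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, `/tag/politics/` is allowed by `Allow /tag/`, while `/tag/politics/feed` is blocked because `*/feed` is the longer matching pattern.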

Other Records

Field Value
sitemap https://newsaffinity.com/sitemap.xml
sitemap https://newsaffinity.com/sitemap-news.xml
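
As an illustration of how records like the two above can be pulled out of a robots.txt body, here is a small sketch. The `SAMPLE` text is a reconstructed fragment built from values in this report, not the actual file served by the site, and `extract_sitemaps` is a hypothetical helper name.

```python
# Reconstructed fragment for illustration only; not the real file contents.
SAMPLE = """\
User-agent: *
Disallow: /cgi-bin/
Allow: /tag/

Sitemap: https://newsaffinity.com/sitemap.xml
Sitemap: https://newsaffinity.com/sitemap-news.xml
"""


def extract_sitemaps(robots_text):
    """Collect the values of all `sitemap` fields, in file order."""
    urls = []
    for line in robots_text.splitlines():
        # Drop trailing comments; this assumes sitemap URLs contain no "#".
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own "://" is preserved.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```

Field names are compared case-insensitively, which is why the report can list the field as lowercase `sitemap` regardless of how it is capitalized in the file.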

Comments

  • XML Sitemap & Google News version 5.2.7 - https://status301.net/wordpress-plugins/xml-sitemap-feed/

Warnings

  • 1 invalid line.