thenewslens.com
robots.txt

Robots Exclusion Standard data for thenewslens.com

Resource Scan

Scan Details

Site Domain thenewslens.com
Base Domain thenewslens.com
Scan Status Ok
Last Scan2024-11-01T02:52:44+00:00
Next Scan 2024-11-08T02:52:44+00:00

Last Scan

Scanned2024-11-01T02:52:44+00:00
URL https://thenewslens.com/robots.txt
Redirect https://www.thenewslens.com/robots.txt
Redirect Domain www.thenewslens.com
Redirect Base thenewslens.com
Domain IPs 34.160.188.69
Redirect IPs 34.160.188.69
Response IP 34.160.188.69
Found Yes
Hash 96c9194cd72d9913a80bd6b1ae47718a299791a53c5c7bcc0ae89a20a9a97d40
SimHash 5214d970e751

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

grapeshot

Rule Path
Disallow

*

Rule Path
Allow /
Disallow /search
Disallow /api/article/infinite