newsinfo.inquirer.net
robots.txt

Robots Exclusion Standard data for newsinfo.inquirer.net

Resource Scan

Scan Details

Site Domain newsinfo.inquirer.net
Base Domain inquirer.net
Scan Status Ok
Last Scan2024-11-14T08:09:15+00:00
Next Scan 2024-11-21T08:09:15+00:00

Last Scan

Scanned2024-11-14T08:09:15+00:00
URL https://newsinfo.inquirer.net/robots.txt
Domain IPs 104.22.70.88, 104.22.71.88, 172.67.7.74, 2606:4700:10::6816:4658, 2606:4700:10::6816:4758, 2606:4700:10::ac43:74a
Response IP 172.67.7.74
Found Yes
Hash 250bffbe2eeb58ddf40d8f9f5ce195500460c289da612db4d4eedccee53f60fb
SimHash e0314106ea15

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /xmlrpc.php
Disallow /feed-karera
Disallow /feed-lotto
Disallow /feed-full-horoscope
Disallow /feed-karera-tips
Disallow /bandera-custom-feed
Disallow /custom-feed-pagesuite
Disallow /4711/

*

Rule Path
Disallow /factiva*
Disallow /61405*
Disallow /.well-known*

petalbot
gptbot

Rule Path
Disallow /