larespubblica.com
robots.txt

Robots Exclusion Standard data for larespubblica.com

Resource Scan

Scan Details

Site Domain larespubblica.com
Base Domain larespubblica.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-05-27T06:48:25+00:00
Next Scan 2024-08-25T06:48:25+00:00

Last Successful Scan

Scanned 2023-10-31T05:07:49+00:00
URL https://larespubblica.com/robots.txt
Domain IPs 104.21.75.101, 172.67.221.83, 2606:4700:3031::ac43:dd53, 2606:4700:3034::6815:4b65
Response IP 104.21.75.101
Found Yes
Hash bbb39d52e2de9ee754a39d695000f5ea84cee27b2e69a55d7ccf945081dc6292
SimHash 65105050c391

Groups

*

Rule Path
Disallow /404
Disallow /data-deletion
Disallow /logout
Disallow /goto
Disallow /goto/
Disallow /search-article
Disallow /search
Disallow /tim-kiem/
Disallow /tim-kiem-truyen
Disallow /w/
Disallow /wp-content/
Disallow /sw.js

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

yandex

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /
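
To illustrate how a crawler might interpret the groups listed above, the following minimal sketch rebuilds a robots.txt from them and queries it with Python's standard `urllib.robotparser`. The file text is a hypothetical reconstruction: the original ordering, casing, and any comments are not known from this report. The `allowed` helper, the `ExampleBot` user agent, and the `/some-article` path are assumptions made for illustration only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the file from the groups in this report;
# the real file's ordering, casing, and comments are unknown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /404
Disallow: /data-deletion
Disallow: /logout
Disallow: /goto
Disallow: /goto/
Disallow: /search-article
Disallow: /search
Disallow: /tim-kiem/
Disallow: /tim-kiem-truyen
Disallow: /w/
Disallow: /wp-content/
Disallow: /sw.js

User-agent: msnbot
Crawl-delay: 20

User-agent: bingbot
Crawl-delay: 20

User-agent: yandex
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: bytespider
Disallow: /

User-agent: petalbot
Disallow: /
"""

BASE = "https://larespubblica.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # ExampleBot is a made-up agent that falls through to the "*" group.
    for agent, path in [
        ("ExampleBot", "/some-article"),
        ("ExampleBot", "/search"),
        ("bingbot", "/search"),
        ("GPTBot", "/"),
    ]:
        print(agent, path, allowed(agent, path))
    print("bingbot crawl-delay:", parser.crawl_delay("bingbot"))
```

Under this reconstruction, an agent with its own group (such as bingbot) is matched against that group only, so the `*` rules do not apply to it, which is consistent with the "No rules defined. All paths allowed." entries above. Other parsers may differ in details such as path matching and crawl-delay support.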