learnubuntu.com
robots.txt

Robots Exclusion Standard data for learnubuntu.com

Resource Scan

Scan Details

Site Domain learnubuntu.com
Base Domain learnubuntu.com
Scan Status Ok
Last Scan 2024-09-19T16:12:18+00:00
Next Scan 2024-09-26T16:12:18+00:00

Last Scan

Scanned 2024-09-19T16:12:18+00:00
URL https://learnubuntu.com/robots.txt
Domain IPs 104.21.60.241, 172.67.202.141, 2606:4700:3032::ac43:ca8d, 2606:4700:3037::6815:3cf1
Response IP 104.21.60.241
Found Yes
Hash a17834e9d50cd98a9bce6e1624007fda30099d5994d72d7554e4a30c404b38d8
SimHash e0144514fd13
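
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Under that assumption, a minimal sketch for checking a locally fetched copy against the reported value could look like this. The function names `sha256_hex` and `matches_reported_hash` are hypothetical, and whether the scanner hashes the raw response bytes or a normalized form is not stated in the report.

```python
import hashlib

# Hash value as reported in the scan details above.
REPORTED_HASH = "a17834e9d50cd98a9bce6e1624007fda30099d5994d72d7554e4a30c404b38d8"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a robots.txt body against a reported hash.

    Assumption: the scanner hashes the raw response body with SHA-256.
    If it uses another algorithm or normalizes the text first, this
    comparison will not match even for an unchanged file.
    """
    return sha256_hex(body) == reported.strip().lower()
```

A mismatch here would only mean that the bytes differ from what the scanner hashed (or that the assumption about the algorithm is wrong), not necessarily that the rules changed.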

Groups

*

Rule Path
Disallow /ghost/
Disallow /email/
Disallow /members/api/comments/counts/
Disallow /r/
Disallow /webmentions/receive/

Other Records

Field Value
sitemap https://learnubuntu.com/sitemap.xml
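
To illustrate how a crawler would interpret the group and sitemap record listed above, here is a small sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from the rules in this report, not the verbatim file served by the site, so its exact formatting is an assumption. The agent name `ExampleBot` and the helper `is_allowed` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Groups" and "Other Records" sections of this report.
# Assumption: the real file may differ in ordering, comments, or whitespace.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ghost/
Disallow: /email/
Disallow: /members/api/comments/counts/
Disallow: /r/
Disallow: /webmentions/receive/

Sitemap: https://learnubuntu.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() accepts an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the given agent may fetch the path under these rules."""
    return parser.can_fetch(agent, "https://learnubuntu.com" + path)


if __name__ == "__main__":
    for path in ["/", "/ghost/api/", "/r/abc123", "/rss/", "/email/x"]:
        print(path, "allowed" if is_allowed(path) else "disallowed")
    # site_maps() is available in Python 3.8 and later.
    print("sitemaps:", parser.site_maps())
```

Because the only group applies to `*`, every agent gets the same answer. The matching is by path prefix, so `/rss/` remains fetchable even though `/r/` is disallowed.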