joinsaint.com
robots.txt

Robots Exclusion Standard data for joinsaint.com

Resource Scan

Scan Details

Site Domain joinsaint.com
Base Domain joinsaint.com
Scan Status Ok
Last Scan2025-11-23T14:47:29+00:00
Next Scan 2025-12-23T14:47:29+00:00

Last Scan

Scanned2025-11-23T14:47:29+00:00
URL https://joinsaint.com/robots.txt
Domain IPs 104.21.0.137, 172.67.186.6, 2606:4700:3031::ac43:ba06, 2606:4700:3036::6815:89
Response IP 172.67.186.6
Found Yes
Hash 565325d85f929962d1e7442a2a6b4a5fa3f1c78cc3d7bf504629cdb9578b3bba
SimHash 7910fc00c4a3

Groups

*

Rule Path
Allow /
Allow /blog/
Allow /blog/*/
Allow /es/blog/
Allow /es/blog/*/
Allow /fr/blog/
Allow /fr/blog/*/
Allow /pt/blog/
Allow /pt/blog/*/
Allow /ru/blog/
Allow /ru/blog/*/
Disallow /privacy-policy/
Disallow /terms-of-service/

Other Records

Field Value
sitemap https://joinsaint.com/sitemap.xml