sanfrancisco.lovetoknow.com
robots.txt

Robots Exclusion Standard data for sanfrancisco.lovetoknow.com

Resource Scan

Scan Details

Site Domain sanfrancisco.lovetoknow.com
Base Domain lovetoknow.com
Scan Status Ok
Last Scan 2024-09-17T04:20:22+00:00
Next Scan 2024-10-17T04:20:22+00:00

Last Scan

Scanned 2024-09-17T04:20:22+00:00
URL https://sanfrancisco.lovetoknow.com/robots.txt
Domain IPs 108.157.254.123, 108.157.254.15, 108.157.254.59, 108.157.254.69, 2600:9000:2753:2000:1c:98c3:8340:93a1, 2600:9000:2753:3800:1c:98c3:8340:93a1, 2600:9000:2753:7000:1c:98c3:8340:93a1, 2600:9000:2753:7c00:1c:98c3:8340:93a1, 2600:9000:2753:b000:1c:98c3:8340:93a1, 2600:9000:2753:e600:1c:98c3:8340:93a1, 2600:9000:2753:fa00:1c:98c3:8340:93a1, 2600:9000:2753:fc00:1c:98c3:8340:93a1
Response IP 108.157.254.15
Found Yes
Hash b52f518f0cbcd98f0d5a39f4b1c1f043651258546a2ba7853961620367e7da4a
SimHash 3c4218542110

Groups

*

Rule Path
Disallow /1004147*
Disallow /frontend.
Disallow /frontend_
Disallow /backend.
Disallow /backend_
Disallow /css/
Disallow /js/
Disallow /sf/
Disallow /print/
Disallow /search?
Disallow /image/
Disallow /advice$
Disallow /advice/
Disallow /botCheck.json

mediapartners-google

Rule Path
Disallow /index.
Disallow /wiki/index.
Disallow /frontend.
Disallow /frontend_
Disallow /backend.
Disallow /backend_
Disallow /css/
Disallow /js/
Disallow /sf/
Disallow /print/
Disallow /search?
Disallow /advice$
Disallow /advice/
Disallow /botCheck.json
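
The two groups above use the `*` wildcard and the `$` end-of-path anchor. The following is a minimal sketch, not the scanner's own logic, of how such rules could be evaluated against a URL path. It assumes the common semantics (`*` matches any character sequence, `$` anchors the end of the path, everything else is a prefix match) and picks the group by a case-insensitive match on the user-agent name, falling back to `*`. The names `GROUPS`, `rule_to_regex`, and `is_disallowed` are illustrative only. Since the groups contain only Disallow rules, no Allow precedence handling is included.

```python
import re

# Disallow rules copied from the two groups listed in the scan.
GROUPS = {
    "*": [
        "/1004147*", "/frontend.", "/frontend_", "/backend.", "/backend_",
        "/css/", "/js/", "/sf/", "/print/", "/search?", "/image/",
        "/advice$", "/advice/", "/botCheck.json",
    ],
    "mediapartners-google": [
        "/index.", "/wiki/index.", "/frontend.", "/frontend_", "/backend.",
        "/backend_", "/css/", "/js/", "/sf/", "/print/", "/search?",
        "/advice$", "/advice/", "/botCheck.json",
    ],
}


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the end of the path; otherwise the rule is a prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces, then rejoin them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(piece) for piece in rule.split("*"))
    if anchored:
        pattern += r"\Z"
    return re.compile(pattern)


def is_disallowed(path: str, user_agent: str = "*") -> bool:
    """Return True if any Disallow rule of the applicable group matches."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    # re.match anchors at the start of the path, giving prefix behaviour.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("*", "/advice"),
        ("*", "/advice-column"),
        ("*", "/search?q=cable+car"),
        ("Mediapartners-Google", "/image/photo.jpg"),
    ]:
        print(agent, path, is_disallowed(path, agent))
```

Under these assumptions, `/advice` is blocked by `/advice$` while `/advice-column` is not, and `/image/` paths are blocked for the `*` group but not for `mediapartners-google`, whose group omits that rule.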

Other Records

Field Value
sitemap https://sanfrancisco.lovetoknow.com/sitemap-index.xml