join.com
robots.txt

Robots Exclusion Standard data for join.com

Resource Scan

Scan Details

Site Domain join.com
Base Domain join.com
Scan Status Ok
Last Scan2024-05-06T22:44:12+00:00
Next Scan 2024-05-20T22:44:12+00:00

Last Scan

Scanned2024-05-06T22:44:12+00:00
URL https://join.com/robots.txt
Domain IPs 104.26.12.164, 104.26.13.164, 172.67.71.101, 2606:4700:20::681a:ca4, 2606:4700:20::681a:da4, 2606:4700:20::ac43:4765
Response IP 104.26.13.164
Found Yes
Hash 4ff3f3e9048137063af7432b540d5e8f3e218b755caff0b67e3ef93e1e6f88e2
SimHash 2b2058908b91

Groups

*

Rule Path
Disallow /lp/*
Disallow /join-job-ad-quality-guidelines

Other Records

Field Value
sitemap https://join.com/sitemap-index.xml
sitemap https://join.com/de/sitemap-index.xml
sitemap https://join.com/fr/sitemap-index.xml
sitemap https://join.com/nl/sitemap-index.xml
sitemap https://join.com/es/sitemap-index.xml