findtalent.network
robots.txt

Robots Exclusion Standard data for findtalent.network

Archived Snapshots

Resource Scan

Scan Details

Site Domain	findtalent.network
Base Domain	findtalent.network
Scan Status	Ok
Last Scan	2025-11-17T17:18:33+00:00
Next Scan	2025-12-17T17:18:33+00:00

Last Scan

Scanned	2025-11-17T17:18:33+00:00
URL	https://findtalent.network/robots.txt
Domain IPs	104.21.5.176, 172.67.133.174, 2606:4700:3032::ac43:85ae, 2606:4700:3036::6815:5b0
Response IP	172.67.133.174
Found	Yes
Hash	d7be671f49a308e24550992df3d07a8bb270061bebae9eb7a3df2c00efb71a8a
SimHash	ba9d0d857340

Groups

dataprovider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

googlebot

Rule	Path
Disallow	/jobs/view/
Disallow	/jobs/snippet/

Rule

Path

Disallow

/jobs/view/

Disallow

/jobs/snippet/

*

Rule	Path
Disallow	/jobs/view/
Disallow	/jobs/snippet/

Rule

Path

Disallow

/jobs/view/

Disallow

/jobs/snippet/

Back to top

Comments

See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-agent: *
Disallow: /

Back to top

findtalent.networkrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

dataprovider

semrushbot

ahrefsbot

mj12bot

blexbot

googlebot

*

Comments

findtalent.network
robots.txt