manpower.it
robots.txt

Robots Exclusion Standard data for manpower.it

Resource Scan

Scan Details

Site Domain manpower.it
Base Domain manpower.it
Scan Status Ok
Last Scan2024-11-04T21:23:48+00:00
Next Scan 2024-11-18T21:23:48+00:00

Last Scan

Scanned2024-11-04T21:23:48+00:00
URL https://manpower.it/robots.txt
Redirect https://www.manpower.it/robots.txt
Redirect Domain www.manpower.it
Redirect Base manpower.it
Domain IPs 104.18.38.209, 172.64.149.47, 2606:4700:4400::6812:26d1, 2606:4700:4400::ac40:952f
Redirect IPs 104.18.38.209, 172.64.149.47, 2606:4700:4400::6812:26d1, 2606:4700:4400::ac40:952f
Response IP 104.18.38.209
Found Yes
Hash d6c1f3d472b9544fbead2ffb264f63fbac0ff18ddc873b5735bfd191e0438b90
SimHash 2a50b111c591

Groups

*

Rule Path
Disallow /

slurp

Rule Path
Allow /
Disallow /en
Disallow /candidate

google

Rule Path
Allow /
Disallow /en
Disallow /candidate

teoma

Rule Path
Allow /
Disallow /en
Disallow /candidate

msnbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

exabot

Rule Path
Allow /
Disallow /en
Disallow /candidate

bingbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

googlebot

Rule Path
Allow /
Disallow /en
Disallow /candidate

applebot

Rule Path
Allow /
Disallow /en
Disallow /candidate

aolbuild

Rule Path
Allow /
Disallow /en
Disallow /candidate

deepcrawl

Rule Path
Allow /
Disallow /en
Disallow /candidate

ahrefsbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

twitterbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

bingpreview

Rule Path
Allow /
Disallow /en
Disallow /candidate

duckduckbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

ia_archiver

Rule Path
Allow /
Disallow /en
Disallow /candidate

facebot/1.0

Rule Path
Allow /
Disallow /en
Disallow /candidate

siteauditbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

adsbot-google

Rule Path
Allow /
Disallow /en
Disallow /candidate

semrushbot-sa

Rule Path
Allow /
Disallow /en
Disallow /candidate

semrushbot-ba

Rule Path
Allow /
Disallow /en
Disallow /candidate

semrushbot-si

Rule Path
Allow /
Disallow /en
Disallow /candidate

semrushbot-ct

Rule Path
Allow /
Disallow /en
Disallow /candidate

semrushbot-bm

Rule Path
Allow /
Disallow /en
Disallow /candidate

semrushbot-swa

Rule Path
Allow /
Disallow /en
Disallow /candidate

splitsignalbot

Rule Path
Allow /
Disallow /en
Disallow /candidate

archive.org_bot

Rule Path
Allow /
Disallow /en
Disallow /candidate

mediapartners-google

Rule Path
Allow /
Disallow /en
Disallow /candidate

screaming frog seo spider

Rule Path
Allow /
Disallow /en
Disallow /candidate

Other Records

Field Value
sitemap https://www.manpower.it/sitemap.xml
sitemap https://www.manpower.it/sitemap/italy/it-manpower/sitemap1.xml
sitemap https://www.manpower.it/sitemap/italy/it-manpower/sitemap2.xml
sitemap https://www.manpower.it/sitemap/italy/it-manpower/sitemap3.xml