alljobvacancies.com
robots.txt

Robots Exclusion Standard data for alljobvacancies.com

Resource Scan

Scan Details

Site Domain alljobvacancies.com
Base Domain alljobvacancies.com
Scan Status Ok
Last Scan 2024-06-02T11:20:08+00:00
Next Scan 2024-06-09T11:20:08+00:00

Last Scan

Scanned 2024-06-02T11:20:08+00:00
URL https://alljobvacancies.com/robots.txt
Domain IPs 104.21.80.52, 172.67.174.91, 2606:4700:3036::6815:5034, 2606:4700:3037::ac43:ae5b
Response IP 104.21.80.52
Found Yes
Hash ec8d90ea74185671548359fe4c51651ccb8fa2c367b800abf43cd54c44e1f144
SimHash 60014baef2b3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /linkout/
Disallow /recommended/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php

ninjabot

Rule Path
Allow /

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
sitemap https://alljobvacancies.com/sitemap_index.xml
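
Example

To illustrate how a crawler might interpret the groups listed above, here is a minimal sketch using Python's standard urllib.robotparser module. The robots.txt text is reconstructed from this report, so its exact wording, ordering, and capitalization are assumptions; the agent name ExampleBot and the path /jobs/ are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report. The original file's exact
# text, ordering, and capitalization are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /linkout/
Disallow: /recommended/
Disallow: /comments/feed/
Disallow: /trackback/
Disallow: /index.php
Disallow: /xmlrpc.php

User-agent: ninjabot
Allow: /

User-agent: mediapartners-google*
Allow: /

User-agent: googlebot-image
Allow: /wp-content/uploads/

User-agent: adsbot-google
Allow: /

User-agent: googlebot-mobile
Allow: /

Sitemap: https://alljobvacancies.com/sitemap_index.xml
"""

BASE = "https://alljobvacancies.com"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if this parser lets `agent` fetch `path` on the site."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "ExampleBot" and "/jobs/" are hypothetical; they do not appear in the report.
    checks = [
        ("ExampleBot", "/wp-admin/"),
        ("ExampleBot", "/jobs/"),
        ("ninjabot", "/wp-admin/"),
    ]
    for agent, path in checks:
        print(agent, path, allowed(agent, path))
    print(parser.site_maps())
```

With this parser, an agent that matches no named group falls back to the * group and is refused /wp-admin/, while ninjabot matches its own group and is allowed everywhere. Other parsers may resolve group matching and wildcards in agent names differently, so treat the result as one interpretation, not as the site's definitive policy. Note that site_maps() requires Python 3.8 or later.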