immi-usa.com
robots.txt

Robots Exclusion Standard data for immi-usa.com

Resource Scan

Scan Details

Site Domain immi-usa.com
Base Domain immi-usa.com
Scan Status Ok
Last Scan2024-10-23T05:45:01+00:00
Next Scan 2024-11-22T05:45:01+00:00

Last Scan

Scanned2024-10-23T05:45:01+00:00
URL https://immi-usa.com/robots.txt
Domain IPs 104.26.8.125, 104.26.9.125, 172.67.71.26, 2606:4700:20::681a:87d, 2606:4700:20::681a:97d, 2606:4700:20::ac43:471a
Response IP 104.26.8.125
Found Yes
Hash ab0addec571c3f115d86717419f4bbd85a36b689330fade7720af1b44b0eb085
SimHash dd62cc089593

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/uploads/wpforms/
Disallow /readme.html

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
majestic
deepcrawl
rogerbot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.immi-usa.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • Allow internal-search pages marked noindex to be crawled
  • Disallow: /?s=
  • Disallow: /search/
  • ---------------------------
  • END YOAST BLOCK
  • Ban bots-----------------------------