supplygem.com
robots.txt

Robots Exclusion Standard data for supplygem.com

Resource Scan

Scan Details

Site Domain supplygem.com
Base Domain supplygem.com
Scan Status Ok
Last Scan2024-11-16T06:03:55+00:00
Next Scan 2024-12-16T06:03:55+00:00

Last Scan

Scanned2024-11-16T06:03:55+00:00
URL https://supplygem.com/robots.txt
Domain IPs 104.16.150.108, 104.16.151.108, 2606:4700::6810:966c, 2606:4700::6810:976c
Response IP 104.16.150.108
Found Yes
Hash 9acfd2a34c23eb0ac01480c20a1d86bc79bf23e144de4478217c74cbd138947b
SimHash 1af1dc12c802

Groups

*

Rule Path
Disallow /?s=
Disallow /search/
Disallow /wp-admin
Disallow /*/feed/
Disallow /wp-login.php
Disallow /wp-admin/
Disallow /visit/
Disallow /share/
Disallow /trackback/
Disallow /wp-register.php
Allow /wp-admin/admin-ajax.php

spbot
exabot
gigabot
superbot
superhttp
netspider
website\ extractor
website\ quester
webstripper
webwhacker
surfbot
oncrawl
deepcrawl
rytebot
dotbot
claritybot
rogerbot
sitebulb
semrushbot
semrushbot-sa
semrushbot-ba
semrushbot-si
semrushbot-swa
semrushbot-ct
semrushbot-bm
semrushbot-seoab
semrush
mj12bot
xenu
searchmetricsbot
sistrix crawler
seokicks-robot
ia_archiver
archive.org_bot
ia_archiver-web.archive.org
ravencrawler
blexbot
seo-powersuite-bot

Rule Path
Disallow /

googlebot

Rule Path
Allow *.js
Allow *.css

Other Records

Field Value
sitemap https://supplygem.com/sitemap_index.xml