g2deal.com
robots.txt

Robots Exclusion Standard data for g2deal.com

Resource Scan

Scan Details

Site Domain g2deal.com
Base Domain g2deal.com
Scan Status Ok
Last Scan 2024-10-30T23:58:38+00:00
Next Scan 2024-11-29T23:58:38+00:00

Last Scan

Scanned 2024-10-30T23:58:38+00:00
URL https://g2deal.com/robots.txt
Redirect https://www.g2deal.com/robots.txt
Redirect Domain www.g2deal.com
Redirect Base g2deal.com
Domain IPs 3.233.122.99
Redirect IPs 3.233.122.99
Response IP 3.233.122.99
Found Yes
Hash 8b66b1b2ae9d0527350a91e340a5a4e4719bdfb29fda20a26135983a1a416770
SimHash 3307e01047a0

Groups

*

Rule Path
Allow /

amazon-kendra

Product Comment
amazon-kendra Amazon Kendra Web Crawler
Rule Path Comment
Disallow / disallow access to any pages

petalbot
appinsights
semrushbot
semanticscholarbot
dotbot
whatcms
rogerbot
trendictionbot
blexbot
linkfluence
magpie-crawler
mj12bot
mediatoolkitbot
aspiegelbot
domainstatsbot
cincraw
nimbostratus
httrack
serpstatbot
omgili
grapeshotcrawler
megaindex
semanticbot
cocolyzebot
domcopbot
traackr
bomborabot
linguee
webtechbot
domainstatsbot
clickagy
sqlmap
internet-structure-research-project-bot
seekport
awariosmartbot
onalyticabot
buck
riddler
sbl-bot
df bot 1.0
pubmatic crawler bot
bvbot
sogou
barkrowler
admantx
adbeat
embed.ly
semantic-visions
voluumdsp
wc-test-dev-bot
gulperbot

Rule Path
Disallow /
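
As a rough illustration of how the groups above are evaluated, the sketch below feeds an abbreviated reconstruction of the file to Python's standard urllib.robotparser. The ROBOTS_TXT text is an assumption pieced together from the listed groups (only three of the blocked agents are included), and "examplebot" is a hypothetical crawler name not present in the scan.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the groups listed above. This is an
# assumption for illustration: the scanned file names many more agents
# in the final group, and its exact layout is not shown in the report.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: amazon-kendra
Disallow: /

User-agent: petalbot
User-agent: semrushbot
User-agent: mj12bot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
URL = "https://www.g2deal.com/"

# "examplebot" is hypothetical; it matches no named group, so it
# falls back to the "*" group.
results = {
    agent: parser.can_fetch(agent, URL)
    for agent in ("examplebot", "amazon-kendra", "semrushbot")
}

for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Agents named in a group follow that group's rules, while any other agent falls back to the "*" group. Note that urllib.robotparser matches agent names by substring, which may differ in edge cases from how a given crawler interprets the file.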

Other Records

Field Value
sitemap https://www.g2deal.com/sitemap.xml
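
The sitemap record can be read back with the same standard-library parser. This is a minimal sketch: it assumes the record appears in the file as a plain "Sitemap:" line and requires Python 3.8 or later for site_maps().

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the record as it would appear in the file itself.
SITEMAP_RECORD = "Sitemap: https://www.g2deal.com/sitemap.xml"

sitemap_parser = RobotFileParser()
sitemap_parser.parse([SITEMAP_RECORD])

# site_maps() returns a list of sitemap URLs, or None if none were found.
sitemaps = sitemap_parser.site_maps()
print(sitemaps)
```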