gearnuke.com
robots.txt

Robots Exclusion Standard data for gearnuke.com

Resource Scan

Scan Details

Site Domain gearnuke.com
Base Domain gearnuke.com
Scan Status Ok
Last Scan2024-11-13T16:33:51+00:00
Next Scan 2024-11-20T16:33:51+00:00

Last Scan

Scanned2024-11-13T16:33:51+00:00
URL https://gearnuke.com/robots.txt
Redirect https://www.gearnuke.com/robots.txt
Redirect Domain www.gearnuke.com
Redirect Base gearnuke.com
Domain IPs 104.21.94.167, 172.67.138.68, 2606:4700:3032::6815:5ea7, 2606:4700:3037::ac43:8a44
Redirect IPs 104.21.94.167, 172.67.138.68, 2606:4700:3032::6815:5ea7, 2606:4700:3037::ac43:8a44
Response IP 172.67.138.68
Found Yes
Hash 19bed9eacf6f21631fc95d9a45e5fff9846416afa915f1d714b0e6995b8bffed
SimHash 431dca529aba

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /widgets/
Disallow /page/
Disallow /page
Disallow /constellationwidget/
Disallow /mediaapi/
Disallow /wp_cron.php
Disallow /xmlrpc.php
Disallow /.well-known/amphtml/apikey.pub
Disallow /search/*
Disallow /tag/*
Disallow /user/*
Disallow /profile/*
Disallow /taxonomy/*
Disallow /filter/*
Disallow /custom-home
Disallow /wp-json/*
Disallow /profiles/*
Disallow /?*
Disallow /*?*

ahrefsbot
semrushbot
dotbot
mauibot
mj12bot
claudebot

Rule Path
Disallow /

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.gearnuke.com/sitemap.xml
sitemap https://www.gearnuke.com/googlenews.xml