gearnuke.com
robots.txt

Robots Exclusion Standard data for gearnuke.com

Resource Scan

Scan Details

Site Domain gearnuke.com
Base Domain gearnuke.com
Scan Status Ok
Last Scan2024-09-25T16:32:05+00:00
Next Scan 2024-10-02T16:32:05+00:00

Last Scan

Scanned2024-09-25T16:32:05+00:00
URL https://gearnuke.com/robots.txt
Redirect https://www.gearnuke.com/robots.txt
Redirect Domain www.gearnuke.com
Redirect Base gearnuke.com
Domain IPs 104.21.94.167, 172.67.138.68, 2606:4700:3032::6815:5ea7, 2606:4700:3037::ac43:8a44
Redirect IPs 104.21.94.167, 172.67.138.68, 2606:4700:3032::6815:5ea7, 2606:4700:3037::ac43:8a44
Response IP 172.67.138.68
Found Yes
Hash 4bda34cccd0364031d2b453d2168e981eebf87a508136478b36d5b635ef49338
SimHash 431dc2529aba

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /widgets/
Disallow /page/
Disallow /page
Disallow /constellationwidget/
Disallow /mediaapi/
Disallow /wp_cron.php
Disallow /xmlrpc.php
Disallow /.well-known/amphtml/apikey.pub
Disallow /search/*
Disallow /tag/*
Disallow /user/*
Disallow /profile/*
Disallow /taxonomy/*
Disallow /filter/*
Disallow /custom-home
Disallow /wp-json/*
Disallow /profiles/*
Disallow /?*
Disallow /*?*

ahrefsbot
semrushbot
dotbot
mauibot
mj12bot
claudebot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.gearnuke.com/sitemap.xml
sitemap https://www.gearnuke.com/googlenews.xml