thabet.gg
robots.txt

Robots Exclusion Standard data for thabet.gg

Resource Scan

Scan Details

Site Domain thabet.gg
Base Domain thabet.gg
Scan Status Ok
Last Scan5/13/2025, 8:20:57 PM
Next Scan 5/27/2025, 8:20:57 PM

Last Scan

Scanned5/13/2025, 8:20:57 PM
URL http://thabet.gg/robots.txt
Redirect https://airjordan11.uk.com/robots.txt
Redirect Domain airjordan11.uk.com
Redirect Base airjordan11.uk.com
Domain IPs 162.255.119.164
Redirect IPs 104.21.68.184, 172.67.197.209, 2606:4700:3030::6815:44b8, 2606:4700:3036::ac43:c5d1
Response IP 104.21.68.184
Found Yes
Hash cca60f4272e286e8e7f14c255cb87b73e05245b94c52e1a3de961c3ee5ae37a0
SimHash af437842c901

Groups

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /search?q=*
Disallow /mobilenav/
Disallow /mobilebtmnav/
Disallow /bottomnav/
Disallow /nav/
Allow /wp-admin/admin-ajax.php
Disallow */attachment/*
Disallow /images/
Disallow *?replytocom
Disallow /page/
Allow /*.js$
Allow /*.css$

Other Records

Field Value
sitemap https://soilman.uk.com/sitemap_index.xml