infohotjob.com
robots.txt

Robots Exclusion Standard data for infohotjob.com

Resource Scan

Scan Details

Site Domain infohotjob.com
Base Domain infohotjob.com
Scan Status Ok
Last Scan2025-12-19T04:26:43+00:00
Next Scan 2025-12-26T04:26:43+00:00

Last Scan

Scanned2025-12-19T04:26:43+00:00
URL https://infohotjob.com/robots.txt
Domain IPs 104.21.27.63, 172.67.141.195, 2606:4700:3030::6815:1b3f, 2606:4700:3037::ac43:8dc3
Response IP 172.67.141.195
Found Yes
Hash e7bb823bada31c2e80adff2b1b2ec436fa770c29df35c09f52c2ead6c7d3d027
SimHash 68455f43cac6

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /feed
Disallow /*/feed
Disallow /comments
Disallow /author
Disallow /tag
Disallow /archives
Disallow /iframes
Disallow /privacy-policy.html
Disallow /web-site-agreement.html
Disallow /category/*/*
Disallow /page/*/*
Disallow */trackback%5C

googlebot

Rule Path
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.tar$
Disallow /*.tgz$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow */feed/
Disallow */trackback/
Allow /feed/
Disallow /*?*
Disallow /*?

googlebot-image

Rule Path
Allow /*
Disallow

googlebot

Rule Path
Allow /*
Disallow

mediapartners-google*

Rule Path
Allow /*
Disallow

adsbot-google

Rule Path
Allow /*
Disallow

googlebot-mobile

Rule Path
Allow /*
Disallow

Other Records

Field Value
sitemap https://infohotjob.com/sitemap.xml

Comments

  • The Googlebot is the main search bot for google
  • Disallow all files ending with these extensions
  • Disallow Google from parsing indididual post feeds and trackbacks..
  • Disallow all files with ? in url
  • The Googlebot-Image is the image bot for google
  • Allow Everything
  • This is the ad bot for google