fra.proz.com
robots.txt

Robots Exclusion Standard data for fra.proz.com

Resource Scan

Scan Details

Site Domain fra.proz.com
Base Domain proz.com
Scan Status Ok
Last Scan2024-04-21T10:56:41+00:00
Next Scan 2024-05-21T10:56:41+00:00

Last Scan

Scanned2024-04-21T10:56:41+00:00
URL https://fra.proz.com/robots.txt
Domain IPs 104.26.6.235, 104.26.7.235, 172.67.73.12, 2606:4700:20::681a:6eb, 2606:4700:20::681a:7eb, 2606:4700:20::ac43:490c
Response IP 172.67.73.12
Found Yes
Hash bb86636ebb438dc79990626c39bfd1f60146309632040030b1251f32320c4c2c
SimHash cf474c94d2d2

Groups

mediapartners-google*

Rule Path
Disallow

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /crawler-pit/
Disallow /profile$
Disallow /profile/$
Disallow /profile?
Disallow /profile/?
Disallow /translator/2372$
Disallow /profile/127329$
Disallow /?sp=login
Disallow /?sp=404
Disallow /translation-news/wp-admin
Disallow

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://www.proz.com/sitemap__index_Main.xml.gz
sitemap https://www.proz.com/sitemap__index_SoftwareComparisonTool.xml.gz
sitemap https://www.proz.com/sitemap__index_Kudoz.xml.gz
sitemap https://www.proz.com/sitemap__index_Forums.xml.gz
sitemap https://www.proz.com/sitemap__index_Businesses.xml.gz
sitemap https://www.proz.com/sitemap__index_BlueBoard.xml.gz
sitemap https://www.proz.com/sitemap__index_Profiles.xml.gz
sitemap https://www.proz.com/sitemap__index_ClassicalJobPostings.xml.gz
sitemap https://www.proz.com/sitemap__index_TranslatorDirectory.xml.gz