Latest Jobs  
 
 
 
Job Information
Job title

Sr. Search Engineer - Search Industry, Extracting Data, SQL

Company Cybercoders.com
Wage between $0.00 - $0.00 Annually
Location United States, California, Palo Alto
Employment type Full Time
Education Not Specified
Year Experience 4 - 5 Years of Practical Experience
Travel Not Specified
Published on 10/13/2009
Description

Sr. Search Engineer - Search Industry, Extracting Data, SQL - Web Crawling, Extracting semantic meaning from unstructured Web data

Skills Required - Vertical Search experience, Web Crawling/Mining, Extracting meaning from unstructured Web data, Unix command line, Java, XML, Regular Expressions, Algorithms for de-duplication/classification/clustering, High Performing SQL, Distributed Processing Frameworks (Nutch or Hadoop)

Sr. Search Engineer - Web Crawling, Search Industry, Java, XML, Regular Expression, High Performing SQL

Based in beautiful Palo Alto, CA, we are a fast-growing and well financed Internet company that specializes in providing a unique and much needed travel service.
Due to tremendous growth, we are looking to hire for an outstanding Sr. Search/Web Mining Engineer who possesses strong Web-tier development skills and a burning desire to create a great product. Candidates will have a direct impact on our core data acquisition and processing infrastructure enabling us to scale our search index to massive scale.

RESPONSIBILITIES:
Create a world-class Web data-mining system, capable of extracting meaning and structure from unstructured Web documents. Create a system that can efficiently and effectively discover, and understand travel-specific knowledge in hundreds of millions of Web Pages. The successful candidate will own the crawling and data mining algorithms and work with a systems architect to make them fault tolerant and highly scalable across distributed systems. A successful candidate will have previously worked in small start-up vertical search environment and have a desire to do so again.

Must Have Skills:
1.) 5 years hands on experience and world-class expertise in Web crawling and extracting structured information (semantic meaning) from millions of Web Pages (unstructured Web data). This experience is CRUCIAL and Non-negotiable. You should have strong experience in the SEARCH industry.
2.) Native speaker of the Unix command line, XML and regular expressions.
3.) Experience with algorithms for de-duplication, classification, clustering and with processes for iterative improvement using training data.
4.) Enjoys working in a collaborative team environment. Able to defend their architectural and design decisions with a very talented technical team.
5.) Ability to shift gears quickly in a start-up environment.

EXPERIENCE (ideal):
1.) Hands on experience using high performing SQL with very large data sets.
2.) 5+ years experience and world-class skills in Java development including intensive and highly performing SQL.
3.) Real-world experience with distributed processing frameworks systems such as Nutch, Hadoop.
4.) Is very familiar with common collaboration and code/build management tools such as SVN, Ant, Maven.

Nice to Have Skills):
1.) Rich experience with Web application technologies: Web services, XML, SOAP, SAX, Ruby on Rails, Active Record and/or J2EE technologies including EJB, Spring, Hibernate.

For your hard work, you will be rewarded with a strong offer ($90,000 - $140,000), Stock Options, excellent benefits, and other cool perks! Please apply immediately as we are conducting interviews this week and next week before making a decision. Local candidates preferred.

df-tc


Experience/Skills
See Above
 
Bookmark and Share