Data Scientist (Biotechnology Domain)

Gene Solutions
366 Nguyễn Trãi, District 5, Ho Chi Minh
At office
Posted 3 days ago

Job description

Gene Solutions is a leading company in Vietnam, specializing in the research and development of advanced genetic testing for precision medicine in screening, diagnosis, and treatment of cancer. Over the years, Gene Solutions has consistently invested in building a strong and highly skilled R&D team, with prominent publications in related fields.

Job descriptions

We are seeking a bioinformatician to expand our expertise in bioinformatics data analysis and pipeline development. As a bioinformatician – nextflow developer in our Data team at Gene Solutions, you will be working in a dynamic, interdisciplinary and fast-paced research-oriented environment. You will collaborate closely with data scientists, machine learning engineers, biologists and immunologists to design, develop, implement and optimize bioinformatics pipelines, which use Nextflow, to process and analyze high-throughput next-generation sequencing (NGS) data. Key tasks also include maintaining and improving existing pipelines to ensure that all production pipelines meet optimal standards; such as pipeline performance, scalability, well-written documentation and reproducibility. You will contribute to multiple research projects, requiring not only a strong data management capability, but also excellent personal planning skills. If you are passionate about bioinformatics and eager to apply your expertise in genomic data science research, we strongly encourage you to apply and join our team. 

Key Responsibilities

  • Apply and maintain existing bioinformatics data pipelines and optimize them to specific problems depending on the requirement of the research projects. 
  • Conduct rigorous exploratory bioinformatics analysis using state-of-the-art methods to explore and extract insights from NGS data. Implement and deploy new bioinformatics pipeline. 
  • Responsible for the end-to-end development and deployment of a bioinformatics pipeline, including data pre-processing, data standardization, pipeline deployment on-premises server or cloud-based platforms, simple GUI design (if required, e.g Shiny apps, Flask apps), automation with CI/CD pipeline, data storage management.
  • Work with large and complex NGS data sets. 
  • Provide bioinformatics support and consultancy for other team members. Maintain close collaboration with a multidisciplinary team throughout the project to ensure a successful delivery of the new bioinformatics pipeline.
  • Communicate findings to Data team and R&D team in the form of weekly presentations and reports.  

Your skills and experience

  • A master's degree in bioinformatics, computer science, computational biology or related field with at least 1 year of experience, or equivalently, a bachelor's degree with +3 years of working experience in bioinformatics. Ph.D. is a plus. 
  • Solid knowledge (both theoretical motivation and applications) and strong hands-on experience in bioinformatics data analysis using common bioinformatics tools for NGS data. 
  • Strong proficiency and hands-on experience with developing data analysis code in R and/or python is a must. Nice to have: Hands-on experience with BASH scripting, awk/sed, or Perl.
  • Strong proficiency in developing bioinformatics pipeline using Nextflow (highly preferred) or other data pipeline orchestration tools. 
  • Solid understanding of common bioinformatics data formats and standard processing workflow. 
  • Solid knowledge in basic statistics, such as hypothesis testing and experiment design, is a must. 
  • Hands-on experience in working with Linux environments, high performance clusters (HPC) using Slurm scheduler. Experience with containerization tools, e.g docker, singularity. 
  • Excellent verbal and written communication skills to present bioinformatics data analysis findings with effective data visualizations in weekly multidisciplinary team meeting.
  • Nice to have: experience with cloud-based platforms (AWS).
  • Nice to have: background in machine learning, deep learning.  
  • Nice to have: experience working with nf-core pipeline standards. 
  • Nice to have: experience with GitHub/GitLab in collaboration.
  • Strong motivation, self-driven and curiosity to learn.

Why you'll love working here

  • 13th month salary. 
  • Holidays and full annual leave in accordance with State regulations.
  • Participate in all insurances (social insurance, health insurance, unemployment insurance), enjoy full policies and regimes according to the Labor Law.
  • Dynamic and humane working environment Annual trip.

Gene Solutions

View company

Gene Solutions

Company type
IT Product
Company industry
Healthcare
Company size
501-1000 employees
Country
Vietnam
Working days
Monday - Friday
Overtime policy
No OT

More jobs for you

Get similar jobs by email