Việc làm này đã được thêm vào mục Việc làm đã lưu.
Bạn đã lưu tối đa 20 việc làm. Nếu bạn muốn lưu mới, hãy cập nhật Việc làm đã lưu.
Mô tả công việc
Your responsibilities will cover the entire lifecycle of an AI model, from receiving it from the research team to ensuring its efficient operation on the end device.
1. AI Model Optimization & Deployment:
Analyze trained AI models (from TensorFlow, PyTorch) to identify performance bottlenecks and resource requirements (memory, compute).
Convert models into inference-optimized formats such as ONNX, TensorRT, and TFLite.
Apply advanced optimization techniques, including quantization (INT8/FP16) and pruning, to reduce model size and accelerate processing speed while maintaining accuracy.
Profile and benchmark model performance on target hardware (e.g., NVIDIA Jetson, ARM CPUs) to ensure latency and throughput criteria are met.
2. Application Software Development:
Build high-performance applications and libraries in C++/Python to load, manage, and execute AI models in both Linux and Windows environments.
Develop end-to-end data processing pipelines, from pre-processing input data (images, video) to post-processing model outputs.
Create and maintain unit and integration tests to ensure the stability and accuracy of AI features.
3. Research & Improvement:
Stay current with the latest technologies, algorithms, and tools in embedded AI and efficient machine learning.
Research and experiment with new AI models to evaluate their feasibility and potential for product application.
Participate in troubleshooting, debugging, and continuously improving deployed AI systems to enhance performance and reliability.
4. Collaboration & Technical Support:
Collaborate closely with AI/ML scientists to understand model architectures and deployment requirements.
Work with hardware engineering teams to leverage specialized on-chip acceleration features.
Support other teams (such as QA and Product) in integrating and testing AI solutions.
Yêu cầu công việc
Essential Qualifications:
- 3+ years of relevant professional experience.
- Bachelor's or Master’s degree in Computer Science, Computer Vision, or a closely related technical field.
- Proficiency in programming languages such as C/C++, Python, and/or CUDA.
- Hands-on experience with edge device deployment technologies (e.g., TensorRT, TFlite, ONNX).
- Experience in developing and deploying applications on Linux and Windows operating systems.
- Solid understanding and practical experience with prominent AI frameworks (e.g., TensorFlow, PyTorch) for model development and training.
- Experience or knowledge of deploying AI models directly onto specialized hardware chips.
- Good command of the English language, both written and verbal.
Preferred Qualifications:
Relevant academic coursework, significant projects, or internships in Artificial Intelligence/Machine Learning or hardware optimization.
Active participation in technology communities, open-source contributions, or successful hackathon experiences.
Tại sao bạn sẽ yêu thích làm việc tại đây
- Compensation & bonus: 13th & 14th salary, AIP bonus, Holidays, Tet, and Long year service …
- Social insurance, Health insurance, Unemployment insurance: by Social Insurance and Labor Law
- The regime of annual leave, company trip, and checkup examination
- Award for marriage, newborn
- We have AON insurance package for employee, spouse, and children every year
- You will be trained, learned & work with the best technical managers who help you improve various dev skills & career path
- You’ll love working in our dynamic environment employees, young & active
- We love sport activities, as marathon, football, swimming,...
- Working time: From Monday to Friday | 08:30-12:00 & 13:00-17.30
Solving for safer !