Technology Fields
Speech/Audio Signal Processing
Machine Learning
Position Summary
Sony Group Corporation has an opening for a Research Engineer Intern in AI/ML
for speech processing including large-scale speech foundation modeling, diarization and synthesis in one of our R&D laboratories. The candidate selected for this position will be responsible for research and development in the areas of speech-based large language modeling, diarization, or speech synthesis with state-of-the-art AI/ML technologies, to contribute to Sony's businesses such as games, films, music and other products and services.
Responsibilities
Responsibilities include the investigation and utilization of complex engineering and mathematical principles to develop core technologies, compose algorithmic structures, and find solutions to problems. As a Research Engineer Intern, you will work on problems of diverse scope, where data analysis requires the evaluation of identifiable factors and the demonstration of good judgement in the selection of methods and techniques for finding solutions.
Required qualifications
■ Ph.D. Degree (graduated or currently pursuing) OR M.S. Degree (graduated or currently pursuing) in computer science, AI/ML, or a related field.
■ Research background in AI/ML, speech processing, and/or related areas.
■ Excellent neural network modeling and analysis skills are required, and the ability to formulate optimization flow using PyTorch, TensorFlow, or an equivalent deep learning framework is essential.
■ Excellent programming skills in Python and/or modern C++ for rapid algorithm prototyping.
Preferred qualifications
■ Qualification at the Ph.D. level or higher in computer science, AI/ML, or a related field.
■ 3 years of experience in computer science, AI/ML, or a related field.
■ Knowledge and experience in natural language processing is a plus.
Product, Service
Games (PlayStation games, smartphone applications, etc.), movies/music (content creation support), video analysis (broadcast content, online video, etc.), robots (aibo), financial services (human operation support and data analysis), etc.
Development Environment
■ OS: Windows and Linux
■ Programming Languages: Python, C/C++, MATLAB, etc.
■ PC, Server, Cloud Computing
Application Requirements
Essay: Not Required
Coding test: Not Required
Required Skills:
Audio Signal Processing, Machine Learning, Speech ProcessingOptional Skills: