Posted 1y ago

Internship - Research Engineer for Speech/Audio Signal Processing_ICASSP

@ Sony
Tokyo, Japan
Onsite, Full Time
Responsibilities: Developing core technologies, composing algorithmic structures, evaluating identifiable factors
Requirements Summary: Ph.D. or M.S. in computer science, AI/ML, or related field; research background in AI/ML and speech processing; excellent programming skills in Python and/or C++.
Technical Tools Mentioned: PyTorch, TensorFlow, Python, C++, MATLAB
Job Description

Technology Fields

Speech/Audio Signal Processing
Machine Learning

Position Summary

Sony Group Corporation has an opening for a Research Engineer Intern in AI/ML for speech processing, including large-scale speech foundation modeling, diarization, and synthesis, in one of our R&D laboratories. The candidate selected for this position will be responsible for research and development in the areas of speech-based large language modeling, diarization, or speech synthesis with state-of-the-art AI/ML technologies, to contribute to Sony's businesses such as games, films, music and other products and services.

Responsibilities

Responsibilities include the investigation and utilization of complex engineering and mathematical principles to develop core technologies, compose algorithmic structures, and find solutions to problems. As a Research Engineer Intern, you will work on problems of diverse scope, where data analysis requires the evaluation of identifiable factors and the demonstration of good judgement in the selection of methods and techniques for finding solutions.

Required qualifications

■ Ph.D. Degree (graduated or currently pursuing) OR M.S. Degree (graduated or currently pursuing) in computer science, AI/ML, or a related field.
■ Research background in AI/ML, speech processing, and/or related areas.
■ Excellent neural network modeling and analysis skills are required, and the ability to formulate an optimization flow using PyTorch, TensorFlow, or an equivalent deep learning framework is essential.
■ Excellent programming skills in Python and/or modern C++ for rapid algorithm prototyping.

Preferred qualifications

■ Qualification at the Ph.D. level or higher in computer science, AI/ML, or a related field.
■ 3 years of experience in computer science, AI/ML, or a related field.
■ Knowledge of and experience in natural language processing are a plus.

Product, Service

Games (PlayStation games, smartphone applications, etc.), movies/music (content creation support), video analysis (broadcast content, online video, etc.), robots (aibo), financial services (human operation support and data analysis), etc.

Development Environment

■ OS: Windows and Linux
■ Programming Languages: Python, C/C++, MATLAB, etc.
■ PC, Server, Cloud Computing

Application Requirements

Essay: Not Required

Coding test: Not Required

Required Skills:

Audio Signal Processing, Machine Learning, Speech Processing

Optional Skills: