Hiring city: Singapore
Job responsibilities:
Business Unit
What the Role Entails
Responsibilities:
1. Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.
2. Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.
3. Explore next-generation RL paradigms that more directly and effectively learn from environment feedback.
Who We Look For
Requirements:
1. Currently enrolled as a PhD student in Computer Science or a closely related field.
2. Demonstrated strong research capability, with publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, SIGGRAPH.
3. Strong hands-on programming skills, with solid experience in deep learning system implementation, model training and inference optimization, CPU/GPU acceleration, and distributed training and inference.
4. Prior experience with…
Hiring city: Singapore
…provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails
Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.
Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.
Explore next-generation RL paradigms that more directly and effectively learn from environment feedback.
Who We Look For
Currently enrolled as a PhD student in Computer Science or a closely related field.
Demonstrated strong research capability, with publications in top-tier conferences such as ICML, NeurIPS…
Hiring city: Singapore
…management and data centers, TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
What the Role Entails
1. Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.
2. Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.
3. Explore next-generation RL paradigms that more directly and effectively learn from environment feedback.
Who We Look For
1. Currently enrolled as a PhD student in Computer Science or a closely related field.
2. Demonstrated strong research…