Job Description We are seeking a software engineer to drive the implementation andperformance optimization of generative AI workloads on Intel GPUs as part ofthe OpenVINO GPU team. This role focuses on building high-performance, HW-aware software thatenables efficient execution of AI models on current and future Intel GPUarchitectures. You will work across multiple layers of the stack—AI models,runtime systems, and GPU hardware—and take ownership of complex performanceproblems that require deep technical insight and careful trade-off analysis. You will work on state-of-the-art AI models that push the limits of GPUperformance. Your work directly impacts real-world AI performance experiencedby developers and customers. About OpenVINO OpenVINO( https://github.com/openvinotoolkit/openvino )is a performance-focused AI inference runtime designed to efficiently executedeep learning models across Intel architectures. The GPU plugin is a core component of OpenVINO that bridges high-levelAI models and low-level GPU execution, covering areas such as graphtransformation, kernel dispatch, memory management, and hardware-specificoptimizations. The codebase is performance-critical, largely written in modern C++, andrequires strong understanding of system-level software design, debugging, andoptimization. What You Will Do Take technical ownership of performance-critical paths for generative AI workloads (e.g., LLMs, diffusion models) on Intel GPUs Analyze end-to-end execution of AI models to identify compute, memory, bandwidth, and parallelism bottlenecks Implement and optimize generative AI techniques, adapting state-of-the-art ideas to efficiently run on Intel GPU architectures Translate deep understanding of GPU hardware architecture into efficient, scalable, and maintainable software designs Optimize workloads for both current and future Intel GPU platforms, including hardware that is still under development Diagnose and resolve complex issues that span runtime, kernel, driver, and hardware boundaries Collaborate with global teams across software, hardware architecture, and validation to deliver optimized solutions Required Qualifications Computer science, computer engineering, or a related field with 3+ years of professional software engineering experience Strong programming skills in C and C++; working experience with Python Experience working with large and complex C++ codebases, with attention to performance, correctness, and maintainability Proven analytical thinking and strong problem-solving abilities, especially for ambiguous or under-specified problems Preferred Qualifications Experience with GPU programming or parallel computing, such as multi-threading, SIMD, or accelerator programming models Strong understanding of computer and GPU architecture, and how hardware characteristics impact software performance Technical understanding of generative AI models from a system and performance perspective Familiarity with AI runtimes or frameworks Solid foundation in computer science fundamentals, including data structures, algorithms, and operating systems Ability to communicate technical ideas clearly in written and spoken English Work Model This role follows a structured hybrid work model. The team regularly combines remote work and in-office collaboration, with a designated in-office days each week, while the remaining days are remote.
IT개발·데이터>소프트웨어·하드웨어>소프트웨어|IT개발·데이터>응용개발자>Linux|IT개발·데이터>응용개발자>C·C++|IT개발·데이터>응용개발자>AI개발|IT개발·데이터>웹개발자>Python