Meta Pixel ALT
Remote allowed
HPC System Administrator
About the product

Intro

Our partner is one of the largest industrial companies specializing in producing construction materials. They are leaders in high-quality industrial products, exceptional customer experience, and innovation in the market.

We aim to enhance the product and development process through R&D, innovation, supply chain process improvement, and overall social environment.

Technology stack

Python, С, Perl

HPC Schedulers and Workload Managers (e.g., Slurm), Cluster provisioning tools (Warewulf, SaltStack), VMware, Parallel file systems (e.g., Lustre), Linux system

GCP, Cloud HPC

Infiniband networking, HPC system hardware maintenance, Linux file systems (e.g., Ext3), MPI, OpenMP

Your team

You will join a team of professionals passionate about delivering quality engineering work. Together with the team, you will manage the development, execution, and continuous support of data services, providing scalable data collection, near real-time analytics, offline analytics, distributed search, and utilizing AI and ML for security, engineering, and business analytics purposes.

Culture

We are committed to implementing high standards in the technology industry, and it is the fundamental principle of our work. To achieve this goal, the company and all its members are constantly evolving.

Each of us has the opportunity to contribute to the product, the company, the team, the industry, and our personal development. At Techstack, you have the option to choose from various growth opportunities that align with your interests:

- meetups, where you can share your knowledge and develop simultaneously by sharing your experiences within the company and local technical communities;

- roles such as a mentor, a technical expert, or a technical lead. In any of these roles, you will assist junior professionals and share your knowledge and experiences with them;

- participation in our technical Guilds, where you can engage in discussions about technical solutions, approaches, and industry trends.

All of these elements contribute to shaping the culture and expertise within both, our team and the company as a whole.

Your responsibilities

Oversee the smooth operation of the HPC cluster in support of various R&D initiatives.

Perform installation, testing, maintenance, upgrades and administration of operating system and application software.

Fine-tune system configuration for reliability and performance.

Perform file management and administration tasks, troubleshoot problems, ensure the system remains operational and assist with access to the system.

Analyze malfunctions, troubleshoot and resolve problems in response to system/security.

Implement system policies to adhere to relevant company policies and standards, recommend policies where applicable.

Research and recommend configurations for new systems based on vendor and industry trends and contacts.

Maintain up-to-date knowledge of the HPC hardware and management tools.

Perform account maintenance and user management activities.

It's about you

Have experience with Linux systems administration.

Have experience with network and security administration.

Understand of Linux file systems (e.g., Ext 3).

Have experience with cluster design and system tunings.

Ability to provide technical support to users.

Have experience with programming with modern languages and experience with parallel application software, protocols, tools and utilities.

Have experience with hardware maintenance of HPC systems.

Understand of parallel file systems (e.g., Lustre).

Have experience with deploying and managing virtual environments (e.g. VMware etc.).

Use HPC Schedulers and Workload Managers (e.g., Slurm).

Have skills in excellent problem identification and troubleshooting, system performance tuning.

Have excellent organizational and communication skills.

Have an upper-intermediate or higher level of English proficiency.

Ability to clearly communicate technical concepts to a non-technical audience.

It would be a plus if you have:

Experience with Management and Design of HPC systems, fundamentals of Infiniband networking.

Experience with cluster provisioning and configuration tools (Warewulf, SaltStack).

Knowledge in Application Parallelization (MPI and OpenMP).

Working knowledge of core programming languages (C, Python, Perl).

Working knowledge of parallel application installation, debugging, and support.

Experience with cloud HPC offerings from leading providers.

Experience with cloud-bursting.

What we have for you

Stable and long-term position in an experienced team.

Broad opportunities for professional and career growth, including professional challenges that encourage personal development, meetups, hackathons, professional communities, and more.

Direct communication with all stakeholders and the ability to influence product development.

Horizontal connections and absence of micromanagement, fostering a collaborative environment where all team members are accessible to each other for any concerns.

Hubs in Kharkiv, Kyiv, Lviv, and Wrocław (Poland) or everything necessary for remote work.

Up to 50% compensation for the cost of educational courses and conferences to support professional development.

Free English language and business English courses.

Legal and accounting support.

Appreciation gifts for significant events and occasions.

How to join Techstack

Pre-screening with Recruiter.

Expert review of your resume.

English check.

Interview with our experts.

Interview with our partner.

About us

Techstack is a technology product engineering company that sets an example for high development standards in the IT industry. We empower each team member to influence the development of the product, company, and processes.

Learn more about Techstack

Want to make an impact?

You're in the right place.

© 2024 Techstack. All rights reserved.
clutch icon
behance iconlinkedin iconinstagram iconclutch icon
behance iconlinkedin iconinstagram icon
© 2024 Techstack. All rights reserved.
clutch icon