Join Us: Revolutionise AI/ML with Renewable Energy at Elemental Compute 🌍⚡
Elemental Compute is on a mission to build the cheapest, fastest, cleverest, greenest, and kindest AI computer platform in the world. We're constructing a distributed renewable energy-powered supercomputer with innovative heat recovery and grid balancing capabilities. Are you driven to make a positive impact through AI/ML in a fast-paced, hands-on environment? If so, we want you on our team! Based in Birmingham, we offer a mixture of hybrid and fully office-based roles.
About Us
At Elemental Compute, we believe in harnessing the power of renewable energy to push the boundaries of AI and machine learning. Our state-of-the-art platform is designed to support groundbreaking research and applications, all while maintaining a commitment to sustainability and social responsibility.
Why Join Us?
- Innovative Mission: Be part of building a distributed supercomputer that leverages renewable energy for a greener future.
- Cutting-Edge Technology: Work with the latest advancements in HPC, NVMe-oF, DPDK networking, SPDK storage, and distributed inference.
- Dynamic Team: Collaborate with top-tier experts in AI/ML, hardware optimization, and large language model (LLM) routing, including multimodal vision and audio.
- Positive Impact: Contribute to a project designed to make AI more accessible, efficient, and environmentally friendly.
- Ethical Commitment: We are dedicated to ensuring our platform is not used for gambling, adult content, weapons development, or any socially harmful activities. We actively drive and encourage the usage of our technology for positive societal impact.
- Competitive Compensation: Very competitive salary and early valuable share options.
- Flexible Work Environment: Based in Birmingham, we offer both hybrid and fully office-based roles to suit your needs.
We’re Looking For:
Machine Learning / AI
We are looking for AI/ML experts with a passion for cutting-edge technologies. Backgrounds might include:
- AI/ML Experts: Deep knowledge of machine learning algorithms, model training, deployment, and optimization.
- Vision Models and Generative AI: Knowledge in computer vision models and generative AI.
- LLM Optimizations: Expertise in optimizing large language models, including routing and mixture of agents, quantization, and fine-tuning (LoRA).
- Inference Servers: Experience with platforms like RedisAI, OpenVINO Model Server, ONNX Runtime, TorchServe, TensorFlow Serving, and multi-model, multi-GPU, and multi-node deployments.
- AI Frameworks and Libraries: Proficiency in TensorFlow, PyTorch, JAX, and other AI frameworks.
- Specialized AI Roles: Experience with models like LLaMA3, SDXL, DeepSeek, Nemotron, and Phi3 AI agents.
- Self-Play Optimization: Expertise in self-play techniques for training AI agents.
- NeRF (Neural Radiance Fields): Experience in creating immersive content using NeRF and Gaussian Splatting.
- Chip Design Engineers (AI): Experience in designing custom chips for AI workloads, ASICs, and FPGAs.
- Model Parallelization Experts: Knowledge of different approaches to model parallelization and distributed training.
Hardware, Storage, and Networking
We are searching for hardware enthusiasts who can build, optimize, and benchmark our computers for energy efficiency and performance. Ideal candidates will have:
- Hardware Building and Optimization: Experience in assembling and optimizing computer hardware, flashing drivers, and BIOS optimizations.
- Programming Skills: Proficiency in C++, Python, and CUDA for software development and hardware interfacing.
- Benchmarking: Skills in performing and analyzing benchmarks to improve system performance and energy usage.
- Storage Systems: Experience with storage solutions like Lustre, Ceph, Red Hat, Ubuntu, and CentOS.
- Networking: Expertise in NVMe-over-Fabric (NVMe-oF), Data Plane Development Kit (DPDK) networking, SPDK storage, RDMA, RoCE, and InfiniBand networking (Mellanox experience is a plus).
Linux Sys Admin / Security
We are looking for Linux system administrators with a focus on security and networking. Candidates will need:
- Linux Skills: Proficiency in managing and optimizing Linux-based systems.
- Security Expertise: Knowledge in securing Linux systems and networks.
- Networking: Familiarity with high-speed networking technologies.
- Infrastructure Management: Experience in maintaining and troubleshooting large-scale computing environments.
ML Ops
We are seeking professionals experienced in integrating and automating software and machine learning workflows. Candidates should have:
- DevOps & MLOps: Experience in integrating and automating software and machine learning workflows.
- Containerization and Orchestration: Familiarity with Ray Server, Kubernetes, Docker, and other container technologies.
- Infrastructure as Code: Proficiency in tools like Terraform and Ansible.
- CI/CD Pipelines: Experience in setting up and managing continuous integration and deployment pipelines.
Desired Skills and Experience
Candidates should have deep experience in a couple of the following areas and a passion for more, ideally evidenced by projects in their own time:
- Hardware Optimization: Proficiency in NVMe-over-Fabric (NVMe-oF), Data Plane Development Kit (DPDK) networking, and Storage Performance Development Kit (SPDK).
- Distributed Inference: Experience with Nvidia Triton Server, TensorRT, and distributed inference models.
- Home Labs: Experience with multi-GPU setups, VLLM, DeepSpeed, and other advanced configurations.
- Distributed Data Centres: Interest in designing and managing distributed data centres with heat recovery for buildings and agriculture.
- Containerization and Orchestration: Experience with Ray Server, Kubernetes, Docker, and containerized workloads.
- Active Community Involvement: Strong presence in the AI/ML community, with active GitHub and Twitter profiles.
- Vision Models & Generative AI: Experience with computer vision models, generative AI, and 3D modeling.
- Programming and Scripting: Proficiency in C++, Python, TensorFlow, and PyTorch.
- Low-latency Networking: Expertise in low-latency networking for high-performance applications.
- BIOS and Firmware: Experience with BIOS optimizations and Coreboot.
Responsibilities
- Design, implement, and optimize AI/ML systems powered by renewable energy.
- Collaborate with cross-functional teams to ensure seamless integration of hardware and software.
- Drive innovation in machine learning techniques and infrastructure.
- Contribute to research and development initiatives aimed at improving efficiency and performance.
What We Offer
- Competitive Salary: Very competitive salary and early valuable share options.
- Growth Opportunities: Continuous learning and development in a cutting-edge field.
- Impactful Work: Play a pivotal role in a project with significant environmental and societal benefits.
- Collaborative Environment: Work alongside passionate professionals committed to making a difference.
- Flexible Work Environment: Based in Birmingham, with options for hybrid and fully office-based roles.
How to Apply
Are you ready to make a meaningful impact? Join us at Elemental Compute and help us build the future of AI and machine learning. Apply now by sending your resume, a cover letter detailing your experience and passion for this role, and links to your GitHub, Twitter and or relevant profiles to jobs@elementalcompute.com
Elemental Compute - Building the Future of AI with Renewable Energy 🌱💡