Abdeljalil EL MAJJODI

MACHINE LEARNING ENGINEER

Specializing in advanced AI model development with a focus on cultural linguistics and bridging the gap between theoretical research and real-world needs.

Who I Am

A machine learning engineer specializing in advanced AI model development with a focus on cultural linguistics. I design and implement sophisticated neural networks that capture the nuances of regional dialects and deliver practical solutions to complex challenges.

Driven by a passion for transforming cutting‑edge technology into impactful applications that bridge the gap between theoretical research and real‑world needs.

Phone

(+212) 651513594

GitHub

Elma-dev

Skills

Programming

C C++ Python Java Linux shell JavaScript

Web Development

HTML CSS Angular Node.js

Data Science

Pandas NumPy Matplotlib Scikit-Learn PyTorch

Web Services

SOAP REST GraphQL gRPC

DBMS

MySQL SQL Server PostgreSQL Neo4j

DevOps & Cloud

Git Docker Kubernetes AWS

Big Data

HDFS Map Reducer Kafka Spark

ML/DL

Transformers LLMs LangChain LlamaIndex

Frameworks

Spring Boot Spring Cloud Spring Security

Experience

Machine Learning Engineer

August 2024 - Present

Sawalni, Remote

  • Development of Darija Language Models: Led the design, development, and fine‑tuning of large language models specifically tailored for the Darija dialect.
  • Data Collection and Analysis: Designed and implemented efficient data pipelines using Python and Pandas to collect and preprocess data for machine learning tasks.
  • Pretraining and Fine‑Tuning LLMs: Pretrained and fine‑tuned large language models using PyTorch, Transformers (Hugging Face), and Langchain, enhancing model performance for language understanding and text generation.
  • Model Evaluation and Optimization: Employed tools like scikit‑learn and Hugging Face datasets for benchmarking and performance tuning, improving accuracy and computational efficiency.
  • Cross‑Functional Collaboration: Worked closely with data scientists and linguists to ensure that models reflected the linguistic nuances of Darija and aligned model outputs with project goals.

Software Engineer

February 2024 - August 2024

XcomSolution, Mohammedia

  • Design and Develop of a Digital Marketing Automation.
  • Microservices Architecture: Developed and implemented a robust and scalable microservices architecture to meet stringent performance and flexibility requirements.
  • Full‑Stack Development: Contributed to both backend and frontend development, ensuring seamless integration and functionality across the platform.
  • Cross‑Functional Collaboration: Partnered with business teams to align technical solutions with operational needs, ensuring successful project outcomes.

Junior Data Scientist

July 2023 - October 2023

JPTRACK: JP&Co, Casablanca

  • Developed Advanced Fuel Consumption Tracking Systems.
  • Algorithm Design: Engineered and deployed custom fuel consumption algorithms optimized for various vehicle types.
  • Theft Detection: Created machine learning models to detect and prevent fuel theft, significantly enhancing security measures.
  • Data Analysis: Performed detailed analysis to optimize fuel efficiency and reduce operational costs, contributing to more sustainable operations.
  • Client Collaboration: Worked closely with clients and engineering teams to tailor solutions, ensuring successful implementation and satisfaction.

Software Developer and Data Scientist

May 2024 - Present

AtlasAI

  • Building the next generation of Moroccan AI Models.
  • LLM Development: Creation and refinement of large language models tailored specifically for Darija, the Moroccan Arabic dialect, ensuring cultural and linguistic relevance.
  • Data Collection Platforms: Designed and developed platforms to efficiently collect and preprocess data, providing a strong foundation for training high‑quality LLMs.
  • Collaborative Innovation: Worked closely with a multidisciplinary team of data scientists and engineers to integrate these AI solutions into various applications, driving forward the AI capabilities of the organization.

Projects

Al-Atlas: Moroccan Darija Language Model

Jan 2025 - Mar 2025

Developed Al‑Atlas, a 0.5B parameter language model, the first dedicated foundation model for Moroccan Darija, fine‑tuned from Qwen‑2.5.

  • Dataset Curation: Collected and preprocessed a high‑quality dataset of 155M tokens
  • Cultural & Linguistic Impact: Designed to enhance NLP applications for Moroccan Arabic
Huggingface Transformers PyTorch
View Project

TODa: Tamazight Open Dataset

Nov 2024 - Jan 2025

Conceptualized and developed a groundbreaking open‑source project to preserve and advance the Tamazight language.

  • Extensive linguistic dataset for NLP applications
  • Preservation of Tamazight language
Pandas Pyplot Huggingface
View Project

Tarjman-AI: Moroccan Chat-bot

May 2024 - May 2024

Developed a multilingual question‑answering platform that allows Moroccan users to interact with advanced LLMs in native languages.

  • Supports Darija and other native languages
  • Advanced question-answering capabilities
LLMs Postgres Docker LangChain
View Project

EmbedPrepro: Text Analysis Library, CLI

Apr 2024 - May 2024

Created a command‑line tool and library designed for text analysis tasks, including embedding, clustering, dimensionality reduction, and visualization.

Python Click PyPi
View Project

Ask Documents

Apr 2024 - Apr 2024

Developed an advanced Retrieval‑Augmented Generation (RAG) system for document‑based question answering, enhancing information retrieval and response accuracy.

Ollama LlamaIndex ngrok LLMs
View Project

Secure Health Data Storage & Skin Cancer Classifier

March 2023 - March 2023

Developed a decentralized system using blockchain technology for securely storing patient health data and a skin cancer classifier.

  • Blockchain-based health data storage
  • Skin cancer classification model
Angular Solidity IPFS
View Project

Education

Master's of Artificial Intelligence & Distributed Systems

2022 - 2024

HASSAN II University, Higher Normal School of Technical Education (ENSET), Mohammedia, Morocco

Bachelor's of Mathematics and Computer Science

2021 - 2022

IBN Zohr University, Faculty of Sciences, Agadir, Morocco

Baccalaureate in Mathematics Sciences

2018 - 2019

Dakhla High School Ouled Berhil, Taroudant, Morocco

Languages

Amazigh Native
Arabic Native
English Advanced
French Intermediate

Hobbies

Coding
Gaming
Reading
Photography
Cooking
GYM

Publications

Atlaset Dataset for Moroccan Darija: From Data Collection, Analysis, to Model Trainings

Read Publication

Finding Moroccan Arabic (Darija) in Fineweb 2

Read Publication

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

Read Publication

Awards

Decentralized Health Data Storage System

Developed a decentralized and secure storage system for patient health data and created AI Models that help doctors to discover complex diseases from the stored data.

Get In Touch

Contact Information

Location

Mohammedia, Morocco

Phone

(+212) 651513594

Social Media

Made with DeepSite LogoDeepSite - 🧬 Remix