Amine Bousmah
Data & AI Engineer
I turn data into decisions. I shape messy data into simple answers, work closely with teams, and ship things that truly help people.
Technical Skills
Data Engineering & Analytics
Foundations for reliable data systems.
Machine Learning & Modeling
Pragmatic ML with strong evaluation discipline.
BI & Data Visualization
Make results clear, trusted, and actionable.
Application Design & API
Product-minded developer focused on clean, secure services.
Cloud & DevOps
Ship small, observe, and iterate.
Data Finance & Revenue Analytics
Applied analytics for markets, risk, and growth.
Selected Projects
Vinted Extension: Smart auto-repost to boost visibility
Browser extension that automatically republishes listings to leverage Vinted's algorithmic boost. Features safe scheduling, anti-duplicate logic and local anti-tracking to maximize views and click-throughs without manual effort.
Key results
Technical implementation
- Chrome Extension (content + background service worker)
- Task scheduler, de-duplication & cooldown management
- Local headers/cookies handling; optional Express helper
- Image helpers (crop/compress) when needed
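
To illustrate the de-duplication and cooldown bullets above, here is a minimal sketch in Python. It is not the extension's code (which runs in the browser as JavaScript); `RepostScheduler`, its method names and the one-hour cooldown in the usage note are all hypothetical and only show the idea of skipping duplicate ids and listings reposted too recently.

```python
from dataclasses import dataclass, field


@dataclass
class RepostScheduler:
    """Illustrative only: decides which listings may be reposted right now."""

    cooldown_seconds: float
    # listing_id -> timestamp (in seconds) of the last repost
    last_repost: dict = field(default_factory=dict)

    def should_repost(self, listing_id: str, now: float) -> bool:
        last = self.last_repost.get(listing_id)
        # Eligible if never reposted, or if the cooldown window has elapsed.
        return last is None or now - last >= self.cooldown_seconds

    def record_repost(self, listing_id: str, now: float) -> None:
        self.last_repost[listing_id] = now

    def due(self, listing_ids: list, now: float) -> list:
        # dict.fromkeys drops duplicate ids while keeping first-seen order.
        unique = dict.fromkeys(listing_ids)
        return [lid for lid in unique if self.should_repost(lid, now)]
```

For example, with `RepostScheduler(cooldown_seconds=3600)`, a listing reposted at `now=0` is left out of `due(...)` until `now` reaches 3600.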

Tribara: Talent Matching Optimization
AI-powered recruitment optimization to automate candidate screening and ranking, integrated with ATS for seamless workflows. Delivered faster shortlists and more relevant matches for recruiters.
Key results
Technical implementation
- Python ETL & ML pipeline (parsing + scoring)
- NLP-based candidate ranking with continuous fine-tuning
- ATS integration (webhooks/API) & scoring feedback loop
- Dashboard & export for recruiter decision support
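
The parsing and scoring steps above can be sketched as follows. This is a deliberately simple stand-in, assuming a bag-of-words cosine similarity instead of the fine-tuned NLP model used in the project; `tokenize`, `cosine` and `rank_candidates` are hypothetical names.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> Counter:
    # Lowercase and count word-like tokens; keeps '+' and '#' for terms like "c++" or "c#".
    return Counter(re.findall(r"[a-z0-9+#]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_candidates(job_description: str, resumes: dict) -> list:
    """Return (candidate, score) pairs sorted from best to worst match."""
    job = tokenize(job_description)
    scored = [(name, cosine(job, tokenize(text))) for name, text in resumes.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

A production ranker would replace the token counts with learned embeddings and feed recruiter feedback back into the scores, but the shape (parse, score, sort) stays the same.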

Face Recognition: Find all photos of a person
Application that lets you upload a few photos of yourself to automatically detect all occurrences within an event album (ideal for team building/seminars).
Key results
Technical implementation
- Face detection: RetinaFace/SCRFD (InsightFace)
- Face embeddings: ArcFace (InsightFace, 512-d vectors)
- Similarity search & scaling: FAISS (IVF/PQ or HNSW)
- De-duplication & robustness: thresholding + DBSCAN; multi-reference averaging
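
As a rough sketch of the multi-reference averaging and thresholding steps listed above, the following pure-Python example works on precomputed embeddings. It assumes the detection and embedding stages have already run, uses a brute-force scan where the project uses FAISS, and the `threshold=0.5` default is an arbitrary illustrative value, not a tuned one. All function names are hypothetical.

```python
import math


def l2_normalize(vector):
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm else list(vector)


def mean_reference(embeddings):
    # Normalize each reference embedding, average component-wise, renormalize.
    normed = [l2_normalize(e) for e in embeddings]
    mean = [sum(column) / len(normed) for column in zip(*normed)]
    return l2_normalize(mean)


def find_matches(reference, album, threshold=0.5):
    """album maps photo_id -> list of face embeddings detected in that photo."""
    matches = []
    for photo_id, faces in album.items():
        # Best cosine similarity between the reference and any face in the photo.
        best = max(
            (sum(r * f for r, f in zip(reference, l2_normalize(face))) for face in faces),
            default=-1.0,
        )
        if best >= threshold:
            matches.append((photo_id, best))
    return sorted(matches, key=lambda match: match[1], reverse=True)
```

Averaging several reference photos tends to smooth out pose and lighting differences in any single upload; the threshold then trades missed photos against false matches.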

11Field: Football analytics & scouting suite
End-to-end scouting toolkit: xG/xGA, role-based radars, league comparators, match reports and player similarity. Adds ML models for clustering and explainability to support recruitment decisions.
Key results
Technical implementation
- Data ingestion from public football APIs (FBref/ESPN/ClubElo, etc.)
- Interactive dashboards (Streamlit + Plotly)
- PCA + KMeans for playing-style clusters
- RandomForest + SHAP for explainable player ranking
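
To make the playing-style clustering step concrete, here is a minimal k-means sketch in pure Python. It is an assumption-laden illustration: it skips the PCA step, uses a naive "first k points" initialization instead of a library implementation such as scikit-learn's, and treats each point as a hypothetical per-player feature vector.

```python
import math


def kmeans(points, k, iterations=20):
    """Return (labels, centroids) after a fixed number of Lloyd iterations."""
    # Naive deterministic initialization: the first k points become centroids.
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids
```

In practice the features would be standardized and reduced with PCA first, so that no single metric dominates the distance.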

Modern Data Capabilities
Ingestion & Connectivity
- REST/GraphQL, webhooks, SaaS & DB connectors
- Batch files (CSV/Parquet) + CDC/event streams
- Secrets, retries, backoff, idempotency
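
A minimal sketch of the retry and backoff pattern named above, assuming a hypothetical `retry_with_backoff` helper. It omits jitter for readability, and it is only safe when the wrapped call is idempotent, which is why the two appear together in the list.

```python
import time


def retry_with_backoff(call, max_attempts=5, base_delay=0.5, max_delay=30.0,
                       sleep=time.sleep):
    """Run call(); on failure wait base_delay * 2**n (capped) and try again."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except Exception:
            if attempt == max_attempts:
                raise  # Out of attempts: surface the last error to the caller.
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(delay)
```

The `sleep` parameter is injectable so the delays can be checked in tests without actually waiting.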
Workflow Orchestration
- Reproducible DAGs with clear SLAs
- Idempotent tasks, alerts, backfills
- Data-aware scheduling & dependency management
Lakehouse Storage & Formats
- Object storage + warehouse interoperability
- Parquet/Delta/Iceberg, partitioning & compaction
- Schema evolution, time travel & ACID tables
Modeling & ELT
- Layered models (staging → core → marts)
- Data contracts & tests (quality as code)
- SCD patterns, surrogate keys, audit columns
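
As one example of the SCD patterns mentioned above, this sketch applies a Type 2 slowly changing dimension update to an in-memory list of rows. The row layout (`sk`, `valid_from`, `valid_to`, `is_current`) and the `apply_scd2` name are assumptions for illustration; a warehouse would typically do this with a `MERGE` statement.

```python
from datetime import date


def apply_scd2(history, key, attributes, effective):
    """Close the current row for `key` if its attributes changed, then append a new one."""
    current = next(
        (row for row in history if row["key"] == key and row["is_current"]), None
    )
    if current is not None:
        if current["attrs"] == attributes:
            return history  # Nothing changed: keep the current version open.
        current["valid_to"] = effective
        current["is_current"] = False
    history.append({
        "sk": len(history) + 1,      # surrogate key, independent of the business key
        "key": key,                  # business (natural) key
        "attrs": dict(attributes),
        "valid_from": effective,     # audit columns describing the validity window
        "valid_to": None,
        "is_current": True,
    })
    return history
```

Calling it twice with the same attributes leaves a single row; a changed attribute closes the old row and opens a new one with the next surrogate key.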
Data Quality & Observability
- Freshness, completeness, accuracy monitors
- Column-level lineage & impact analysis
- Anomaly detection with playbooks/runbooks
BI & Semantic Layer
- Governed metrics/semantic layer for consistency
- Row-level security & policy-based access
- Drill-through dashboards, alerts & subscriptions
Data Apps & UX
- API-first apps (Next.js/React) with great UX
- Accessible, fast, mobile-friendly interfaces
- Shareable exports & decision-ready views
Realtime & Streaming
- CDC & event-driven pipelines (micro-batch/stream)
- Live dashboards via WebSockets/SSE
- Materialized views & low-latency caches
ML & MLOps
- Feature pipelines with reproducible training
- Experiment tracking, registry & versioning
- Drift/fairness monitoring & A/B evaluations
LLM & RAG
- Embeddings & chunking with prompt versioning
- Hybrid retrieval + guardrails & citations
- Privacy-aware grounding on enterprise data
Vector Search
- ANN indexes (HNSW, IVF-PQ) at scale
- Hybrid keyword + vector retrieval
- Deduplication & clustering for discovery
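
One common way to combine keyword and vector results is reciprocal rank fusion (RRF). The sketch below is an illustration of that technique, not necessarily the fusion method used in any project listed here; the function name is hypothetical and `k=60` is the constant commonly used for RRF.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document ids into one fused ranking."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank); higher ranks contribute more.
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Sort by descending score, breaking ties by document id for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))
```

Because RRF only looks at ranks, it avoids having to calibrate BM25-style scores against cosine similarities.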
Governance, Privacy & Security
- RBAC/ABAC, masking/tokenization of PII
- Catalog, lineage, ownership & audit logs
- Compliance by design (GDPR, ISO 27001)
FinOps & Performance
- Cost tags/budgets & storage lifecycle
- Pruning, partition pushdown, caching
- Autoscaling, SLAs/SLOs with clear error budgets
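
The error-budget idea above comes down to simple arithmetic, sketched here with a hypothetical `error_budget` helper and made-up request counts: a 99.9% SLO over one million requests allows roughly 1,000 failures.

```python
def error_budget(slo_target, total_requests, failed_requests):
    """Return allowed failures, remaining budget and the fraction already consumed."""
    allowed = (1.0 - slo_target) * total_requests
    return {
        "allowed_failures": allowed,
        "remaining": allowed - failed_requests,
        "consumed_fraction": failed_requests / allowed if allowed else float("inf"),
    }
```

With `error_budget(0.999, 1_000_000, 250)`, about a quarter of the budget is consumed (values are floats, so compare with a tolerance).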
Data CI/CD & DevEx
- Git-based reviews, tests & linters for data code
- Reproducible builds & artifact versioning
- Ephemeral preview envs & safe rollbacks
Interop & Internal APIs
- OpenAPI/JSON Schema contracts & governance
- Reverse ETL to operational tools
- Pagination, rate limits & idempotent writes
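
A minimal sketch of idempotent writes using an idempotency key, assuming an in-memory store and a hypothetical `IdempotentWriter` class. A real service would persist the keys and expire them, but the contract is the same: replaying a request with the same key must not create a second row.

```python
class IdempotentWriter:
    """Illustrative in-memory store that deduplicates writes by idempotency key."""

    def __init__(self):
        self._results = {}  # idempotency_key -> result of the first write
        self.rows = []

    def write(self, idempotency_key, payload):
        # A replayed key returns the stored result instead of writing again.
        if idempotency_key in self._results:
            return self._results[idempotency_key]
        self.rows.append(payload)
        result = {"row_id": len(self.rows)}
        self._results[idempotency_key] = result
        return result
```

This is what makes client-side retries safe: a timeout followed by a retry yields the original `row_id` rather than a duplicate record.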
Let's work together
Are you looking for a Data & AI engineer who combines technical expertise with a human touch: someone who turns data into meaningful stories, builds solutions that matter, and works hand in hand with teams to create impact?