Technical summary
PROTEONEXT Technical Summary
PROTEONEXT is conceived as a sovereign R&D platform to accelerate the discovery and prioritization of antimicrobial peptides against multidrug-resistant pathogens.
The defensible promise is not to create a clinical antibiotic within a few months. The defensible promise is to build a reproducible platform that reduces the search space, prioritizes candidates, and closes a first lab-model loop without moving sensitive data.
Natural role of SYNTAX
SYNTAX should lead the technology layer:
- Azure landing zone.
- Microsoft Entra ID, RBAC, PIM, and workload identities.
- Private network, Private Link, Azure Firewall, and private DNS.
- Key Vault or Managed HSM.
- Federated analytics and federated learning.
- MLOps, model registry, and traceability.
- Microsoft Purview for governance and lineage.
- Fabric and Power BI for project control with aggregated data.
- Defender for Cloud and Sentinel for security.
- Confidential computing and attestation when the risk justifies it.
Natural role of scientific partners
Scientific and clinical partners should contribute:
- Clinical microbiology.
- Bacterial isolates.
- Antibiograms, MIC, and resistance phenotypes.
- Bacterial genomics when available.
- MIC/MBC, hemolysis, cytotoxicity, and stability assays.
- Scientific criteria on pathogens, panels, and candidates.
Security rule
Sensitive data stays at source. In the federated design, queries, code, models, metrics, protected gradients, or authorized artifacts travel — not raw rows.
First didactic laboratory
The Desarrollo folder starts with synthetic data and simulated nodes. This allows learning and demonstrating:
- What an antibiogram is.
- What MIC means.
- How to aggregate AMR metrics without moving rows.
- How to train a basic federated model.
- How to score a peptide in a simplified way.
- Where confidential computing fits in Azure.