Global Citizen-Science Initiative Seeks Public Help to Verify Potential Universal Pattern in Protein Structures

Researchers turn to global volunteers to validate a newly identified amino acid clustering pattern across protein structures

  • A new citizen-science project is investigating a possible universal structural pattern in protein 3D architecture
  • The pattern involves amino acids clustering by chemical type into mosaic-like arrangements
  • Findings are based on analysis of over 160,000 protein structures and computational simulations
  • Volunteers contribute by generating and submitting protein visualization images as independent evidence
  • The project uses accessible tools like Jmol, allowing participation with minimal time and basic knowledge
  • A growing repository of images is being built to support validation through collective observation
  • If confirmed, the discovery could improve understanding of protein function and support drug design research

(NEWS) MADRID, 17-Mar-2026 — /EuropaWire/ — A new citizen-science initiative is inviting volunteers worldwide to help investigate what researchers describe as a potentially universal structural feature in proteins, one of the fundamental building blocks of life.

The project, known as Proteins Mosaic Q, focuses on a recently identified pattern in the three-dimensional (3D) structure of proteins, where amino acids appear to cluster according to their chemical properties. These groupings form a mosaic-like arrangement that can be both visually observed and mathematically quantified, according to project materials (EuropaWire press release).

The initiative is built on large-scale statistical and computational analysis of more than 160,000 experimentally determined protein structures, complemented by stochastic simulations. These analyses indicate a strong correlation (R² = 97.8%) supporting the presence of a specific amino acid clustering pattern within protein architectures. Researchers describe this configuration as non-trivial and dependent on defined groupings, or clusters, of amino acids.

According to project documentation, amino acids tend to organize into four main chemical categories—polar, hydrophobic, acidic, and basic—forming clusters that are typically composed of around eight residues located within close spatial proximity. This recurring arrangement is referred to as the “Mosaic Q” pattern and is suggested to be widespread across proteins (Proteins Mosaic Q Project).

The model is consistent with established principles such as the hydrophobic core of proteins, while proposing an additional layer of organization in which amino acids of similar chemical types group together in distinct spatial clusters. Visual confirmation of this pattern has been reported across multiple protein examples in a growing online repository (project repository).

The project is led by Francisco Javier Lobo Cabrera, a biotechnologist affiliated with Pablo de Olavide University, alongside Prof. José Antonio Prado-Bassas of the University of Seville and María Ángeles García-Rosado Delgado, also from the University of Seville. It is currently featured on citizen-science platforms including SciStarter and EU-Citizen.Science, expanding its visibility and accessibility to a global audience.

Unlike traditional research efforts confined to laboratories, Proteins Mosaic Q relies on distributed participation. Volunteers act as independent observers by generating visualizations of protein structures using open-source tools such as Jmol and submitting snapshot images as evidence. Each contribution helps build a collective dataset of visual confirmations.

The process is designed to be accessible: participants can select a protein structure, apply predefined visualization settings, and produce an image highlighting amino acid groupings. The task typically takes around 15 minutes and requires only basic familiarity with biomolecules, according to the project guidelines (SciStarter project page).

Project organizers say that expanding the number of independently generated examples is key to strengthening the evidence base. In addition to direct submissions, contributors are encouraged to share their images publicly using the hashtag #ProteinsMosaicQProject to raise awareness and engagement.

The initiative’s open and collaborative approach reflects a broader trend in citizen science, where public participation supports data collection and validation in complex research domains. If confirmed, the Mosaic Q pattern could contribute to a deeper understanding of protein structure and function, with potential implications for fields such as drug design and biotechnology.

Further technical details, including source code and extended analysis, are available through the project’s GitHub repository and associated preprint publication (GitHub documentation; bioRxiv preprint).

EDITOR'S PICK:

Comments are closed.