Genetics & Computational Biology Group

We are part of the Genetics Laboratory, Biotechnology Department, School of Applied Biology and Biotechnology at the Agricultural University of Athens.

Our research interests revolve around three distinct, yet intersecting, areas: i) The functional role of long non-coding RNAs, ii) Stress mechanisms and homeostasis via nuclear receptors & iii) Computational and molecular virology

How life began and what drives evolution? How can it be that more than 60% of the human genome is of viral origin? We now know that viruses have seriously influenced the course of life on our planet and that retro-transposable elements in some species make up the majority of their genome.

The concept of “dark DNA” has been adopted by the field of astrophysics, where cosmologists for many years refer to the part of matter and energy that cannot pinpoint or detect with the existing technology, as dark. However, it has been proven that this dark side of matter and energy is not just very real, but it also makes up more than 90% of the universe. Likewise, in genetics, dark DNA refers to that part of the genome, whose role is not clear, or seems to do not much in the cellular processes essential to life. Consequently, the role of non-coding RNA and the role of introns in alternative splicing is of uttermost priority in our research.

We develop and apply our research to a repertoire of scientific domains. Namely, we have been investigating the role of nuclear receptors in gene expression and regulation and in particular the role of the glucocorticoid receptor in stress. Then, we look into the realm of viruses and in particular into the ssRNA family of flaviviridae and ebola virus for the investigation of the genetic elasticity of viral genomes (antigenic drift/shift) and how epigenetics could annihilate current vaccination protocols. Finally, recently, we have embarked onto the very fascinating field of neurodegeneration, with the genetic and structural study of LRRK2.

In our lab we use a wide spectrum of methodologies ranging from state-of-the-art bioinformatics, molecular modelling and hypercomputing pipelines to traditional genetic and biochemistry in vitro and in vivo techniques. On one hand, we have developed our own software for machine learning, deep learning and data mining for multimodal fusion of information from genetic, structural and physicochemical analyses. On the other hand, we use DNA engineering, SDM, cloning, expression and protein purification techniques to biologically test and evaluate the validity and reliability of our models.

Novel Anti-viral Gene Editing Platforms

Developing a new drug entity, from drafting its structure to market launching, is a complex process which can take many years and cost millions of dollars. It may also take many years to grow an idea of supporting evidence before selecting the appropriate pharmacological target for a rather costly drug discovery program to be deployed. This reflects a massive investment in terms of time, money, human and other potential resources. Only for the human immunodeficiency virus (HIV) vaccine, scientists have been working over 30 years and it is estimated that $500 million is being spent every year over the last decade on a candidate HIV vaccine. Creating an effective drug against viral infections represents one of the greatest biological and medical challenges of a generation. Despite the new entry vaccines in clinical trials and the medical achievements, the fight against the fatal global epidemic virus continues. However, the scientific community has failed to address the scourge of viruses beyond the use of some drugs to control symptomatically the infection and to reduce viral load. The use of hitherto classical techniques had no promising results since the life expectancy upon viral infection with Ebola, HIV or some of the Flaviviridae family is very short in many cases indeed. The landscape of anti-viral treatment must be changed dramatically, and new horizons should arise including new strategies for the fight against viruses in the near future.

Novel genetic engineering technologies have been developed enabling precise editing of genomes and these have numerous important clinical applications including the treatment of genetic diseases, viral infections, and cancer. These new classes of reagents can specifically target nucleotide sequences within the cellular genome. The ability to correct gene associated mutations is an attractive approach as a treatment option. Based on recent studies, the CRISPR/Cas9 gene editing system has been used to target the HIV-1 long terminal repeat (LTR) of integrated pro-viral DNA in the genome of cell in tissue culture and was able to inactivate viral gene expression and replication in a variety of latently infected cells. This is an important first step in a potentially promising approach towards a new therapeutic pipeline that aims to eliminate all the permanently integrated DNA proviral copies of HIV-1 in an infected individual. It can also be used as a measure of the efficacy of CRISPR/Cas9 in cells to which it is delivered, independent to cell type.

In this direction, many issues must be addressed including the massive amount of data, the multiple sequence alignments based on homolog blocks, the hybrid sequence alignments, the classification of different viral strains in clusters, the statistical analysis of genetic variations, the conserved motifs and patterns exploration and the nuclear genome comparisons between infected and non-infected cells. Therefore, fast, flexible and memory efficient bioinformatics techniques must be deployed to facilitate analysis of thousands of samples simultaneously. New virus-specific sequence patterns can be identified using genome-wide scale computational analysis. Furthermore, significant insights on the duplicative stepwise viral evolution are investigated. All-in-all, collecting and separating the different forms of viral species in groups using deep learning and hyper-computing pipelines will increase the likelihood of finding the most representative sequence and structure specific conserved motifs for gene editing-based startegies on viral targets and eventually will lead to a powerful and effective treatment against viral infections.

The DARK side of Life...

For many years researchers have been doing gene and protein comparisons using similarity indexes of sequenced entries. In our group, we chose to turn 180 degrees and look for the exact opposite; to develop evolutionary measures based on de-similarity and the DARK side of the genome and proteome (i.e. what we know is there, but cannot find it).

In this direction, we investigate the role of minimal absent words (MAWs) and cryptides. MAWs are very short sequences of DNA that do not occur in the genome of a species, even though they are theoretically possible. The chances of not finding an 11-mere DNA sequence in 3.2 billion bases of human genome are zero, provided that DNA is a 4-letter code. So, how come CGCTCGACGTA, GTCCGAGCGTA, CGACGAACGGT or CCGATACGTCG are nowhere to be found in any human genome? We speculate that if they appear, they lead to non-survivors and therefore elucidating their function could be huge in understanding disease. Likewise, cryptides are peptide fragments generated during maturation or degadation processes of functional proteins showing various biological activities distinct from those of the parent proteins. Peptides are involved in many cellular processes, however, their origin may not be directly from the ribosome after mRNA processing, but from fragments of degraded proteins.

Accordingly, in our group we are interested in the development of novel DARK-omics platforms for the functional screening of new MAWs and cryptides, for the investigation of gene and peptido-mimetics with variegate applications.


New open access peer-reviewed edited volume on:


click here...

Evolutionary and developmental biology is focused on the investigation and the elucidation of the molecular mechanisms underlying embryonic development from one generation to the other. It started quite early with Darwin and Mendel, but soon it became apparent that it is a complicated network of interacting genes after stimuli from molecules, our microbiome, other organisms or species and the environment that we live in. However, in recent years the accumulation of big data in the fields of genomics, medical records and other relevant datasets has rendered the study of evo-devo impossible without the aid from advanced and highly sophisticated computational models, which aim to mine and fuse information from diverse disciplines in the realm of evolution and developmental biology. This book aims to address the new developments in the rapidly evolving field of evo-devo in the post genomics era. All recent biological and medical breakthroughs in the evo-devo field are welcomed. Finally, review articles encompassing recent advances, development current and future trends are also more than welcomed.

ISBN: 978-1-83962-170-3 / Print ISBN: 978-1-83962-169-7 / eBook (PDF) ISBN: 978-1-83962-171-0
Editors: Dr Dimitrios P. Vlachakis - Prof Elias Eliopoulos - Prof George Chrousos

Bioinformatics Software developed by our group

The Dark Suite

A comprehensive evolutionary and molecular modelling suite that comprises of a series of tools and workflows for drug design developed by our group.
(Download here)

Drugster Suite

Drugster is a freeware platform aimed to assist scientists in the field of Computer Aided Drug Design.
(Download here)

DrugOn Suite

DrugOn is a fully interactive pipeline designed to overcome the command line barrier with two friendly environments for the user.
(Download here)

Drugena Suite

Drugena encompasses an up-to-date database of antibodies for neurological disorders and the NCI database for the in silico development of ADCs.
(Download here)

Antisoma Suite

Antisoma application has been developed in order to provide the information found with this new approach and concerning the antibodies of the species.
(Download here)

Xplatform Suite

With the extensive use of microarray technology as a potential prognostic and diagnostic tool, the comparison and reproducibility of results obtained from the use of different platforms is of interest.
(Download here)

Space Suite

Space Suite, a new homology modelling approach, successfully applied to the homology modelling of the RNA-dependent RNA polymerase (RdRp) of dengue (type II) virus.
(Download here)

Structuprint Suite

A 'molecular cartography' suite for the two-dimensional transformation of protein structures the multidimensional analysis of their physicochemical properties.
(more info here)
(Download here)

Taggo Suite

Taggo takes advantage of the Gene Ontology (GO) to extract the proteins’ main attributes and combines the potential of discarding annotations that are supported by not so reliable Ecs.
(Download here)

Brukin2d Suite

The Brukin2d software, is a visualization tool for large data sets produced from LC-MS. Specifically the chromatograph and the mass spectra that correspond to its peaks is the focus of this work.
(Download here)

Giba Suite

GIBA is an effective and easy to use tool for the detection of protein complexes through clustering PPI networks. GIBA surpasses other methods in quality approximations of protein complexes.
(Download here)

AHC Suite

AHC is a hierarchical algorithm that performs successive min–cuts until it identifies dense subgraphs. AHC leads to the selection of more dense subgraphs as protein complexes candidates.
(Download here)

GAppi Suite

GAppi performs clustering in PPI networks to identify protein complexes. The algorithm has been tested exhaustively and results showed that GAppi outperforms other techniques producing feasible and very efficient solutions.
(Download here)

GOmir Suite

GOmir (by using up to four different databases) introduces, for the first time, miRNA predicted targets accompanied by (a) full gene description, (b) functional analysis and (c) detailed gene ontology clustering.
(Download here)

FED & SAFE Suites

A tool that encompasses retrieval and analysis of fusion events used to identify putative protein-protein interactions in completely sequenced genomes of various prokaryotes, and eukaryotes.
(Download FED here)
(Download SAFE here)

Thetis Suite

A suite of computer programs that has been developed for monitoring structural changes on a-helical secondary structural elements during molecular dynamics (MD) simulations on protein systems.
(Download here)


You can (usually) find us at:
Agricultural University of Athens,
Iera Odos 75, 11855, Athens, Greece
(click for MAP)

Email Address

You can (always) email us at:

Phone Number

You can (try to) call us at:
+30 210 5294323