MEC

Molecular Evolutionary Computing (Phases I, II, III)

2000~2009 Korea Science and Engineering Foundation (KOSEF)

The MEC project investigated evolutionary algorithms operating at the molecular level, using DNA molecules as the primary computational substrate. Running across three phases over nearly a decade, it was one of the lab’s foundational long-term programmes and among the earliest sustained international efforts to implement practical machine learning algorithms using DNA molecules.

Overview

Molecular Evolutionary Computing (MEC) explored a radical question: can the molecular machinery of life serve as a computing medium for learning and optimization? The project operated at the intersection of computer science, molecular biology, and biochemistry, implementing computational concepts — from evolutionary search and version-space learning to combinatorial optimization — directly in DNA strands processed through standard molecular biology operations such as PCR, gel electrophoresis, and hybridization.

The long-term vision was to build bio-based inference machines that exploit the inherent parallelism of molecular populations, potentially enabling computations that are intractable on silicon architectures. MEC also pursued the complementary direction of applying DNA computing to biomedical problems, particularly molecular-level gene expression diagnosis.

Research Phases

Phase I (2000–2002): Foundations of Molecular Evolutionary Search

Developed DNA-based solution strategies for the Travelling Salesman Problem (TSP) using strand encoding and selection
Investigated multiobjective optimization of DNA sequence design to minimize cross-hybridization and secondary-structure artifacts — a prerequisite for reliable computation
Designed the first in-vitro protocols for evolutionary search using molecular biology operations (ligation, PCR amplification, gel separation)
Initiated internal seminars reviewing the DNA Based Computing conference series (DNA6 proceedings, 2001) to track the international state of the art

Phase II (2002–2006): Machine Learning with DNA Molecules

Implemented version-space learning with DNA molecules: a classical machine learning framework (elimination of hypothesis space inconsistent with training examples) encoded and executed in molecular populations
Explored molecular theorem proving, splicing systems, and hairpin-structure-based logical inference
Designed and studied the NACST (Nucleic Acid Computing Simulation Toolbox) for in silico validation of molecular computing protocols before wet-lab implementation
Investigated chip-based and microreactor-based DNA computing architectures
Continued weekly MEC seminars covering international research on DNA self-assembly, aqueous computing, and biomolecular logic gates; participants included D.-Y. Cho, D. Kim, S.-Y. Shin, H.-W. Lim, I.-H. Lee, J.-E. Yun, J.-Y. Lee, J.-Y. Park, and H.-M. Jang

Phase III (2006–2009): DNA Computing for Molecular Diagnosis

Shifted the application domain toward molecular diagnosis: using DNA computing algorithms to classify gene expression profiles
Developed methods for microRNA biomarker detection using DNA-based computing systems
Bridged molecular computing and biomedical informatics, establishing the conceptual and experimental foundation for later projects such as ProMiR and HyperSNP

Research Team

Role	Name	Affiliation
Principal Investigator	Prof. Byoung-Tak Zhang	Seoul National University, Dept. of Computer Science and Engineering
Contact	btzhang@scai.snu.ac.kr	—

Researchers and seminar participants (representative):

Soo-Yong Shin (S.-Y. Shin) — page maintainer, 2001
In-hee Lee (I.-H. Lee) — page maintainer, 2002–2004
Dong-Yeon Cho (D.-Y. Cho)
Jae-Young Lee (J.-Y. Lee)
Ji-Young Park (J.-Y. Park)
Hyun-Woo Lim (H.-W. Lim)
Hyun-Moon Jang (H.-M. Jang)
Dong-Min Kim (D. Kim)
Ji-Eun Yun (J.-E. Yun)
Se-Jin Lee (S.-E. Lee), Seong-Jun Augh (S.-J. Augh), Hyun-Wook Lee (H.-W. Lee), Jae-Hoon Eom (J.-H. Eom), Sang-Jun Lee (S.-J. Lee), Jae-Eun Youn (J.-E. Youn), Yoon-Kyung Noh (Y.-K. Noh) (2001 cohort)

Technical Approach

The MEC methodology combined algorithmic design with wet-lab molecular biology:

Encoding: computational states (candidate solutions, hypotheses) were encoded as specific DNA oligonucleotide sequences
Selection / Fitness evaluation: hybridization affinity or PCR amplification biased the population toward fitter strands
Variation / Crossover: ligation, strand displacement, and restriction digestion introduced variation analogous to genetic operators
Readout: gel electrophoresis, fluorescence labeling, or microarray scanning decoded the final molecular population into a computational answer

Complementary simulation work (NACST toolbox) allowed iterative refinement of protocols in silico before committing resources to in vitro experiments.

Collaboration

The lab was an active participant in the international DNA computing community. The DNAC_researcher network page maintained by the group lists collaborating researchers at:

University of Southern California (Leonard Adleman)
University of Tokyo (Masami Hagiya)
Duke University (John H. Reif)
Caltech (Erik Winfree, Paul Rothemund)
University of Memphis (Max H. Garzon, Russell J. Deaton)
New York University (Nadrian C. Seeman)
National Institutes of Health (Tom Schneider)
GMD / Fraunhofer (John Simpson McCaskill)
Leiden University (Grzegorz Rozenberg)
University of Western Ontario (Lila Kari)

Key Publications

Representative publications from the MEC project (exact citation details for Phase I–II papers should be verified against the lab publication list):

S.-Y. Shin, I.-H. Lee, D. Kim, and B.-T. Zhang, “Multiobjective evolutionary optimization of DNA sequences for reliable DNA computing,” IEEE Transactions on Evolutionary Computation, 9(2), pp. 143–158, 2005.
B.-T. Zhang, “Molecular evolutionary computing (MEC),” invited talk / survey, presented at DNA computing workshops, 2001–2004.
I.-H. Lee, S.-Y. Shin, and B.-T. Zhang, “DNA-based version space learning for gene expression profile diagnosis,” internal MEC report / conference contribution, 2002–2004.
(See also the lab’s journal publication list at dnac_journal.html for the full set of DNA computing publications from this project period.)

Significance

MEC established the lab’s unique interdisciplinary identity at the boundary of computer science and molecular biology. The techniques and insights developed — particularly around DNA sequence design, molecular machine learning, and biomedical informatics — directly seeded later projects including ProMiR, HyperSNP, DNAChipBench, and Molecular ML. The project also cultivated a generation of researchers fluent in both algorithmic thinking and wet-lab molecular biology, an unusual combination that shaped the lab’s subsequent research directions.