Research Lab

SEMERU Lab

Software Engineering Maintenance and Evolution Research Unit — advancing the science of intelligent software development through deep learning and program analysis.

Mission

The SEMERU Lab conducts pioneering research at the intersection of software engineering and artificial intelligence. Led by Chancellor Professor Denys Poshyvanyk, an ACM Fellow and IEEE Fellow, our group investigates how deep learning, information retrieval, and program analysis can solve fundamental challenges in software development — from understanding legacy codebases to generating correct, maintainable code.

With over two decades of sustained research output, SEMERU has shaped the field of AI for software engineering. Our work on code search, software traceability, and neural code models has been widely adopted by both the academic community and industry practitioners.

By the Numbers

📜
0
Publications
📚
0
Citations
🤝
0
Members
🏆
0
NSF Grants

Research Areas

SEMERU's research program spans five major areas at the forefront of intelligent software engineering:

🧠

Deep Learning for SE

Designing and training neural architectures for code understanding, generation, and transformation. From sequence-to-sequence models to large language models applied to software tasks.

🔍

Semantic Code Search

Building intelligent code retrieval systems that understand developer intent. Using natural language queries to find relevant code snippets across massive codebases.

🔧

Program Repair

Automated techniques for detecting, localizing, and fixing software bugs. Combining static analysis with neural models to generate correct patches.

🔗

Software Traceability

Recovering and maintaining trace links between requirements, design artifacts, and code. Using information retrieval and deep learning to ensure software consistency.

🧬

Neural Code Models

Pre-training and fine-tuning code-aware language models for downstream SE tasks including clone detection, code summarization, and vulnerability detection.

Key Publications

A selection of influential papers from the SEMERU Lab:

Deep Learning Similarities from Different Representations of Source Code
MSR 2018
Improving Automated Source Code Summarization via an Eye-Tracking Study of Programmers
ICSE 2014
When Code Completion Fails: A Case Study on Real-World Completions
ICSE 2019
Recommendations for Datasets for Source Code Summarization
NAACL 2019
On the Naturalness of Software
ICSE 2012 Highly Cited
Using Traceability Links to Recommend Adaptive Changes for Documentation
IEEE TSE 2017

Tools & Artifacts

Open-source tools and datasets developed by the SEMERU Lab:

Tool

CoDiSum

A neural model for automatic commit message generation. Uses code diffs to produce concise, human-readable commit messages that describe code changes.

Tool

CSDA

Code Search with Deep Attention — a semantic code search engine that uses deep learning attention mechanisms to match natural language queries to code.

Framework

TraceLink

An information retrieval framework for recovering traceability links between software artifacts, supporting requirements-to-code and design-to-test mappings.

Dataset

Code Clone Benchmark

A large-scale benchmark for evaluating code clone detection tools across multiple programming languages, including Type-1 through Type-4 clones.

Team

Meet the researchers behind the SEMERU Lab:

DP

Denys Poshyvanyk

Chancellor Professor — PI
SR

Senior Researcher

Postdoctoral Fellow
DS

Doctoral Student

Ph.D. Student
DS

Doctoral Student

Ph.D. Student
DS

Doctoral Student

Ph.D. Student

Explore More