Past Projects
GPGPU를 이용한 인메모리 고속, 병렬 KLT 비식별화 알고리즘 개발
Tajo: A Distributed Data Warehouse System on Large Cluster
Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. Tajo uses HDFS as a primary storage layer and has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities.
Web Log Analyzer
Web Logs Analyzer is a system that user can easily query about real network traffic. It provides various features such as ETL, analytic measure materialization, query registration etc.
Real-time data processing network system
Pub/sub based user preference information system,
Real-time event processing network system,
Scalable system that using cloud computing techniqueGraphMR: A Distributed Graph Match Method using MapReduce
This work is a distributed graph match method on large data sets. This work was submitted to IEEE Transactions on Kowledge and Data Engineering (TKDE).
SPIDER: A System for Scalable, Parallel / Distributed Evaluation of large-scale RDF
This project aims at processing large-scale RDF data. In this project, we developed scale RDF processing method using MapReduce that is a distributed processing framework and storing techniques for large RDF data sets. This project was demonstrated in the 18th ACM Conference on Information and Knowledge Management (CIKM) in November 2009.
R3 - Wireless Broadcast System
This is a system that consist of server and client. The server broadcast news data and the client can connect to the channel and get the data.