Eleni Markou - Machine Learning/Software Engineer

Hello, world!

I am a Senior Machine Learning and Software Engineer, currently working at the BEAT .

Through the years the focus of my work have been twofold: the design and implementation of efficient and scalable ML pipelines for production systems while keeping up with the latest advances in academic research. Currently, I am developing deep learning models for dynamic pricing for our ride-hailing app while in the past I worked on recommender systems and various natural language processing tasks. My scientific interests include, but are not limited to, Deep Learning and Natural Language Processing.

In my free time I enjoy backpacking around the world, taking photos and tasting local food.

P.S.: I also enjoy blogging! I usually write about new things that I learn, new technologies that I try or technical problems that I faced and solved (or maybe not). You can check my posts here.

For more information about me, see my CV or contact me.

Projects

A visualization of a graph network.

Data Analytics and Research Assessment: A Case Study in Health and the Diseases of the Circulatory System

Joint work with Marianna Klironomou

Advised by Dr. Haris Papageorgiou

The project was developed under the auspices of "Data4Impact" H2020 project, which aimed to access the performance of EU and national research and innovation system. In collaboration with "Athena" Research Center, the project attempts to employ a network approach on the matter and develop a multi-layer graph infrastructure in order to assess the societal impact of health-related research in Europe for the past 10 years using Data Mining and Machine Learning techniques.

[Data4Impact Homepage] [Deliverable (PDF)]

Automatic Tagging of Audio Clips with Descriptive Tags.

Automatic Tagging of Audio Clips with Descriptive Tags

Advised by Dr. Haris Papageorgiou

We developed a Convolutional Neural Network (CNN) that would take the spectrograms of the initial audio clips and perform a multi-label classification task with 50+ potential tags. This way, we easily (1) label audio clips avoiding hand-engineered features like Mel Frequency Cepstral Coefficients (MFCCs) which require expert knowledge and (2) estimate the similarity between audio clips or music songs (by computing the number of overlapping tags).

[GitHub Repository]

Workflow of the selected methodlogy.

Computing similairty among textual documents in newspaper articles

By subsequently applying shingling (on a word level), minhashing and Locality Sensitive Hashing (LSH) we managed to be able to compute Jaccard similarity among text corpora. This way, we can (1) retrieve similar documents for any given document and (2)effectively apply the same methodology in significantly large textual collections.

[GitHub Repository]

Blog Posts

Machine Learning Alogrithms you need to know

Decision trees vs. clustering algorithms vs. linear regression: Which machine learning algorithms should you use, why, and when?

Imagine you have some data-related problem that you want to solve. You have heard of all the amazing things that machine learning algorithms can achieve and want to try it for yourself — but you have no prior experience or knowledge in this area. You start googling some terms like “machine learning models” and “machine learning methodologies,” but after some time, you find yourself ready to give up, completely lost somewhere between the different algorithms.

How to Work With Pivot Tables in PostgreSQL, Amazon Redshift, BigQuery, and MS SQL Server

During the past few years, some well-known database systems have implemented functions for pivot table creation, saving us from the development of complicated queries.

All of us have at some point worked with some spreadsheet software, like Excel or Google Sheets, or BI tools and we have to admit that they offer certain functionalities that are very handy when it comes to data presentation and reporting, like the so-called pivot tables. Since many business applications require some sort of pivot tables, I am sure many of you have found themselves struggling with how to satisfy these requirements using a database instead of a spreadsheet.

SQL Database, Table, and Data Partitioning: When and How to Do It

As with everything in life, it seems that table partitioning comes at a cost. Nevertheless, if implemented in the right way at the right time, it can be a lifesaver.

When I first came across table partitioning and started searching, I realized two things. First, it is a complex operation that requires good planning. Second, in some cases, it can be proven extremely beneficial, while in others, it can be a complete headache.

Hello, world!

Education

Athens University of Economics and Business (AUEB)

National Technical University of Athens (NTUA)

Professional Experience

BEAT

METIS Cyberspace Technology

VAIX.ai

Blendo.co

Greek Reaserch and Technology Network (GRnet)

Projects

Data Analytics and Research Assessment: A Case Study in Health and the Diseases of the Circulatory System

Joint work with Marianna Klironomou

Advised by Dr. Haris Papageorgiou

Automatic Tagging of Audio Clips with Descriptive Tags

Advised by Dr. Haris Papageorgiou

Computing similairty among textual documents in newspaper articles

Blog Posts

Machine Learning Alogrithms you need to know

Decision trees vs. clustering algorithms vs. linear regression: Which machine learning algorithms should you use, why, and when?

How to Work With Pivot Tables in PostgreSQL, Amazon Redshift, BigQuery, and MS SQL Server

During the past few years, some well-known database systems have implemented functions for pivot table creation, saving us from the development of complicated queries.

SQL Database, Table, and Data Partitioning: When and How to Do It

As with everything in life, it seems that table partitioning comes at a cost. Nevertheless, if implemented in the right way at the right time, it can be a lifesaver.