Sitemap

A list of all the posts and pages found on the site. For the robots out there, an XML version is available for digesting as well.

Pages

Posts

BERT Visualization in Embedding Projector

less than 1 minute read

Published:

This story shows how to visualize pre-trained BERT embeddings in TensorFlow's TensorBoard Embedding Projector. The story uses around 50 unique sentences and their BERT embeddings generated with TensorFlow Hub BERT models. See full article here
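
The export step behind a story like this can be sketched in a few lines: the Embedding Projector loads a tab-separated file of vector components and a matching metadata file of labels. The sketch below is illustrative only, with random vectors standing in for the actual BERT outputs; the sentences and file names are assumptions, not taken from the article.

```python
import csv
import random

# Toy stand-ins for BERT sentence embeddings: in the article the vectors
# come from a TensorFlow Hub BERT model; random vectors are used here so
# the sketch stays self-contained.
sentences = [
    "The bank raised interest rates.",
    "We sat on the river bank.",
    "She plays the bass guitar.",
]
dim = 8
embeddings = [[random.random() for _ in range(dim)] for _ in sentences]

# The Embedding Projector loads two files: one row of tab-separated
# vector components per point, and one metadata label per point.
with open("vectors.tsv", "w", newline="") as f:
    writer = csv.writer(f, delimiter="\t")
    writer.writerows(embeddings)

with open("metadata.tsv", "w") as f:
    for sentence in sentences:
        f.write(sentence + "\n")
```

Both files can then be uploaded at projector.tensorflow.org, or pointed to from a TensorBoard projector config.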

Simple BERT using TensorFlow 2.0

less than 1 minute read

Published:

This story shows a simple usage of the BERT [1] embedding using TensorFlow 2.0. As TensorFlow 2.0 was released recently, the module aims to provide easy, ready-to-use models based on the high-level Keras API. The previous usage of BERT was described in a long Notebook implementing a Movie Review prediction. In this story, we will see a simple BERT embedding generator using Keras and the latest TensorFlow and TensorFlow Hub modules. All code is available on Google Colab. See full article here
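
The interface of such an embedding generator (sentence in, fixed-length vector out) can be mirrored with a self-contained stand-in. The hash-based embedder below is a hypothetical placeholder for the TensorFlow Hub BERT layer the story actually uses; only the shape of the pipeline (tokenize, embed tokens, mean-pool) is illustrated.

```python
import hashlib

DIM = 16  # embedding size of the stand-in; real BERT-base uses 768

def token_vector(token):
    """Deterministic pseudo-embedding for a token via hashing.
    A stand-in for a learned token embedding."""
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:DIM]]

def embed(sentence):
    """Sentence in, fixed-length vector out: embed tokens, mean-pool."""
    tokens = sentence.lower().split()
    vectors = [token_vector(t) for t in tokens]
    return [sum(col) / len(vectors) for col in zip(*vectors)]

embedding = embed("TensorFlow makes BERT embeddings easy")
```

In the real pipeline the `embed` function would wrap a loaded TF Hub BERT model and its matching tokenizer instead of a hash.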

Machine Translation: Compare to SOTA

less than 1 minute read

Published:

My previous story describes BLEU, the most widely used metric for Machine Translation (MT). This one introduces the conferences, datasets and competitions where you can compare your models with the state of the art, collect knowledge, and meet researchers from the field. See full article here

Identifying the right meaning of the words using BERT

less than 1 minute read

Published:

An important reason for using contextualised word embeddings is that standard embeddings assign a single vector to each word, even though many words have multiple meanings. The hypothesis is that using the context can solve the problem of collapsing multiple-meaning words (homonyms and homographs) into the same embedding vector. In this story, we will analyse whether BERT embeddings can be used to classify the different meanings of a word, to show that contextualised word embeddings solve the problem. See full article here
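
The classification idea can be illustrated with a minimal nearest-sense sketch. The three-dimensional vectors below are made up for illustration, not BERT outputs; the point is only that a contextual model gives each occurrence of "bank" its own vector, so occurrences can be grouped by sense.

```python
from math import sqrt

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm

# Hypothetical contextual embeddings of the word "bank": a static
# embedding would give all three occurrences the same vector, while a
# contextual model separates the financial sense from the river sense.
bank_loan    = [0.9, 0.1, 0.0]   # "take a loan from the bank"
bank_deposit = [0.8, 0.2, 0.1]   # "deposit money at the bank"
bank_river   = [0.1, 0.9, 0.2]   # "sit on the bank of the river"

# A minimal 1-nearest-neighbour sense classifier: nearest labelled
# sense by cosine similarity wins.
senses = {"financial": bank_loan, "river": bank_river}

def classify(vector):
    return max(senses, key=lambda s: cosine(vector, senses[s]))
```

With these toy vectors, `classify(bank_deposit)` picks the financial sense, which is the behaviour the story tests for real BERT embeddings.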

Machine Translation: A Short Overview

less than 1 minute read

Published:

This story is an overview of the field of Machine Translation. It introduces several highly cited works and famous applications, but I'd like to encourage you to share your opinion in the comments. The aim of this story is to provide a good starting point for someone new to the field. It covers the three main approaches to machine translation as well as several challenges of the field. Hopefully, the literature mentioned in the story presents the history of the problem as well as the state-of-the-art solutions. See full article here

Visualisation of embedding relations (word2vec, BERT)

less than 1 minute read

Published:

In this story, we will visualise word embedding vectors to understand the relations between words described by the embeddings. This story focuses on word2vec [1] and BERT [2]. To understand the embeddings themselves, I suggest reading a separate introduction, as this story does not aim to describe them. See full article here
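
The kind of relation such visualisations reveal can be sketched with the classic analogy arithmetic. The two-dimensional vectors below are hand-crafted so that one component encodes "royalty" and the other "gender"; real word2vec or BERT vectors are learned, and these toys only illustrate the vector arithmetic behind the plots.

```python
from math import sqrt

# Hand-crafted toy word vectors: component 0 ~ royalty, component 1 ~ gender.
vectors = {
    "king":  [0.9, 0.9],
    "queen": [0.9, 0.1],
    "man":   [0.1, 0.9],
    "woman": [0.1, 0.1],
}

def nearest(target, exclude):
    """Word whose vector is closest (Euclidean) to target."""
    def dist(w):
        return sqrt(sum((a - b) ** 2 for a, b in zip(vectors[w], target)))
    return min((w for w in vectors if w not in exclude), key=dist)

# The classic relation: king - man + woman should land near queen.
analogy = [k - m + w for k, m, w in
           zip(vectors["king"], vectors["man"], vectors["woman"])]
result = nearest(analogy, exclude={"king", "man", "woman"})
```

Plotting such vectors (after projecting high-dimensional embeddings down to 2D) is exactly what makes these parallel "relation arrows" visible.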

portfolio

publications

Hyphenation using deep neural networks

Published in XIV. Magyar Számítógépes Nyelvészeti Konferencia, 2018

Hyphenation algorithms are computer-based methods of syllabification, mostly used in typesetting and document formatting as well as in text-to-speech and speech recognition systems. We present a deep learning approach to the automatic hyphenation of Hungarian text. Our experiments compare feed-forward, recurrent and convolutional neural network approaches.

Recommended citation: Németh, G. D., Ács, J. (2018). "Hyphenation using deep neural networks" XIV. Magyar Számítógépes Nyelvészeti Konferencia http://negedng.github.io/files/2018-Hyphenation.pdf

A Snapshot of the Frontiers of Client Selection in Federated Learning

Published in Transactions on Machine Learning Research, 2022

Federated learning (FL) has been proposed as a privacy-preserving approach in distributed machine learning. A federated learning architecture consists of a central server and a number of clients that have access to private, potentially sensitive data. Clients are able to keep their data in their local machines and only share their locally trained model’s parameters with a central server that manages the collaborative learning process. FL has delivered promising results in real-life scenarios, such as healthcare, energy, and finance. However, when the number of participating clients is large, the overhead of managing the clients slows down the learning. Thus, client selection has been introduced as a strategy to limit the number of communicating parties at every step of the process. Since the early naive random selection of clients, several client selection methods have been proposed in the literature. Unfortunately, given that this is an emergent field, there is a lack of a taxonomy of client selection methods, making it hard to compare approaches. In this paper, we propose a taxonomy of client selection in Federated Learning that enables us to shed light on current progress in the field and identify potential areas of future research in this promising area of machine learning.
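
The naive random selection the abstract mentions as the early baseline can be sketched in a few lines. The function name and the 10% participation fraction below are illustrative assumptions, not taken from the paper.

```python
import random

def select_clients(client_ids, fraction, rng):
    """Naive random client selection: each round, the server samples a
    fixed fraction of clients to train and report model updates."""
    k = max(1, int(len(client_ids) * fraction))
    return rng.sample(client_ids, k)

rng = random.Random(0)          # seeded for reproducibility
clients = list(range(100))      # 100 registered clients
round_participants = select_clients(clients, fraction=0.1, rng=rng)
```

The client selection methods the paper's taxonomy covers replace this uniform sampling with criteria such as client data quality, system resources, or fairness.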

Recommended citation: Németh, G. D., Lozano, M. A., Quadrianto, N., & Oliver, N. (2022). "A Snapshot of the Frontiers of Client Selection in Federated Learning" Transactions on Machine Learning Research http://negedng.github.io/files/2022-Snapshot.pdf

talks

teaching

Introduction to the Theory of Computing 2, 2016+

Undergraduate course, Budapest University of Technology and Economics, Department of Computer Science and Information Theory, 2016

This is an undergraduate teaching assistant experience. The goal of the subject is to acquire the fundamental mathematical knowledge (in the area of linear algebra and number theory) necessary for software engineering studies.

Basics of programming

High school course, ELTE Radnóti Miklós School, 2016

This is a one year experience as a teacher for 11-12th grade students. The course is a specialisation to learn the basics of programming with the help of ColoBot, Processing and Java.

Introduction to the Theory of Computing 2, 2017

Undergraduate course, Budapest University of Technology and Economics, Department of Computer Science and Information Theory, 2017

This is an undergraduate teaching assistant experience. The goal of the subject is to acquire the fundamental mathematical knowledge (in the area of linear algebra and number theory) necessary for software engineering studies.

Advanced level programming

High school course, Fazekas Mihály School, 2017

This is an advanced level programming course for high school students. Students of this course competed at various levels of programming championships.

Introduction to the Theory of Computing 1, 2017

Undergraduate course, Budapest University of Technology and Economics, Department of Computer Science and Information Theory, 2017

This is an undergraduate teaching assistant experience. The goal of the subject is to acquire the fundamental mathematical knowledge (in the area of linear algebra and number theory) necessary for software engineering studies.

Basics of programming - online

Online course, Udemy, 2018

This is an online course teaching programming in Hungarian. Originally, the course was free, but I realized that this reduced the students' motivation. Therefore, I changed the price to the minimum amount possible on Udemy.