Sudipta Mondal

Dhaka,

sudiptamondal.cs@gmail.com

mondal.sudiptahere@gmail.com

Twitter

GitHub

Natural Language Processing
Trustworthy AI
Self-Supervised Learning
Computer Vision
Machine Learning
Deep Learning

🥉 VC’s List Award,
🥇 Dean’s List Award,
🥈

Interpretable Bangla Sarcasm Detection using BERT and Explainable AI

2023 IEEE 13th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, March 2023

Abstract : A positive phrase or a sentence with an underlying negative motive is usually defined as sarcasm that is widely used in today's social media platforms such as Facebook, Twitter, Reddit, etc. In recent times active users in social media plat-forms are increasing dramatically which raises the need for an automated NLP-based system that can be utilized in various tasks such as determining market demand, sentiment analysis, threat detection, etc. However, since sarcasm usually implies the opposite meaning and its detection is frequently a challenging issue, data meaning extraction through an NLP-based model becomes more complicated. As a result, there has been a lot of study on sarcasm detection in English over the past several years, and there's been a noticeable improvement and yet sarcasm detection in the Bangla language's state remains the same. In this article, we present a BERT-based system that can achieve 99.60% while the utilized traditional machine learning algorithms are only capable of achieving 89.93%. Additionally, we have employed Local Interpretable Model-Agnostic Explanations that introduce explainability to our system. Moreover, we have utilized a newly collected bangla sarcasm dataset, BanglaSarc that was constructed specifically for the evaluation of this study. This dataset consists of fresh records of sarcastic and non-sarcastic comments, the majority of which are acquired from Facebook and YouTube comment sections.

Paper BibTeX

A Comprehensive Audio Dataset for Emotional Context Analysis in Advertisements

Submitted to: Data in Brief [under review]

Abstract : This dataset is useful because it includes audio features and statements that are linked to emotions. It can be used to recognize emotions and judge ads. A critical determinant in the success of an advertising campaign is the ability to discern when consumers attain a state of saturation, wherein the repetitive nature of advertisements for the same product causes fatigue. The dataset has a lot of promise for solving problems related to how well ads work and can be an important part of making advertising strategies more effective. There were several steps involved in gathering the data. It all started with a survey to find out what real people thought about different ads. The survey produced a textual dataset with different statements about emotions, such as Arousal, Valence, Dominance, Liking, and Purchase. Also, 100 audio ads from publicly broadcast media were gathered and saved in WAV format. At first, these audio files were split into 60-second chunks. After that, each 60-second clip was cut into sixsecond pieces, making a total of 1,000 audio clips. The dataset seamlessly combines these independent and dependent features, creating a comprehensive resource for further analysis. This dataset has a huge amount of potential for study into recognizing emotions, figuring out how well ads work, and setting standards for algorithms. It can be used by marketers, retailers, researchers, and businesses that want to improve their advertising campaigns and learn more about how people think about their products and how often they buy them. Furthermore, it offers a standardized set of data for testing and comparing the accuracy and effectiveness of classification algorithms on audio data.

A Smart Tutor Avatar for Mimicking Characters of Bangla Sign Language

Submitted to: 4th ICREST’25 [ACCEPTED]

Abstract : A smart tutor avatar teaches Bangla Sign Language (BSL) by mimicking its 49 characters, providing a visual and immersive learning experience to bridge communication gaps for the hearing-impaired. This research introduces a novel avatar system designed specifically for BSL, enhancing the learning process through interactive and engaging methods. The avatar uses machine learning and image processing to analyze hand gestures and replicate BSL signs, creating an interactive learning tool. Initial dataset pre-processing trains the system to produce accurate sign visuals. The user interface is intuitive, allowing learners to engage in physical exercises to improve sensorimotor skills, with immediate feedback provided via cosine similarity. While avatar tutors have been explored for American Sign Language, this study is pioneering in its approach to BSL. The proposed research has the potential to make BSL learning more accessible and enjoyable for a wider audience.

1. Machine Learning Projects

In these machine learning projects, various algorithms were applied to solve healthcare-related problems. For breast cancer prediction, models like linear regression, decision trees, and random forest were used to classify tumors as benign or malignant, with performance comparisons. In diabetes prediction, a Support Vector Machine model was employed to determine whether a person has diabetes. Additionally, linear regression was applied to predict medical insurance costs using a dataset from Kaggle. These projects showcase the versatility of machine learning in addressing healthcare challenges.

Project Link: GitHub Repository

2. Deep Learning Projects

Neural networks were applied to solve diverse classification in these deep learning projects. For breast cancer classification, a simple neural network was used to predict whether cancer is benign or malignant, employing ReLU and sigmoid activations for feature extraction and binary classification. Similarly, in the MNIST Handwritten Digit Classification project, a neural network was built to classify digits, using a similar structure with ReLU activations and a sigmoid layer for multi-class classification. Both projects showcase the power of neural networks in tackling medical diagnosis and image recognition challenges.

Project Link: GitHub Repository

3. NLP Projects

In these NLP projects, a Fake News Classifier was developed using LSTM to classify news articles as fake or real by leveraging the power of sequence-based learning on textual data. Additionally, sentiment analysis was performed on the Amazon Fine Food Reviews dataset, utilizing NLP techniques to analyze customer reviews and classify sentiments as either positive or negative. Both projects demonstrate the application of deep learning and natural language processing in addressing text classification challenges.

Project Link: GitHub Repository

4. Exploratory Data Analysis with Pyhton

Across these exploratory data analysis (EDA) projects, various datasets were explored to uncover meaningful insights and implement machine learning techniques. The Credit Card Fraud Detection project focused on analyzing a highly imbalanced dataset, using features derived from PCA to identify fraudulent transactions. The Zomato dataset involved both EDA and feature engineering to explore patterns in restaurant data. With the Irish dataset, univariate, bivariate, and multivariate analyses were conducted to understand relationships between variables. Lastly, the Titanic dataset combined EDA with logistic regression to predict passenger survival. These projects showcase the practical use of EDA across diverse datasets.

Project Link: GitHub Repository

[GithHub Code]

🌲 🌍