New submissions for Tue, 4 Jul 23 #388

@e-tornike

Keyword: contrastive

Improving the Transferability of Time Series Forecasting with Decomposition Adaptation

Authors: Yan Gao, Yan Wang, Qiang Wang
Arxiv: https://arxiv.org/abs/2307.00066
TLDR: Due to effective pattern mining and feature representation, neural forecasting models based on deep learning have achieved great progress. The premise of effective learning is to collect sufficient data. However, in time series forecasting, it is difficult to obtain enough data, which limits the performance of neural forecasting models. To alleviate the data scarcity limitation, we design Sequence Decomposition Adaptation Network (SeDAN) which is a novel transfer architecture to improve forecasting performance on the target domain by aligning transferable knowledge
Repo: None

Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation

Authors: Balamurali Murugesan, Rukhshanda Hussain, Rajarshi Bhattacharya, Ismail Ben Ayed, Jose Dolz
Arxiv: https://arxiv.org/abs/2307.00097
TLDR: Recently, CLIP-based approaches have exhibited remarkable performance on generalization and few-shot learning tasks, fueled by the power of contrastive language-vision pre-training. In particular, prompt tuning has emerged as an effective strategy to adapt the pre-trained language-vision models to downstream tasks by employing task-related textual tokens. Motivated by this progress, in this work we question whether other fundamental problems, such as weakly supervised semantic segmentation (WSSS),
Repo: None

Hierarchical Pretraining for Biomedical Term Embeddings

Authors: Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu Tian
Arxiv: https://arxiv.org/abs/2307.00266
TLDR: Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms with semantic embeddings via representation learning and use the low dimensional embedd
Repo: None

BioCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval

Authors: Qiao Jin, Won Kim, Qingyu Chen, Donald C. Comeau, Lana Yeganova, John Wilbur, Zhiyong Lu
Arxiv: https://arxiv.org/abs/2307.00589
TLDR: Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we introduce BioCPT, a first-of-its-kind Contrastively Pre-trained Transformer model for zero-shot biomedical IR
Repo: None

CLIMAX: An exploration of Classifier-Based Contrastive Explanations

Authors: Praharsh Nanavati, Ranjitha Prasad
Arxiv: https://arxiv.org/abs/2307.00680
TLDR: Explainable AI is an evolving area that deals with understanding the decision making of machine learning models so that these models are more transparent, accountable, and understandable for humans. In particular, post-hoc model-agnostic interpretable AI techniques explain the decisions of a black-box ML model for a single instance locally, without the knowledge of the intrinsic nature of the ML model. Despite their simplicity and capability in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations.
Repo: None

Resistive memory-based zero-shot liquid state machine for multimodal event data learning

Authors: Ning Lin, Shaocong Wang, Yi Li, Bo Wang, Shuhui Shi, Yangu He, Woyu Zhang, Yifei Yu, Yue Zhang, Xiaojuan Qi, Xiaoming Chen, Hao Jiang, Xumeng Zhang, Peng Lin, Xiaoxin Xu, Qi Liu, Zhongrui Wang, Dashan Shang, Ming Liu
Arxiv: https://arxiv.org/abs/2307.00771
TLDR: The human brain is a complex spiking neural network (SNN) that learns multimodal signals in a zero-shot manner by generalizing existing knowledge. Remarkably, the brain achieves this with minimal power consumption, using event-based signals that propagate within its structure. However, mimicking the human brain in neuromorphic hardware presents both hardware and software challenges. Hardware limitations, such as the slowdown of Moore's law and the von Neumann bottleneck, hinder the efficiency of digital
Repo: None

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Authors: Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang
Arxiv: https://arxiv.org/abs/2307.00862
TLDR: Vision-language tasks, such as VQA, SNLI-VE, and VCR are challenging because they require the model's reasoning ability to understand the semantics of the visual world and natural language. Supervised methods working for vision-language skills have been well-studied. However, solving these tasks in a zero-shot setting is less explored. Since Contrastive Language-Image Pre-training (CLIP) has shown remarkable zero-shot performance on image-text matching
Repo: None

Autism Spectrum Disorder Classification in Children based on Structural MRI Features Extracted using Contrastive Variational Autoencoder

Authors: Ruimin Ma, Ruitao Xie, Yanlin Wang, Jintao Meng, Yanjie Wei, Wenhui Xi, Yi Pan
Arxiv: https://arxiv.org/abs/2307.00976
TLDR: Autism spectrum disorder (ASD) is a highly disabling mental disease that brings significant impairments of social interaction ability to the patients, making early screening and intervention of ASD critical. With the development of machine learning and neuroimaging technology, extensive research has been conducted on machine classification of ASD based on structural MRI (s-MRI). However, most studies involve datasets in which participants are older than 5 years, but with mediocre predictive accuracy. In this paper
Repo: None

ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning

Authors: Yucheng Shi, Kaixiong Zhou, Ninghao Liu
Arxiv: https://arxiv.org/abs/2307.01053
TLDR: The recent contrastive learning methods, due to their effectiveness in representation learning, have been widely applied to modeling graph data. Random perturbation is widely used to build contrastive views for graph data, which however, could accidentally break graph structures and lead to suboptimal performance. In addition, graph data is usually highly abstract, so it is hard to extract intuitive meanings and design more informed augmentation schemes. Effective representations should preserve key characteristics in data and abandon superfluous information. In
Repo: None

Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis

Authors: Salman Ul Hassan Dar, Arman Ghanaat, Jannik Kahmann, Isabelle Ayx, Theano Papavassiliou, Stefan O. Schoenberg, Sandy Engelhardt
Arxiv: https://arxiv.org/abs/2307.01148
TLDR: Generative latent diffusion models have been established as state-of-the-art in data generation. One promising application is generation of realistic synthetic medical imaging data for open data sharing without compromising patient privacy. Despite the promise, the capacity of such models to memorize sensitive patient training data and synthesize samples showing high resemblance to training data samples is relatively unexplored. Here, we assess the memorization capacity of 3D latent diffusion model on photon-counting coronary computed tomography angi
Repo: None

Don't freeze: Finetune encoders for better Self-Supervised HAR

Authors: Vitor Fortes Rey, Dominique Nshimyimana, Paul Lukowicz
Arxiv: https://arxiv.org/abs/2307.01168
TLDR: Recently self-supervised learning has been proposed in the field of human activity recognition as a solution to the labelled data availability problem. The idea being that by using pretext tasks such as reconstruction or contrastive predictive coding, useful representations can be learned that then can be used for classification. Those approaches follow the pretrain, freeze and fine-tune procedure. In this paper we will show how a simple change - not freezing the representation - leads to substantial performance gains across pretext tasks. The
Repo: None

Keyword: data augmentation

Investigating Masking-based Data Generation in Language Models

Authors: Ed S. Ma
Arxiv: https://arxiv.org/abs/2307.00008
TLDR: The current era of natural language processing (NLP) has been defined by the prominence of pre-trained language models since the advent of BERT. A feature of BERT and models with similar architecture is the objective of masked language modeling, in which part of the input is intentionally masked and the model is trained to predict this piece of masked information. Data augmentation (DAG) is a data-driven technique widely used in machine learning, including research areas like computer vision and natural
Repo: None

Counterfactual Collaborative Reasoning

Authors: Jianchao Ji, Zelong Li, Shuyuan Xu, Max Xiong, Juntao Tan, Yingqiang Ge, Hao Wang, Yongfeng Zhang
Arxiv: https://arxiv.org/abs/2307.00165
TLDR: Causal reasoning and logical reasoning are two important types of reasoning abilities for human intelligence. However, their relationship has not been extensively explored under machine intelligence context. In this paper, we explore how the two reasoning abilities can be jointly modeled to enhance both accuracy and explainability of machine learning models. More specifically, by integrating two important types of reasoning ability -- counterfactual reasoning and (neural) logical reasoning -- we propose Counterfactual Collaborative Reasoning (CCR), which conducts
Repo: None

Unsupervised Coordinate-Based Video Denoising

Authors: Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi Li
Arxiv: https://arxiv.org/abs/2307.00179
TLDR: In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating feature maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the
Repo: None

Long-Tailed Continual Learning For Visual Food Recognition

Authors: Jiangpeng He, Luotao Lin, Jack Ma, Heather A. Eicher-Miller, Fengqing Zhu
Arxiv: https://arxiv.org/abs/2307.00183
TLDR: Deep learning based food recognition has achieved remarkable progress in predicting food types given an eating occasion image. However, there are two major obstacles that hinder deployment in real-world scenarios. First, as new foods appear sequentially over time, a trained model needs to learn the new classes continuously without causing catastrophic forgetting for already learned knowledge of existing food types. Second, the distribution of food images in real life is usually long-tailed as a small number of popular food types are consumed more frequently than others,
Repo: None

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

Authors: Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong Liu
Arxiv: https://arxiv.org/abs/2307.00280
TLDR: Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations. In this paper, we for the first time introduce SysNoise, a frequently occurring but often overlooked noise in the deep learning training-deployment cycle. In particular, SysNoise happens when the source training system switches to a disparate target system in deployments, where various tiny system mismatches add up to
Repo: None

CNN-BiLSTM model for English Handwriting Recognition: Comprehensive Evaluation on the IAM Dataset

Authors: Firat Kizilirmak, Berrin Yanikoglu
Arxiv: https://arxiv.org/abs/2307.00664
TLDR: We present a CNN-BiLSTM system for the problem of offline English handwriting recognition, with extensive evaluations on the public IAM dataset, including the effects of model size, data augmentation and the lexicon. Our best model achieves 3.59% CER and 9.44% WER using a CNN-BiLSTM network with CTC layer. Test time augmentation with rotation and shear transformations applied to the input image is proposed to increase recognition
Repo: None

ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning

Authors: Yucheng Shi, Kaixiong Zhou, Ninghao Liu
Arxiv: https://arxiv.org/abs/2307.01053
TLDR: The recent contrastive learning methods, due to their effectiveness in representation learning, have been widely applied to modeling graph data. Random perturbation is widely used to build contrastive views for graph data, which however, could accidentally break graph structures and lead to suboptimal performance. In addition, graph data is usually highly abstract, so it is hard to extract intuitive meanings and design more informed augmentation schemes. Effective representations should preserve key characteristics in data and abandon superfluous information. In
Repo: None

Keyword: knowledge graph

Hierarchical Pretraining for Biomedical Term Embeddings

Authors: Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu Tian
Arxiv: https://arxiv.org/abs/2307.00266
TLDR: Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms with semantic embeddings via representation learning and use the low dimensional embedd
Repo: None

PersonaGen: A Tool for Generating Personas from User Feedback

Authors: Xishuo Zhang, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Anqi Ren, Chetan Arora
Arxiv: https://arxiv.org/abs/2307.00390
TLDR: Personas are crucial in software development processes, particularly in agile settings. However, no effective tools are available for generating personas from user feedback in agile software development processes. To fill this gap, we propose a novel tool that uses the GPT-4 model and knowledge graph to generate persona templates from well-processed user feedback, facilitating requirement analysis in agile software development. We developed a tool called PersonaGen. We evaluated PersonaGen using qualitative feedback from a small-scale user study
Repo: None

Semi-automated Thermal Envelope Model Setup for Adaptive Model Predictive Control with Event-triggered System Identification

Authors: Lu Wan, Xiaobing Dai, Torsten Welfonder, Ekaterina Petrova, Pieter Pauwels
Arxiv: https://arxiv.org/abs/2307.00638
TLDR: To reach carbon neutrality in the middle of this century, smart controls for building energy systems are urgently required. Model predictive control (MPC) demonstrates great potential in improving the performance of heating ventilation and air-conditioning (HVAC) systems, whereas its wide application in the building sector is impeded by the considerable manual efforts involved in setting up the control-oriented model. To facilitate the system identification (SI) of the building envelope as well as the configuration of the MPC
Repo: None

CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction

Authors: Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan Han
Arxiv: https://arxiv.org/abs/2307.00769
TLDR: In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not supporting multi-tasks and not supporting automatic updates. In this work, we present CollabKG, a learnable human-machine-cooperative IE toolkit for KG and EKG construction.
Repo: None

Iterative Zero-Shot LLM Prompting for Knowledge Graph Construction

Authors: Salvatore Carta, Alessandro Giuliani, Leonardo Piano, Alessandro Sebastian Podda, Livio Pompianu, Sandro Gabriele Tiddia
Arxiv: https://arxiv.org/abs/2307.01128
TLDR: In the current digitalization era, capturing and effectively representing knowledge is crucial in most real-world scenarios. In this context, knowledge graphs represent a potent tool for retrieving and organizing a vast amount of information in a properly interconnected and interpretable structure. However, their generation is still challenging and often requires considerable human effort and domain expertise, hampering the scalability and flexibility across different application fields. This paper proposes an innovative knowledge graph generation approach that leverages the potential of the latest generative
Repo: None

Keyword: legal

From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and Privacy

Authors: Maanak Gupta, CharanKumar Akiri, Kshitiz Aryal, Eli Parker, Lopamudra Praharaj
Arxiv: https://arxiv.org/abs/2307.00691
TLDR: Undoubtedly, the evolution of Generative AI (GenAI) models has been the highlight of digital transformation in the year 2022. As the different GenAI models like ChatGPT and Google Bard continue to foster their complexity and capability, it's critical to understand its consequences from a cybersecurity perspective. Several instances recently have demonstrated the use of GenAI tools in both the defensive and offensive side of cybersecurity, and focusing on the social, ethical and privacy implications this technology possesses. This research paper
Repo: None

Keyword: mixup

Augmenting Deep Learning Adaptation for Wearable Sensor Data through Combined Temporal-Frequency Image Encoding

Authors: Yidong Zhu, Md Mahmudur Rahman, Mohammad Arif Ul Alam
Arxiv: https://arxiv.org/abs/2307.00883
TLDR: Deep learning advancements have revolutionized scalable classification in many domains including computer vision. However, when it comes to wearable-based classification and domain adaptation, existing computer vision-based deep learning architectures and pretrained models trained on thousands of labeled images for months fall short. This is primarily because wearable sensor data necessitates sensor-specific preprocessing, architectural modification, and extensive data collection. To overcome these challenges, researchers have proposed encoding of wearable temporal sensor data in images using recurrence plots. In this
Repo: None

Keyword: multi-task

CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction

Authors: Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan Han
Arxiv: https://arxiv.org/abs/2307.00769
TLDR: In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not supporting multi-tasks and not supporting automatic updates. In this work, we present CollabKG, a learnable human-machine-cooperative IE toolkit for KG and EKG construction.
Repo: None

Towards Suicide Prevention from Bipolar Disorder with Temporal Symptom-Aware Multitask Learning

Authors: Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, Jinyoung Han
Arxiv: https://arxiv.org/abs/2307.00995
TLDR: Bipolar disorder (BD) is closely associated with an increased risk of suicide. However, while the prior work has revealed valuable insight into understanding the behavior of BD patients on social media, little attention has been paid to developing a model that can predict the future suicidality of a BD patient. Therefore, this study proposes a multi-task learning model for predicting the future suicidality of patients by jointly learning current symptoms. We build a novel BD dataset clinically validated by psychiatrists
Repo: https://github.com/leedaeuni/Temporal-Symptom-Aware-Multitask-Learning-KDD23

Visual Instruction Tuning with Polite Flamingo

Authors: Delong Chen, Jianfeng Liu, Wenliang Dai, Baoyuan Wang
Arxiv: https://arxiv.org/abs/2307.01003
TLDR: Recent research has demonstrated that the multi-task fine-tuning of multi-modal Large Language Models (LLMs) using an assortment of annotated downstream vision-language datasets significantly enhances their performance. Yet, during this process, a side effect, which we termed as the "multi-modality alignment tax", surfaces. This side effect negatively impacts the model's ability to format responses appropriately -- for instance, its "politeness" -- due to the overly succinct and unform
Repo: None

Keyword: robustness

RObotic MAnipulation Network (ROMAN) -- Hybrid Hierarchical Learning for Solving Complex Sequential Tasks

Authors: Eleftherios Triantafyllidis, Fernando Acero, Zhaocheng Liu, Zhibin Li
Arxiv: https://arxiv.org/abs/2307.00125
TLDR: Solving long sequential tasks poses a significant challenge in embodied artificial intelligence. Enabling a robotic system to perform diverse sequential tasks with a broad range of manipulation skills is an active area of research. In this work, we present a Hybrid Hierarchical Learning framework, the Robotic Manipulation Network (ROMAN), to address the challenge of solving multiple complex tasks over long time horizons in robotic manipulation. ROMAN achieves task versatility and robust failure recovery by integrating behavioural cloning, imitation learning,
Repo: None

Unsupervised Coordinate-Based Video Denoising

Authors: Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi Li
Arxiv: https://arxiv.org/abs/2307.00179
TLDR: In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating feature maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the
Repo: None

More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data

Authors: Andrew Kean Gao
Arxiv: https://arxiv.org/abs/2307.00213
TLDR: Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings, where high quality labeled data can be challenging or expensive to obtain. This study investigates the efficacy of Compact Convolutional Transformers (CCT) for robust medical image classification with limited data, addressing a key issue faced by conventional Vision Transformers - their requirement for large datasets. A hybrid
Repo: None

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

Authors: Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong Liu
Arxiv: https://arxiv.org/abs/2307.00280
TLDR: Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations. In this paper, we for the first time introduce SysNoise, a frequently occurring but often overlooked noise in the deep learning training-deployment cycle. In particular, SysNoise happens when the source training system switches to a disparate target system in deployments, where various tiny system mismatches add up to
Repo: None

Launcher Attitude Control based on Incremental Nonlinear Dynamic Inversion: A Feasibility Study Towards Fast and Robust Design Approaches

Authors: Pedro Simplício, Paul Acquatella, Samir Bennani
Arxiv: https://arxiv.org/abs/2307.00372
TLDR: The so-called "New Space era" has seen a disruptive change in the business models and manufacturing technologies of launch vehicle companies. However, limited consideration has been given to the benefits that innovation in control theory can bring; not only in terms of increasing the limits of performance but also reducing mission preparation or "missionisation" efforts. Moreover, there is a gap between the current state-of-practice that still relies on linear controls and other modern control techniques that could bring relevant improvements in
Repo: None

CasTGAN: Cascaded Generative Adversarial Network for Realistic Tabular Data Synthesis

Authors: Abdallah Alshantti, Damiano Varagnolo, Adil Rasheed, Aria Rahmati, Frank Westad
Arxiv: https://arxiv.org/abs/2307.00384
TLDR: Generative adversarial networks (GANs) have drawn considerable attention in recent years for their proven capability in generating synthetic data which can be utilized for multiple purposes. While GANs have demonstrated tremendous successes in producing synthetic data samples that replicate the dynamics of the original datasets, the validity of the synthetic data and the underlying privacy concerns represent major challenges which are not sufficiently addressed. In this work, we design a cascaded tabular GAN framework (CasTGAN) for generating realistic tab
Repo: None

Brightness-Restricted Adversarial Attack Patch

Authors: Mingzhen Shao
Arxiv: https://arxiv.org/abs/2307.00421
TLDR: Adversarial attack patches have gained increasing attention due to their practical applicability in physical-world scenarios. However, the bright colors used in attack patches represent a significant drawback, as they can be easily identified by human observers. Moreover, even though these attacks have been highly successful in deceiving target networks, which specific features of the attack patch contribute to its success are still unknown. Our paper introduces a brightness-restricted patch (BrPatch) that uses optical characteristics to effectively reduce conspicuousness
Repo: None

Query-Efficient Decision-based Black-Box Patch Attack

Authors: Zhaoyu Chen, Bo Li, Shuang Wu, Shouhong Ding, Wenqiang Zhang
Arxiv: https://arxiv.org/abs/2307.00477
TLDR: Deep neural networks (DNNs) have been shown to be highly vulnerable to imperceptible adversarial perturbations. As a complementary type of adversary, patch attacks that introduce perceptible perturbations to the images have attracted the interest of researchers. Existing patch attacks rely on the architecture of the model or the probabilities of predictions and perform poorly in the decision-based setting, which can still construct a perturbation with the minimal information exposed -- the top-1 predicted label
Repo: None

POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static Environments

Authors: Jingxing Qian, Veronica Chatrath, James Servos, Aaron Mavrinac, Wolfram Burgard, Steven L. Waslander, Angela P. Schoellig
Arxiv: https://arxiv.org/abs/2307.00488
TLDR: Simultaneous localization and mapping (SLAM) in slowly varying scenes is important for long-term robot task completion. Failing to detect scene changes may lead to inaccurate maps and, ultimately, lost robots. Classical SLAM algorithms assume static scenes, and recent works take dynamics into account, but require scene changes to be observed in consecutive frames. Semi-static scenes, wherein objects appear, disappear, or move slowly over time, are often overlooked, yet are critical for long-term
Repo: None

Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model

Authors: Daiki Koge, Naoaki Ono, Shigehiko Kanaya
Arxiv: https://arxiv.org/abs/2307.00623
TLDR: In data-driven drug discovery, designing molecular descriptors is a very important task. Deep generative models such as variational autoencoders (VAEs) offer a potential solution by designing descriptors as probabilistic latent vectors derived from molecular structures. These models can be trained on large datasets, which have only molecular structures, and applied to transfer learning. Nevertheless, the approximate posterior distribution of the latent vectors of the usual VAE assumes a simple multivariate Gaussian distribution with
Repo: None

SDC-HSDD-NDSA: Structure Detecting Cluster by Hierarchical Secondary Directed Differential with Normalized Density and Self-Adaption

Authors: Hao Shu
Arxiv: https://arxiv.org/abs/2307.00677
TLDR: Density-based clustering could be the most popular clustering algorithm since it can identify clusters of arbitrary shape as long as different (high-density) clusters are separated by low-density regions. However, the requirement of the separateness of clusters by low-density regions is not trivial since a high-density region might have different structures which should be clustered into different groups. Such a situation demonstrates the main flaw of all previous density-based clustering algorithms we have known--
Repo: None

Scratch Team of Single-Rotor Robots and Decentralized Cooperative Transportation with Robot Failure

Authors: Koshi Oishi, Yasushi Amano, Jimbo Tomohiko
Arxiv: https://arxiv.org/abs/2307.00705
TLDR: Achieving cooperative transportation by teams of aerial robots has been attracting attention owing to its flexibility with respect to payloads and robustness against failures. In this paper, we propose a flexible decentralized controller for the number of robots and the shapes of payloads in a cooperative transport task using multiple single-rotor robots. Our controller is robust to mass and center of mass fluctuations and robot failures. Moreover, asymptotic stability against dynamics errors is guaranteed. Additionally, the controller supports heter
Repo: None

Feasibility of Universal Anomaly Detection without Knowing the Abnormality in Medical Images

Authors: Can Cui, Yaohong Wang, Shunxing Bao, Yucheng Tang, Ruining Deng, Lucas W. Remedios, Zuhayr Asad, Joseph T. Roland, Ken S. Lau, Qi Liu, Lori A. Coburn, Keith T. Wilson, Bennett A. Landman, Yuankai Huo
Arxiv: https://arxiv.org/abs/2307.00750
TLDR: Many anomaly detection approaches, especially deep learning methods, have been recently developed to identify abnormal image morphology by only employing normal images during training. Unfortunately, many prior anomaly detection methods were optimized for a specific "known" abnormality (e.g., brain tumor, bone fracture, cell types). Moreover, even though only the normal images were used in the training process, the abnormal images were often insufficiently employed during the validation process (e.g., epoch selection, hyper-parameter
Repo: None

ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection

Authors: Yuhang Chen, Chaoyun Zhang, Minghua Ma, Yudong Liu, Ruomeng Ding, Bowen Li, Shilin He, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
Arxiv: https://arxiv.org/abs/2307.00754
TLDR: Anomaly detection in multivariate time series data is of paramount importance for ensuring the efficient operation of large-scale systems across diverse domains. However, accurately detecting anomalies in such data poses significant challenges. Existing approaches, including forecasting and reconstruction-based methods, struggle to address these challenges effectively. To overcome these limitations, we propose a novel anomaly detection framework named ImDiffusion, which combines time series imputation and diffusion models to achieve accurate and robust anomaly detection. The imputation-based
Repo: None

Learning Noise-Resistant Image Representation by Aligning Clean and Noisy Domains

Authors: Yanhui Guo, Xiaolin Wu, Fangzhou Luo
Arxiv: https://arxiv.org/abs/2307.00761
TLDR: Recent supervised and unsupervised image representation learning algorithms have achieved quantum leaps. However, these techniques do not account for representation resilience against noise in their design paradigms. Consequently, these effective methods suffer failure when confronted with noise outside the training distribution, such as complicated real-world noise that is usually opaque to model training. To address this issue, dual domains are optimized to separately model a canonical space for noisy representations, namely the Noise-Robust (NR) domain, and a
Repo: None

Cryptography and Key Management Schemes for Wireless Sensor Networks

Authors: Jaydip Sen
Arxiv: https://arxiv.org/abs/2307.00872
TLDR: Wireless sensor networks (WSNs) are made up of a large number of tiny sensors, which can sense, analyze, and communicate information about the outside world. These networks play a significant role in a broad range of fields, from crucial military surveillance applications to monitoring building security. Key management in WSNs is a critical task. While the security and integrity of messages communicated through these networks and the authenticity of the nodes are dependent on the robustness of the key management schemes,
Repo: None

Enhancing the Robustness of QMIX against State-adversarial Attacks

Authors: Weiran Guo, Guanjun Liu, Ziyuan Zhou, Ling Wang, Jiacun Wang
Arxiv: https://arxiv.org/abs/2307.00907
TLDR: Deep reinforcement learning (DRL) performance is generally impacted by state-adversarial attacks, a perturbation applied to an agent's observation. Most recent research has concentrated on robust single-agent reinforcement learning (SARL) algorithms. Still, there has yet to be much work on robust multi-agent reinforcement learning. Using QMIX, one of
Repo: None

Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration

Authors: Kemal Oksuz, Tom Joy, Puneet K. Dokania
Arxiv: https://arxiv.org/abs/2307.00934
TLDR: The current approach for testing the robustness of object detectors suffers from serious deficiencies such as improper methods of performing out-of-distribution detection and using calibration metrics which do not consider both localisation and classification quality. In this work, we address these issues, and introduce the Self-Aware Object Detection (SAOD) task, a unified testing framework which respects and adheres to the challenges that object detectors face in safety-critical environments such as autonomous driving. Specifically, the SA
Repo: None

SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation

Authors: Liangliang Yao, Haobo Zuo, Guangze Zheng, Changhong Fu, Jia Pan
Arxiv: https://arxiv.org/abs/2307.01024
TLDR: Domain adaptation (DA) has demonstrated significant promise for real-time nighttime unmanned aerial vehicle (UAV) tracking. However, the state-of-the-art (SOTA) DA still lacks the potential object with accurate pixel-level location and boundary to generate the high-quality target domain training sample. This key issue constrains the transfer of data from raw nighttime images to SAM-DA, i.e., DA-DA. Therefore, it is necessary to optimize the
Repo: None

Anonymous and Copy-Robust Delegations for Liquid Democracy

Authors: Markus Utke, Ulrike Schmidt-Kraepelin
Arxiv: https://arxiv.org/abs/2307.01174
TLDR: Liquid democracy with ranked delegations is a novel voting scheme that unites the practicability of representative democracy with the idealistic appeal of direct democracy: Every voter decides between casting their vote on a question at hand or delegating their voting weight to some other, trusted agent. Delegations are transitive, and since voters may end up in a delegation cycle, they are encouraged to indicate not only a single delegate, but a set of potential delegates and a ranking among them. Based
Repo: None

Keyword: scholarly

Numerical Association Rule Mining: A Systematic Literature Review

Authors: Minakshi Kaushik, Rahul Sharma, Iztok Fister Jr., Dirk Draheim
Arxiv: https://arxiv.org/abs/2307.00662
TLDR: Numerical association rule mining is a widely used variant of the association rule mining technique, and it has been extensively used in discovering patterns and relationships in numerical data. Initially, researchers and scientists integrated numerical attributes in association rule mining using various discretization approaches; however, over time, a plethora of alternative methods have emerged in this field. Unfortunately, the increase of alternative methods has resulted in a significant knowledge gap in understanding diverse techniques employed in numerical association rule mining -- this paper attempts to
Repo: None

Keyword: semantic similarity

Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution

Authors: Jorge Martinez-Gil
Arxiv: https://arxiv.org/abs/2307.00925
TLDR: Semantic similarity measures are widely used in natural language processing to catalyze various computer-related tasks. However, no single semantic similarity measure is the most appropriate for all tasks, and researchers often use ensemble strategies to ensure performance. This research work proposes a method for automatically designing semantic similarity ensembles. In fact, our proposed method uses grammatical evolution, for the first time, to automatically select and aggregate measures from a pool of candidates to create an ensemble that maximizes correlation to human
Repo: None

Keyword: similarity measure

Applications of Binary Similarity and Distance Measures

Authors: Manoj Muniswamaiah, Tilak Agerwala, Charles C. Tappert
Arxiv: https://arxiv.org/abs/2307.00411
TLDR: In the recent past, binary similarity measures have been applied in solving biometric identification problems, including fingerprint, handwritten character detection, and in iris image recognition. The application of the relevant measurements has also resulted in more accurate data analysis. This paper surveys the applicability of binary similarity and distance measures in various fields.
Repo: None

Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution

Authors: Jorge Martinez-Gil
Arxiv: https://arxiv.org/abs/2307.00925
TLDR: Semantic similarity measures are widely used in natural language processing to catalyze various computer-related tasks. However, no single semantic similarity measure is the most appropriate for all tasks, and researchers often use ensemble strategies to ensure performance. This research work proposes a method for automatically designing semantic similarity ensembles. In fact, our proposed method uses grammatical evolution, for the first time, to automatically select and aggregate measures from a pool of candidates to create an ensemble that maximizes correlation to human
Repo: None

Keyword: summarization

Challenges in Domain-Specific Abstractive Summarization and How to Overcome them

Authors: Anum Afzal, Juraj Vladika, Daniel Braun, Florian Matthes
Arxiv: https://arxiv.org/abs/2307.00963
TLDR: Large Language Models work quite well with general-purpose data and many tasks in Natural Language Processing. However, they show several limitations when used for a task such as domain-specific abstractive text summarization. This paper identifies three of those limitations as research problems in the context of abstractive text summarization: 1) Quadratic complexity of transformer-based models with respect to the input text length; 2) Model Hallucination, which is a model's ability to generate factually
Repo: None

Keyword: text generation

More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data

Authors: Andrew Kean Gao
Arxiv: https://arxiv.org/abs/2307.00213
TLDR: Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings, where high quality labeled data can be challenging or expensive to obtain. This study investigates the efficacy of Compact Convolutional Transformers (CCT) for robust medical image classification with limited data, addressing a key issue faced by conventional Vision Transformers - their requirement for large datasets. A hybrid
Repo: None

PatternGPT: A Pattern-Driven Framework for Large Language Model Text Generation

Authors: Le Xiao, Xin Shan
Arxiv: https://arxiv.org/abs/2307.00470
TLDR: Large language models (LLMs) have shown excellent text generation capabilities, but there is still much space for improvement in accuracy, sometimes with grammatical errors, semantic inaccuracies, and contextual incoherence, which seriously affect the reliability of the models. These problems may originate from the difficulties and limitations encountered in the pattern extraction stage of large language models. How to utilize the generative power of large language models to generate as many possible patterns that help solve problems and find the optimal patterns from them
Repo: None