-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Labels
abstract meaning representationargument miningcitation context analysiscomputational social sciencecontrastivecross-language information retrievalcross-lingual information retrievaldata augmentationextreme multi-labelknowledge discoveryknowledge graphlegallegal textmixupmulti-taskparaphrasepassage generationplagiarismrobustnessscholarlyscholarly document processingsemantic similaritysimilarity measuresimplificationsummarizationtext generation
Description
Keyword: contrastive
Improving the Transferability of Time Series Forecasting with Decomposition Adaptation
Authors: Yan Gao, Yan Wang, Qiang WangArxiv: https://arxiv.org/abs/2307.00066
TLDR: Due to effective pattern mining and feature representation, neural forecasting models based on deep learning have achieved great progress. The premise of effective learning is to collect sufficient data. However, in time series forecasting, it is difficult to obtain enough data, which limits the performance of neural forecasting Models. To alleviate the data scarcity limitation, we design Sequence Decomposition Adaptation Network (SeDAN) which is a novel transfer architecture to improve forecasting performance on the target domain by aligning transferable knowledge
Repo: None
Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation
Authors: Balamurali Murugesan, Rukhshanda Hussain, Rajarshi Bhattacharya, Ismail Ben Ayed, Jose DolzArxiv: https://arxiv.org/abs/2307.00097
TLDR: Recently, CLIP-based approaches have exhibited remarkable performance on generalization and few-shot learning tasks, fueled by the power of contrastive language-vision pre-training. In particular, prompt tuning has emerged as an effective strategy to adapt the pre-trained language-visit models to downstream tasks by employing task-related textual tokens. Motivated by this progress, in this work we question whether other fundamental problems, such as weakly supervised semantic segmentation (WSSS),
Repo: None
Hierarchical Pretraining for Biomedical Term Embeddings
Authors: Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu TianArxiv: https://arxiv.org/abs/2307.00266
TLDR: Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms with semantic embeddings via representation learning and use the low dimensional embedd
Repo: None
BioCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval
Authors: Qiao Jin, Won Kim, Qingyu Chen, Donald C. Comeau, Lana Yeganova, John Wilbur, Zhiyong LuArxiv: https://arxiv.org/abs/2307.00589
TLDR: Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we introduce BioCPT, a first-of-its-kind Contrastively Pre-trained Transformer model for zero-shot biomedical IR
Repo: None
CLIMAX: An exploration of Classifier-Based Contrastive Explanations
Authors: Praharsh Nanavati, Ranjitha PrasadArxiv: https://arxiv.org/abs/2307.00680
TLDR: Explainable AI is an evolving area that deals with understanding the decision making of machine learning models so that these models are more transparent, accountable, and understandable for humans. In particular, post-hoc model-agnostic interpretable AI techniques explain the decisions of a black-box ML model for a single instance locally, without the knowledge of the intrinsic nature of the ML model. Despite their simplicity and capability in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations.
Repo: None
Resistive memory-based zero-shot liquid state machine for multimodal event data learning
Authors: Ning Lin, Shaocong Wang, Yi Li, Bo Wang, Shuhui Shi, Yangu He, Woyu Zhang, Yifei Yu, Yue Zhang, Xiaojuan Qi, Xiaoming Chen, Hao Jiang, Xumeng Zhang, Peng Lin, Xiaoxin Xu, Qi Liu, Zhongrui Wang, Dashan Shang, Ming LiuArxiv: https://arxiv.org/abs/2307.00771
TLDR: The human brain is a complex spiking neural network (SNN) that learns multimodal signals in a zero-shot manner by generalizing existing knowledge. Remarkably, the brain achieves this with minimal power consumption, using event-based signals that propagate within its structure. However, mimicking the human brain in neuromorphic hardware presents both hardware and software challenges. Hardware limitations, such as the slowdown of Moore's law and the von Neumann bottleneck, hinder the efficiency of digital
Repo: None
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Authors: Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu ChangArxiv: https://arxiv.org/abs/2307.00862
TLDR: Vision-language tasks, such as VQA, SNLI-VE, and VCR are challenging because they require the model's reasoning ability to understand the semantics of the visual world and natural language. Supervised methods working for vision-language skills have been well-studied. However, solving these tasks in a zero-shot setting is less explored. Since Contrastive Language-Image Pre-training (CLIP) has shown remarkable zero-stroke performance on image-text matching
Repo: None
Autism Spectrum Disorder Classification in Children based on Structural MRI Features Extracted using Contrastive Variational Autoencoder
Authors: Ruimin Ma, Ruitao Xie, Yanlin Wang, Jintao Meng, Yanjie Wei, Wenhui Xi, Yi PanArxiv: https://arxiv.org/abs/2307.00976
TLDR: Autism spectrum disorder (ASD) is a highly disabling mental disease that brings significant impairments of social interaction ability to the patients, making early screening and intervention of ASD critical. With the development of the machine learning and neuroimaging technology, extensive research has been conducted on machine classification of ASD based on structural MRI (s-MRI). However, most studies involve with datasets where participants' age are above 5-year-old, but, with mediocre predictive accuracy. In this paper
Repo: None
ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning
Authors: Yucheng Shi, Kaixiong Zhou, Ninghao LiuArxiv: https://arxiv.org/abs/2307.01053
TLDR: The recent contrastive learning methods, due to their effectiveness in representation learning, have been widely applied to modeling graph data. Random perturbation is widely used to build contrastive views for graph data, which however, could accidentally break graph structures and lead to suboptimal performance. In addition, graph data is usually highly abstract, so it is hard to extract intuitive meanings and design more informed augmentation schemes. Effective representations should preserve key characteristics in data and abandon superfluous information. In
Repo: None
Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis
Authors: Salman Ul Hassan Dar, Arman Ghanaat, Jannik Kahmann, Isabelle Ayx, Theano Papavassiliou, Stefan O. Schoenberg, Sandy EngelhardtArxiv: https://arxiv.org/abs/2307.01148
TLDR: Generative latent diffusion models have been established as state-of-the-art in data generation. One promising application is generation of realistic synthetic medical imaging data for open data sharing without compromising patient privacy. Despite the promise, the capacity of such models to memorize sensitive patient training data and synthesize samples showing high resemblance to training data samples is relatively unexplored. Here, we assess the memorization capacity of 3D latent diffusion model on photon-counting coronary computed tomography angi
Repo: None
Don't freeze: Finetune encoders for better Self-Supervised HAR
Authors: Vitor Fortes Rey, Dominique Nshimyimana, Paul LukowiczArxiv: https://arxiv.org/abs/2307.01168
TLDR: Recently self-supervised learning has been proposed in the field of human activity recognition as a solution to the labelled data availability problem. The idea being that by using pretext tasks such as reconstruction or contrastive predictive coding, useful representations can be learned that then can be used for classification. Those approaches follow the pretrain, freeze and fine-tune procedure. In this paper we will show how a simple change - not freezing the representation - leads to substantial performance gains across pretext tasks. The
Repo: None
Keyword: data augmentation
Investigating Masking-based Data Generation in Language Models
Authors: Ed S. MaArxiv: https://arxiv.org/abs/2307.00008
TLDR: The current era of natural language processing (NLP) has been defined by the prominence of pre-trained language models since the advent of BERT. A feature of Bert and models with similar architecture is the objective of masked language modeling, in which part of the input is intentionally masked and the model is trained to predict this piece of masked information. Data augmentation (DAG) is a data-driven technique widely used in machine learning, including research areas like computer vision and natural
Repo: None
Counterfactual Collaborative Reasoning
Authors: Jianchao Ji, Zelong Li, Shuyuan Xu, Max Xiong, Juntao Tan, Yingqiang Ge, Hao Wang, Yongfeng ZhangArxiv: https://arxiv.org/abs/2307.00165
TLDR: Causal reasoning and logical reasoning are two important types of reasoning abilities for human intelligence. However, their relationship has not been extensively explored under machine intelligence context. In this paper, we explore how the two reasoning abilities can be jointly modeled to enhance both accuracy and explainability of machine learning models. More specifically, by integrating two important Types of reasoning ability -- counterfactual reasoning and (neural) logical reasoning -- we propose Counterfactual Collaborative Reasoning (CCR), which conducts
Repo: None
Unsupervised Coordinate-Based Video Denoising
Authors: Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi LiArxiv: https://arxiv.org/abs/2307.00179
TLDR: In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating features maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the
Repo: None
Long-Tailed Continual Learning For Visual Food Recognition
Authors: Jiangpeng He, Luotao Lin, Jack Ma, Heather A. Eicher-Miller, Fengqing ZhuArxiv: https://arxiv.org/abs/2307.00183
TLDR: Deep learning based food recognition has achieved remarkable progress in predicting food types given an eating occasion image. However, there are two major obstacles that hinder deployment in real world scenario. First, as new foods appear sequentially overtime, a trained model needs to learn the new classes continuously without causing catastrophic forgetting for already learned knowledge of existing food types. Second, the distribution of food images in real life is usually long-tailed as a small number of popular food types are consumed more frequently than others,
Repo: None
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency
Authors: Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong LiuArxiv: https://arxiv.org/abs/2307.00280
TLDR: Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations. In this paper, we for the first time introduce SysNoise, a frequently occurred but often overlooked noise in the deep learning training-deployment cycle. In particular, Sys noise happens when the source training system switches to a disparate target system in deployments, where various tiny system mismatch adds up to
Repo: None
CNN-BiLSTM model for English Handwriting Recognition: Comprehensive Evaluation on the IAM Dataset
Authors: Firat Kizilirmak, Berrin YanikogluArxiv: https://arxiv.org/abs/2307.00664
TLDR: We present a CNN-BiLSTM system for the problem of offline English handwriting recognition, with extensive evaluations on the public IAM dataset, including the effects of model size, data augmentation and the lexicon. Our best model achieves 3.59\% CER and 9.44\% WER using CNN-biLST M network with CTC layer. Test time augmentation with rotation and shear transformations applied to the input image, is proposed to increase recognition
Repo: None
ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning
Authors: Yucheng Shi, Kaixiong Zhou, Ninghao LiuArxiv: https://arxiv.org/abs/2307.01053
TLDR: The recent contrastive learning methods, due to their effectiveness in representation learning, have been widely applied to modeling graph data. Random perturbation is widely used to build contrastive views for graph data, which however, could accidentally break graph structures and lead to suboptimal performance. In addition, graph data is usually highly abstract, so it is hard to extract intuitive meanings and design more informed augmentation schemes. Effective representations should preserve key characteristics in data and abandon superfluous information. In
Repo: None
Keyword: knowledge graph
Hierarchical Pretraining for Biomedical Term Embeddings
Authors: Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu TianArxiv: https://arxiv.org/abs/2307.00266
TLDR: Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms with semantic embeddings via representation learning and use the low dimensional embedd
Repo: None
PersonaGen: A Tool for Generating Personas from User Feedback
Authors: Xishuo Zhang, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Anqi Ren, Chetan AroraArxiv: https://arxiv.org/abs/2307.00390
TLDR: Personas are crucial in software development processes, particularly in agile settings. However, no effective tools are available for generating personas from user feedback in agile Software development processes. To fill this gap, we propose a novel tool that uses the GPT-4 model and knowledge graph to generate persona templates from well-processed user feedback, facilitating requirement analysis in agile software development. We developed a tool called PersonaGen. We evaluated PersonaGen using qualitative feedback from a small-scale user study
Repo: None
Semi-automated Thermal Envelope Model Setup for Adaptive Model Predictive Control with Event-triggered System Identification
Authors: Lu Wan, Xiaobing Dai, Torsten Welfonder, Ekaterina Petrova, Pieter PauwelsArxiv: https://arxiv.org/abs/2307.00638
TLDR: To reach carbon neutrality in the middle of this century, smart controls for building energy systems are urgently required. Model predictive control (MPC) demonstrates great potential in improving the performance of heating ventilation and air-conditioning (HVAC) systems, whereas its wide application in the building sector is impeded by the considerable manual efforts involved in setting up the control-oriented model. To facilitate the system identification (SI) of the building envelope as well as the configuration of the MPC
Repo: None
CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction
Authors: Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan HanArxiv: https://arxiv.org/abs/2307.00769
TLDR: In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not supporting multi-tasks, not supporting automatic updates. In this work, we present CollabKG, a learnable human-machine-cooperative IE toolkit for KG and eKG construction.
Repo: None
Iterative Zero-Shot LLM Prompting for Knowledge Graph Construction
Authors: Salvatore Carta, Alessandro Giuliani, Leonardo Piano, Alessandro Sebastian Podda, Livio Pompianu, Sandro Gabriele TiddiaArxiv: https://arxiv.org/abs/2307.01128
TLDR: In the current digitalization era, capturing and effectively representing knowledge is crucial in most real-world scenarios. In this context, knowledge graphs represent a potent tool for retrieving and organizing a vast amount of information in a properly interconnected and interpretable structure. However, their generation is still challenging and often requires considerable human effort and domain expertise, hampering the scalability and flexibility across different application fields. This paper proposes an innovative knowledge graph generation approach that leverages the potential of the latest generative
Repo: None
Keyword: legal
From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and Privacy
Authors: Maanak Gupta, CharanKumar Akiri, Kshitiz Aryal, Eli Parker, Lopamudra PraharajArxiv: https://arxiv.org/abs/2307.00691
TLDR: Undoubtedly, the evolution of Generative AI (GenAI) models has been the highlight of digital transformation in the year 2022. As the different GenAI models like ChatGPT and Google Bot continue to foster their complexity and capability, it's critical to understand its consequences from a cybersecurity perspective. Several instances recently have demonstrated the use of GenAI tools in both the defensive and offensive side of cybersecurity, and focusing on the social, ethical and privacy implications this technology possesses. This research paper
Repo: None
Keyword: mixup
Augmenting Deep Learning Adaptation for Wearable Sensor Data through Combined Temporal-Frequency Image Encoding
Authors: Yidong Zhu, Md Mahmudur Rahman, Mohammad Arif Ul AlamArxiv: https://arxiv.org/abs/2307.00883
TLDR: Deep learning advancements have revolutionized scalable classification in many domains including computer vision. However, when it comes to wearable-based classification and domain adaptation, existing computer vision-based deep learning architectures and pretrained models trained on thousands of labeled images for months fall short. This is primarily because wearable sensor data necessitates sensor-specific preprocessing, architectural modification, and extensive data collection. To overcome these challenges, researchers have proposed encoding of wearable temporal sensor data in images using recurrent plots. In this
Repo: None
Keyword: multi-task
CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction
Authors: Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan HanArxiv: https://arxiv.org/abs/2307.00769
TLDR: In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not supporting multi-tasks, not supporting automatic updates. In this work, we present CollabKG, a learnable human-machine-cooperative IE toolkit for KG and eKG construction.
Repo: None
Towards Suicide Prevention from Bipolar Disorder with Temporal Symptom-Aware Multitask Learning
Authors: Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, Jinyoung HanArxiv: https://arxiv.org/abs/2307.00995
TLDR: Bipolar disorder (BD) is closely associated with an increased risk of suicide. However, while the prior work has revealed valuable insight into understanding the behavior of BD patients on social media, little attention has been paid to developing a model that can predict the future suicidality of a BD patient. Therefore, this study proposes a multi-task learning model for predicting the past suicidsality of patients by jointly learning current symptoms. We build a novel BD dataset clinically validated by psychiatrists
Repo: https://github.com/leedaeuni/Temporal-Symptom-Aware-Multitask-Learning-KDD23
Visual Instruction Tuning with Polite Flamingo
Authors: Delong Chen, Jianfeng Liu, Wenliang Dai, Baoyuan WangArxiv: https://arxiv.org/abs/2307.01003
TLDR: Recent research has demonstrated that the multi-task fine-tuning of multi-modal Large Language Models (LLMs) using an assortment of annotated downstream vision-language datasets significantly enhances their performance. Yet, during this process, a side effect, which we termed as the "multi-modality alignment tax", surfaces. This side effect negatively impacts the model's ability to format responses appropriately -- for instance, its "politeness" -- due to the overly succinct and unform
Repo: None
Keyword: robustness
RObotic MAnipulation Network (ROMAN) -- Hybrid Hierarchical Learning for Solving Complex Sequential Tasks
Authors: Eleftherios Triantafyllidis, Fernando Acero, Zhaocheng Liu, Zhibin LiArxiv: https://arxiv.org/abs/2307.00125
TLDR: Solving long sequential tasks poses a significant challenge in embodied artificial intelligence. Enabling a robotic system to perform diverse sequential tasks with a broad range of manipulation skills is an active area of research. In this work, we present a Hybrid Hierarchical Learning framework, the Robotic Manipulation Network (ROMAN), to address the challenge of solving multiple complex tasks over long time horizons in robotic manipulation. ROMAN achieves task versatility and robust failure recovery by integrating behavioural cloning, imitation learning,
Repo: None
Unsupervised Coordinate-Based Video Denoising
Authors: Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi LiArxiv: https://arxiv.org/abs/2307.00179
TLDR: In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating features maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the
Repo: None
More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data
Authors: Andrew Kean GaoArxiv: https://arxiv.org/abs/2307.00213
TLDR: Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings, where high quality labeled data can be challenging or expensive to obtain. This study investigates the efficacy of Compact Convolutional Transformers (CCT) for robust medical image classification with limited data, addressing a key issue faced by conventional Vision Transformers - their requirement for large datasets. A hybrid
Repo: None
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency
Authors: Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong LiuArxiv: https://arxiv.org/abs/2307.00280
TLDR: Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations. In this paper, we for the first time introduce SysNoise, a frequently occurred but often overlooked noise in the deep learning training-deployment cycle. In particular, Sys noise happens when the source training system switches to a disparate target system in deployments, where various tiny system mismatch adds up to
Repo: None
Launcher Attitude Control based on Incremental Nonlinear Dynamic Inversion: A Feasibility Study Towards Fast and Robust Design Approaches
Authors: Pedro Simplício, Paul Acquatella, Samir BennaniArxiv: https://arxiv.org/abs/2307.00372
TLDR: The so-called ``New Space era'' has seen a disruptive change in the business models and manufacturing technologies of launch vehicle companies. However, limited consideration has been given to the benefits that innovation in control theory can bring; not only in terms of increasing the limits of performance but also reducing mission preparation or ``missionisation'' efforts. Moreover, there is a gap between the current state-of-practice that still relies on linear controls and other modern control techniques that could bring relevant improvements in
Repo: None
CasTGAN: Cascaded Generative Adversarial Network for Realistic Tabular Data Synthesis
Authors: Abdallah Alshantti, Damiano Varagnolo, Adil Rasheed, Aria Rahmati, Frank WestadArxiv: https://arxiv.org/abs/2307.00384
TLDR: Generative adversarial networks (GANs) have drawn considerable attention in recent years for their proven capability in generating synthetic data which can be utilized for multiple purposes. While GANs have demonstrated tremendous successes in producing synthetic data samples that replicate the dynamics of the original datasets, the validity of the synthetic data and the underlying privacy concerns represent major challenges which are not sufficiently addressed. In this work, we design a cascaded tabular GAN framework (CasTGAN) for generating realistic tab
Repo: None
Brightness-Restricted Adversarial Attack Patch
Authors: Mingzhen ShaoArxiv: https://arxiv.org/abs/2307.00421
TLDR: Adversarial attack patches have gained increasing attention due to their practical applicability in physical-world scenarios. However, the bright colors used in attack patches represent a significant drawback, as they can be easily identified by human observers. Moreover, even though these attacks have been highly successful in deceiving target networks, which specific features of the attack patch contribute to its success are still unknown. Our paper introduces a brightness-restricted patch (BrPatch) that uses optical characteristics to effectively reduce conspicuousness
Repo: None
Query-Efficient Decision-based Black-Box Patch Attack
Authors: Zhaoyu Chen, Bo Li, Shuang Wu, Shouhong Ding, Wenqiang ZhangArxiv: https://arxiv.org/abs/2307.00477
TLDR: Deep neural networks (DNNs) have been showed to be highly vulnerable to imperceptible adversarial perturbations. As a complementary type of adversary, patch attacks that introduce perceptible perturbatives to the images have attracted the interest of researchers. Existing patch attacks rely on the architecture of the model or the probabilities of predictions and perform poorly in the decision-based setting, which can still construct a perturbation with the minimal information exposed -- the top-1 predicted label
Repo: None
POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static Environments
Authors: Jingxing Qian, Veronica Chatrath, James Servos, Aaron Mavrinac, Wolfram Burgard, Steven L. Waslander, Angela P. SchoelligArxiv: https://arxiv.org/abs/2307.00488
TLDR: Simultaneous localization and mapping (SLAM) in slowly varying scenes is important for long-term robot task completion. Failing to detect scene changes may lead to inaccurate maps and, ultimately, lost robots. Classical SLAM algorithms assume static scenes, and recent works take dynamics into account, but require scene changes to be observed in consecutive frames. Semi-static scenes, wherein objects appear, disappear, or move slowly over time, are often overlooked, yet are critical for long theterm
Repo: None
Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model
Authors: Daiki Koge, Naoaki Ono, Shigehiko KanayaArxiv: https://arxiv.org/abs/2307.00623
TLDR: In data-driven drug discovery, designing molecular descriptors is a very important task. Deep generative models such as variational autoencoders (VAEs) offer a potential solution by designing descriptors as probabilistic latent vectors derived from molecular structures. These models can be trained on large datasets, which have only molecular structures, and applied to transfer learning. Nevertheless, the approximate posterior distribution of the latent vectors of the usual VAE assumes a simple multivariate Gaussian distribution with
Repo: None
SDC-HSDD-NDSA: Structure Detecting Cluster by Hierarchical Secondary Directed Differential with Normalized Density and Self-Adaption
Authors: Hao ShuArxiv: https://arxiv.org/abs/2307.00677
TLDR: Density-based clustering could be the most popular clustering algorithm since it can identify clusters of arbitrary shape as long as different (high-density) clusters are separated by low-density regions. However, the requirement of the separateness of clusters by low‐density regions is not trivial since a high-density region might have different structures which should be clustered into different groups. Such a situation demonstrates the main flaw of all previous density-based cluster clustering algorithms we have known--
Repo: None
Scratch Team of Single-Rotor Robots and Decentralized Cooperative Transportation with Robot Failure
Authors: Koshi Oishi, Yasushi Amano, Jimbo TomohikoArxiv: https://arxiv.org/abs/2307.00705
TLDR: Achieving cooperative transportation by teams of aerial robots has been attracting attention owing to its flexibility with respect to payloads and robustness against failures. In this paper, we propose a flexible decentralized controller for the number of robots and the shapes of payloads in a cooperative transport task using multiple single-rotor robots. Our controller is robust to mass and center of mass fluctuations and robot failures. Moreover, asymptotic stability against dynamics errors is guaranteed. Additionally, the controller supports heter
Repo: None
Feasibility of Universal Anomaly Detection without Knowing the Abnormality in Medical Images
Authors: Can Cui, Yaohong Wang, Shunxing Bao, Yucheng Tang, Ruining Deng, Lucas W. Remedios, Zuhayr Asad, Joseph T. Roland, Ken S. Lau, Qi Liu, Lori A. Coburn, Keith T. Wilson, Bennett A. Landman, Yuankai HuoArxiv: https://arxiv.org/abs/2307.00750
TLDR: Many anomaly detection approaches, especially deep learning methods, have been recently developed to identify abnormal image morphology by only employing normal images during training. Unfortunately, many prior anomaly detection methods were optimized for a specific "known" abnormality (e.g., brain tumor, bone fraction, cell types). Moreover, even though only the normal images were used in the training process, the abnormal images were often insufficiently employed during the validation process (e-g., epoch selection, hyper-parameter
Repo: None
ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection
Authors: Yuhang Chen, Chaoyun Zhang, Minghua Ma, Yudong Liu, Ruomeng Ding, Bowen Li, Shilin He, Saravan Rajmohan, Qingwei Lin, Dongmei ZhangArxiv: https://arxiv.org/abs/2307.00754
TLDR: Anomaly detection in multivariate time series data is of paramount importance for ensuring the efficient operation of large-scale systems across diverse domains. However, accurately detecting anomalies in such data poses significant challenges. Existing approaches, including forecasting and reconstruction-based methods, struggle to address these challenges effectively. To overcome these limitations, we propose a novel anomaly detection framework named ImDiffusion, which combines time series imputation and diffusion models to achieve accurate and robust anomaly detection. The imputation-based
Repo: None
Learning Noise-Resistant Image Representation by Aligning Clean and Noisy Domains
Authors: Yanhui Guo, Xiaolin Wu, Fangzhou LuoArxiv: https://arxiv.org/abs/2307.00761
TLDR: Recent supervised and unsupervised image representation learning algorithms have achieved quantum leaps. However, these techniques do not account for representation resilience against noise in their design paradigms. Consequently, these effective methods suffer failure when confronted with noise outside the training distribution, such as complicated real-world noise that is usually opaque to model training. To address this issue, dual domains are optimized to separately model a canonical space for noisy representations, namely the Noise-Robust (NR) domain, and a
Repo: None
Cryptography and Key Management Schemes for Wireless Sensor Networks
Authors: Jaydip SenArxiv: https://arxiv.org/abs/2307.00872
TLDR: Wireless sensor networks (WSNs) are made up of a large number of tiny sensors, which can sense, analyze, and communicate information about the outside world. These networks play a significant role in a broad range of fields, from crucial military surveillance applications to monitoring building security. Key management in WSNs is a critical task. While the security and integrity of messages communicated through these networks and the authenticity of the nodes are dependent on the robustness of the key management schemes,
Repo: None
Enhancing the Robustness of QMIX against State-adversarial Attacks
Authors: Weiran Guo, Guanjun Liu, Ziyuan Zhou, Ling Wang, Jiacun WangArxiv: https://arxiv.org/abs/2307.00907
TLDR: Deep reinforcement learning (DRL) performance is generally impacted by state-adversarial attacks, a perturbation applied to an agent's observation. Most recent research has concentrated on robust single-agent reinforcement learning or multi-agent learning (SARL) algorithms, and on the use of SARL algorithms to improve the robustness of the SARL algorithm. Still, there has yet to be much work on robust multi-Agent reinforcement learning. Using QMIX, one of
Repo: None
Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration
Authors: Kemal Oksuz, Tom Joy, Puneet K. DokaniaArxiv: https://arxiv.org/abs/2307.00934
TLDR: The current approach for testing the robustness of object detectors suffers from serious deficiencies such as improper methods of performing out-of-distribution detection and using calibration metrics which do not consider both localisation and classification quality. In this work, we address these issues, and introduce the Self-Aware Object Detection (SAOD) task, a unified testing framework which respects and adheres to the challenges that object detectors face in safety-critical environments such as autonomous driving. Specifically, the SA
Repo: None
SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation
Authors: Liangliang Yao, Haobo Zuo, Guangze Zheng, Changhong Fu, Jia PanArxiv: https://arxiv.org/abs/2307.01024
TLDR: Domain adaptation (DA) has demonstrated significant promise for real-time nighttime unmanned aerial vehicle (UAV) tracking. However, the state-of-the-art (SOTA) DA still lacks the potential object with accurate pixel-level location and boundary to generate the high-quality target domain training sample. This key issue constrains the transfer of data from raw nighttime images to SAM-DA, i.e., DA-DA. Therefore, it is necessary to optimize the
Repo: None
Anonymous and Copy-Robust Delegations for Liquid Democracy
Authors: Markus Utke, Ulrike Schmidt-KraepelinArxiv: https://arxiv.org/abs/2307.01174
TLDR: Liquid democracy with ranked delegations is a novel voting scheme that unites the practicability of representative democracy with the idealistic appeal of direct democracy: Every voter decides between casting their vote on a question at hand or delegating their voting weight to some other, trusted agent. Delegations are transitive, and since voters may end up in a delegation cycle, they are encouraged to indicate not only a single delegate, but a set of potential delegates and a ranking among them. Based
Repo: None
Keyword: scholarly
Numerical Association Rule Mining: A Systematic Literature Review
Authors: Minakshi Kaushik, Rahul Sharma, Iztok Fister Jr., Dirk DraheimArxiv: https://arxiv.org/abs/2307.00662
TLDR: Numerical association rule mining is a widely used variant of the association rule hunting technique, and it has been extensively used in discovering patterns and relationships in numerical data. Initially, researchers and scientists integrated numerical attributes in association rule mines using various discretization approaches; however, over time, a plethora of alternative methods have emerged in this field. Unfortunately, the increase of alternative Methods has resulted into a significant knowledge gap in understanding diverse techniques employed in numerical association rule Mining -- this paper attempts to
Repo: None
Keyword: semantic similarity
Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution
Authors: Jorge Martinez-GilArxiv: https://arxiv.org/abs/2307.00925
TLDR: Semantic similarity measures are widely used in natural language processing to catalyze various computer-related tasks. However, no single semantic similarity measure is the most appropriate for all tasks, and researchers often use ensemble strategies to ensure performance. This research work proposes a method for automatically designing semantic similarity ensembles. In fact, our proposed method uses grammatical evolution, for the first time, to automatically select and aggregate measures from a pool of candidates to create an ensemble that maximizes correlation to human
Repo: None
Keyword: similarity measure
Applications of Binary Similarity and Distance Measures
Authors: Manoj Muniswamaiah, Tilak Agerwala, Charles C. TappertArxiv: https://arxiv.org/abs/2307.00411
TLDR: In the recent past, binary similarity measures have been applied in solving biometric identification problems, including fingerprint, handwritten character detection, and in iris image recognition. The application of the relevant measurements has also resulted in more accurate data analysis. This paper surveys the applicability of binary similarity and distance measures in various fields.
Repo: None
Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution
Authors: Jorge Martinez-GilArxiv: https://arxiv.org/abs/2307.00925
TLDR: Semantic similarity measures are widely used in natural language processing to catalyze various computer-related tasks. However, no single semantic similarity measure is the most appropriate for all tasks, and researchers often use ensemble strategies to ensure performance. This research work proposes a method for automatically designing semantic similarity ensembles. In fact, our proposed method uses grammatical evolution, for the first time, to automatically select and aggregate measures from a pool of candidates to create an ensemble that maximizes correlation to human
Repo: None
Keyword: summarization
Challenges in Domain-Specific Abstractive Summarization and How to Overcome them
Authors: Anum Afzal, Juraj Vladika, Daniel Braun, Florian MatthesArxiv: https://arxiv.org/abs/2307.00963
TLDR: Large Language Models work quite well with general-purpose data and many tasks in Natural Language Processing. However, they show several limitations when used for a task such as domain-specific abstractive text summarization. This paper identifies three of those limitations as research problems in the context of abstractive Text summarization: 1) Quadratic complexity of transformer-based models with respect to the input text length; 2) Model Hallucination, which is a model's ability to generate factually
Repo: None
Keyword: text generation
More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data
Authors: Andrew Kean GaoArxiv: https://arxiv.org/abs/2307.00213
TLDR: Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings, where high quality labeled data can be challenging or expensive to obtain. This study investigates the efficacy of Compact Convolutional Transformers (CCT) for robust medical image classification with limited data, addressing a key issue faced by conventional Vision Transformers - their requirement for large datasets. A hybrid
Repo: None
PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation
Authors: Le Xiao, Xin ShanArxiv: https://arxiv.org/abs/2307.00470
TLDR: Large language models(LLMs) have shown excellent text generation capabilities, but there is still much space for improvement in accuracy, sometimes with grammatical errors, semantic inaccuracies, and contextual incoherence, which seriously affect the reliability of the models. These problems may originate from the difficulties and limitations encountered in the pattern extraction stage of large language models. How to utilize the generative power of large languages models to generate as many possible patterns that help solve problems and find the optimal patterns from them
Repo: None
Metadata
Metadata
Assignees
Labels
abstract meaning representationargument miningcitation context analysiscomputational social sciencecontrastivecross-language information retrievalcross-lingual information retrievaldata augmentationextreme multi-labelknowledge discoveryknowledge graphlegallegal textmixupmulti-taskparaphrasepassage generationplagiarismrobustnessscholarlyscholarly document processingsemantic similaritysimilarity measuresimplificationsummarizationtext generation