New submissions for Tue,  4 Jul 23

<h2>Keyword: contrastive</h2><details><h3>Improving the Transferability of Time Series Forecasting with  Decomposition Adaptation</h3>                    <strong>Authors:</strong> Yan Gao, Yan Wang, Qiang Wang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00066'>https://arxiv.org/abs/2307.00066</a><br>                    <strong>TLDR:</strong> Due to effective pattern mining and feature representation, neural forecasting models based on deep learning have achieved great progress. The premise of effective learning is to collect sufficient data. However, in time series forecasting, it is difficult to obtain enough data, which limits the performance of neural forecasting Models. To alleviate the data scarcity limitation, we design Sequence Decomposition Adaptation Network (SeDAN) which is a novel transfer architecture to improve forecasting performance on the target domain by aligning transferable knowledge<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Prompting classes: Exploring the Power of Prompt Class Learning in  Weakly Supervised Semantic Segmentation</h3>                    <strong>Authors:</strong> Balamurali Murugesan, Rukhshanda Hussain, Rajarshi Bhattacharya, Ismail Ben Ayed, Jose Dolz<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00097'>https://arxiv.org/abs/2307.00097</a><br>                    <strong>TLDR:</strong> Recently, CLIP-based approaches have exhibited remarkable performance on generalization and few-shot learning tasks, fueled by the power of contrastive language-vision pre-training. In particular, prompt tuning has emerged as an effective strategy to adapt the pre-trained language-visit models to downstream tasks by employing task-related textual tokens. Motivated by this progress, in this work we question whether other fundamental problems, such as weakly supervised semantic segmentation (WSSS),<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Hierarchical Pretraining for Biomedical Term Embeddings</h3>                    <strong>Authors:</strong> Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu Tian<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00266'>https://arxiv.org/abs/2307.00266</a><br>                    <strong>TLDR:</strong> Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms with semantic embeddings via representation learning and use the low dimensional embedd<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>BioCPT: Contrastive Pre-trained Transformers with Large-scale PubMed  Search Logs for Zero-shot Biomedical Information Retrieval</h3>                    <strong>Authors:</strong> Qiao Jin, Won Kim, Qingyu Chen, Donald C. Comeau, Lana Yeganova, John Wilbur, Zhiyong Lu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00589'>https://arxiv.org/abs/2307.00589</a><br>                    <strong>TLDR:</strong> Information retrieval (IR) is essential in biomedical knowledge acquisition and clinical decision support. While recent progress has shown that language model encoders perform better semantic retrieval, training such models requires abundant query-article annotations that are difficult to obtain in biomedicine. As a result, most biomedical IR systems only conduct lexical matching. In response, we introduce BioCPT, a first-of-its-kind Contrastively Pre-trained Transformer model for zero-shot biomedical IR<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>CLIMAX: An exploration of Classifier-Based Contrastive Explanations</h3>                    <strong>Authors:</strong> Praharsh Nanavati, Ranjitha Prasad<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00680'>https://arxiv.org/abs/2307.00680</a><br>                    <strong>TLDR:</strong> Explainable AI is an evolving area that deals with understanding the decision making of machine learning models so that these models are more transparent, accountable, and understandable for humans. In particular, post-hoc model-agnostic interpretable AI techniques explain the decisions of a black-box ML model for a single instance locally, without the knowledge of the intrinsic nature of the ML model. Despite their simplicity and capability in providing valuable insights, existing approaches fail to deliver consistent and reliable explanations.<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Resistive memory-based zero-shot liquid state machine for multimodal  event data learning</h3>                    <strong>Authors:</strong> Ning Lin, Shaocong Wang, Yi Li, Bo Wang, Shuhui Shi, Yangu He, Woyu Zhang, Yifei Yu, Yue Zhang, Xiaojuan Qi, Xiaoming Chen, Hao Jiang, Xumeng Zhang, Peng Lin, Xiaoxin Xu, Qi Liu, Zhongrui Wang, Dashan Shang, Ming Liu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00771'>https://arxiv.org/abs/2307.00771</a><br>                    <strong>TLDR:</strong> The human brain is a complex spiking neural network (SNN) that learns multimodal signals in a zero-shot manner by generalizing existing knowledge. Remarkably, the brain achieves this with minimal power consumption, using event-based signals that propagate within its structure. However, mimicking the human brain in neuromorphic hardware presents both hardware and software challenges. Hardware limitations, such as the slowdown of Moore's law and the von Neumann bottleneck, hinder the efficiency of digital<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>UniFine: A Unified and Fine-grained Approach for Zero-shot  Vision-Language Understanding</h3>                    <strong>Authors:</strong> Rui Sun, Zhecan Wang, Haoxuan You, Noel Codella, Kai-Wei Chang, Shih-Fu Chang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00862'>https://arxiv.org/abs/2307.00862</a><br>                    <strong>TLDR:</strong> Vision-language tasks, such as VQA, SNLI-VE, and VCR are challenging because they require the model's reasoning ability to understand the semantics of the visual world and natural language. Supervised methods working for vision-language skills have been well-studied. However, solving these tasks in a zero-shot setting is less explored. Since Contrastive Language-Image Pre-training (CLIP) has shown remarkable zero-stroke performance on image-text matching<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Autism Spectrum Disorder Classification in Children based on Structural  MRI Features Extracted using Contrastive Variational Autoencoder</h3>                    <strong>Authors:</strong> Ruimin Ma, Ruitao Xie, Yanlin Wang, Jintao Meng, Yanjie Wei, Wenhui Xi, Yi Pan<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00976'>https://arxiv.org/abs/2307.00976</a><br>                    <strong>TLDR:</strong> Autism spectrum disorder (ASD) is a highly disabling mental disease that brings significant impairments of social interaction ability to the patients, making early screening and intervention of ASD critical. With the development of the machine learning and neuroimaging technology, extensive research has been conducted on machine classification of ASD based on structural MRI (s-MRI). However, most studies involve with datasets where participants' age are above 5-year-old, but, with mediocre predictive accuracy. In this paper<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>ENGAGE: Explanation Guided Data Augmentation for Graph Representation  Learning</h3>                    <strong>Authors:</strong> Yucheng Shi, Kaixiong Zhou, Ninghao Liu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01053'>https://arxiv.org/abs/2307.01053</a><br>                    <strong>TLDR:</strong> The recent contrastive learning methods, due to their effectiveness in representation learning, have been widely applied to modeling graph data. Random perturbation is widely used to build contrastive views for graph data, which however, could accidentally break graph structures and lead to suboptimal performance. In addition, graph data is usually highly abstract, so it is hard to extract intuitive meanings and design more informed augmentation schemes. Effective representations should preserve key characteristics in data and abandon superfluous information. In<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Investigating Data Memorization in 3D Latent Diffusion Models for  Medical Image Synthesis</h3>                    <strong>Authors:</strong> Salman Ul Hassan Dar, Arman Ghanaat, Jannik Kahmann, Isabelle Ayx, Theano Papavassiliou, Stefan O. Schoenberg, Sandy Engelhardt<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01148'>https://arxiv.org/abs/2307.01148</a><br>                    <strong>TLDR:</strong> Generative latent diffusion models have been established as state-of-the-art in data generation. One promising application is generation of realistic synthetic medical imaging data for open data sharing without compromising patient privacy. Despite the promise, the capacity of such models to memorize sensitive patient training data and synthesize samples showing high resemblance to training data samples is relatively unexplored. Here, we assess the memorization capacity of 3D latent diffusion model on photon-counting coronary computed tomography angi<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Don't freeze: Finetune encoders for better Self-Supervised HAR</h3>                    <strong>Authors:</strong> Vitor Fortes Rey, Dominique Nshimyimana, Paul Lukowicz<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01168'>https://arxiv.org/abs/2307.01168</a><br>                    <strong>TLDR:</strong> Recently self-supervised learning has been proposed in the field of human activity recognition as a solution to the labelled data availability problem. The idea being that by using pretext tasks such as reconstruction or contrastive predictive coding, useful representations can be learned that then can be used for classification. Those approaches follow the pretrain, freeze and fine-tune procedure. In this paper we will show how a simple change - not freezing the representation - leads to substantial performance gains across pretext tasks. The<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: data augmentation</h2><details><h3>Investigating Masking-based Data Generation in Language Models</h3>                    <strong>Authors:</strong> Ed S. Ma<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00008'>https://arxiv.org/abs/2307.00008</a><br>                    <strong>TLDR:</strong> The current era of natural language processing (NLP) has been defined by the prominence of pre-trained language models since the advent of BERT. A feature of Bert and models with similar architecture is the objective of masked language modeling, in which part of the input is intentionally masked and the model is trained to predict this piece of masked information. Data augmentation (DAG) is a data-driven technique widely used in machine learning, including research areas like computer vision and natural<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Counterfactual Collaborative Reasoning</h3>                    <strong>Authors:</strong> Jianchao Ji, Zelong Li, Shuyuan Xu, Max Xiong, Juntao Tan, Yingqiang Ge, Hao Wang, Yongfeng Zhang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00165'>https://arxiv.org/abs/2307.00165</a><br>                    <strong>TLDR:</strong> Causal reasoning and logical reasoning are two important types of reasoning abilities for human intelligence. However, their relationship has not been extensively explored under machine intelligence context. In this paper, we explore how the two reasoning abilities can be jointly modeled to enhance both accuracy and explainability of machine learning models. More specifically, by integrating two important Types of reasoning ability -- counterfactual reasoning and (neural) logical reasoning -- we propose Counterfactual Collaborative Reasoning (CCR), which conducts<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Unsupervised Coordinate-Based Video Denoising</h3>                    <strong>Authors:</strong> Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi Li<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00179'>https://arxiv.org/abs/2307.00179</a><br>                    <strong>TLDR:</strong> In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating features maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Long-Tailed Continual Learning For Visual Food Recognition</h3>                    <strong>Authors:</strong> Jiangpeng He, Luotao Lin, Jack Ma, Heather A. Eicher-Miller, Fengqing Zhu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00183'>https://arxiv.org/abs/2307.00183</a><br>                    <strong>TLDR:</strong> Deep learning based food recognition has achieved remarkable progress in predicting food types given an eating occasion image. However, there are two major obstacles that hinder deployment in real world scenario. First, as new foods appear sequentially overtime, a trained model needs to learn the new classes continuously without causing catastrophic forgetting for already learned knowledge of existing food types. Second, the distribution of food images in real life is usually long-tailed as a small number of popular food types are consumed more frequently than others,<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>SysNoise: Exploring and Benchmarking Training-Deployment System  Inconsistency</h3>                    <strong>Authors:</strong> Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong Liu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00280'>https://arxiv.org/abs/2307.00280</a><br>                    <strong>TLDR:</strong> Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations. In this paper, we for the first time introduce SysNoise, a frequently occurred but often overlooked noise in the deep learning training-deployment cycle. In particular, Sys noise happens when the source training system switches to a disparate target system in deployments, where various tiny system mismatch adds up to<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>CNN-BiLSTM model for English Handwriting Recognition: Comprehensive  Evaluation on the IAM Dataset</h3>                    <strong>Authors:</strong> Firat Kizilirmak, Berrin Yanikoglu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00664'>https://arxiv.org/abs/2307.00664</a><br>                    <strong>TLDR:</strong> We present a CNN-BiLSTM system for the problem of offline English handwriting recognition, with extensive evaluations on the public IAM dataset, including the effects of model size, data augmentation and the lexicon. Our best model achieves 3.59\% CER and 9.44\% WER using CNN-biLST M network with CTC layer. Test time augmentation with rotation and shear transformations applied to the input image, is proposed to increase recognition<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>ENGAGE: Explanation Guided Data Augmentation for Graph Representation  Learning</h3>                    <strong>Authors:</strong> Yucheng Shi, Kaixiong Zhou, Ninghao Liu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01053'>https://arxiv.org/abs/2307.01053</a><br>                    <strong>TLDR:</strong> The recent contrastive learning methods, due to their effectiveness in representation learning, have been widely applied to modeling graph data. Random perturbation is widely used to build contrastive views for graph data, which however, could accidentally break graph structures and lead to suboptimal performance. In addition, graph data is usually highly abstract, so it is hard to extract intuitive meanings and design more informed augmentation schemes. Effective representations should preserve key characteristics in data and abandon superfluous information. In<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: knowledge graph</h2><details><h3>Hierarchical Pretraining for Biomedical Term Embeddings</h3>                    <strong>Authors:</strong> Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu Tian<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00266'>https://arxiv.org/abs/2307.00266</a><br>                    <strong>TLDR:</strong> Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms with semantic embeddings via representation learning and use the low dimensional embedd<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>PersonaGen: A Tool for Generating Personas from User Feedback</h3>                    <strong>Authors:</strong> Xishuo Zhang, Lin Liu, Yi Wang, Xiao Liu, Hailong Wang, Anqi Ren, Chetan Arora<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00390'>https://arxiv.org/abs/2307.00390</a><br>                    <strong>TLDR:</strong> Personas are crucial in software development processes, particularly in agile settings. However, no effective tools are available for generating personas from user feedback in agile Software development processes. To fill this gap, we propose a novel tool that uses the GPT-4 model and knowledge graph to generate persona templates from well-processed user feedback, facilitating requirement analysis in agile software development. We developed a tool called PersonaGen. We evaluated PersonaGen using qualitative feedback from a small-scale user study<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Semi-automated Thermal Envelope Model Setup for Adaptive Model  Predictive Control with Event-triggered System Identification</h3>                    <strong>Authors:</strong> Lu Wan, Xiaobing Dai, Torsten Welfonder, Ekaterina Petrova, Pieter Pauwels<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00638'>https://arxiv.org/abs/2307.00638</a><br>                    <strong>TLDR:</strong> To reach carbon neutrality in the middle of this century, smart controls for building energy systems are urgently required. Model predictive control (MPC) demonstrates great potential in improving the performance of heating ventilation and air-conditioning (HVAC) systems, whereas its wide application in the building sector is impeded by the considerable manual efforts involved in setting up the control-oriented model. To facilitate the system identification (SI) of the building envelope as well as the configuration of the MPC<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>CollabKG: A Learnable Human-Machine-Cooperative Information Extraction  Toolkit for (Event) Knowledge Graph Construction</h3>                    <strong>Authors:</strong> Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan Han<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00769'>https://arxiv.org/abs/2307.00769</a><br>                    <strong>TLDR:</strong> In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not supporting multi-tasks, not supporting automatic updates. In this work, we present CollabKG, a learnable human-machine-cooperative IE toolkit for KG and eKG construction.<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Iterative Zero-Shot LLM Prompting for Knowledge Graph Construction</h3>                    <strong>Authors:</strong> Salvatore Carta, Alessandro Giuliani, Leonardo Piano, Alessandro Sebastian Podda, Livio Pompianu, Sandro Gabriele Tiddia<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01128'>https://arxiv.org/abs/2307.01128</a><br>                    <strong>TLDR:</strong> In the current digitalization era, capturing and effectively representing knowledge is crucial in most real-world scenarios. In this context, knowledge graphs represent a potent tool for retrieving and organizing a vast amount of information in a properly interconnected and interpretable structure. However, their generation is still challenging and often requires considerable human effort and domain expertise, hampering the scalability and flexibility across different application fields. This paper proposes an innovative knowledge graph generation approach that leverages the potential of the latest generative<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: legal</h2><details><h3>From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and  Privacy</h3>                    <strong>Authors:</strong> Maanak Gupta, CharanKumar Akiri, Kshitiz Aryal, Eli Parker, Lopamudra Praharaj<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00691'>https://arxiv.org/abs/2307.00691</a><br>                    <strong>TLDR:</strong> Undoubtedly, the evolution of Generative AI (GenAI) models has been the highlight of digital transformation in the year 2022. As the different GenAI models like ChatGPT and Google Bot continue to foster their complexity and capability, it's critical to understand its consequences from a cybersecurity perspective. Several instances recently have demonstrated the use of GenAI tools in both the defensive and offensive side of cybersecurity, and focusing on the social, ethical and privacy implications this technology possesses. This research paper<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: mixup</h2><details><h3>Augmenting Deep Learning Adaptation for Wearable Sensor Data through  Combined Temporal-Frequency Image Encoding</h3>                    <strong>Authors:</strong> Yidong Zhu, Md Mahmudur Rahman, Mohammad Arif Ul Alam<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00883'>https://arxiv.org/abs/2307.00883</a><br>                    <strong>TLDR:</strong> Deep learning advancements have revolutionized scalable classification in many domains including computer vision. However, when it comes to wearable-based classification and domain adaptation, existing computer vision-based deep learning architectures and pretrained models trained on thousands of labeled images for months fall short. This is primarily because wearable sensor data necessitates sensor-specific preprocessing, architectural modification, and extensive data collection. To overcome these challenges, researchers have proposed encoding of wearable temporal sensor data in images using recurrent plots. In this<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: multi-task</h2><details><h3>CollabKG: A Learnable Human-Machine-Cooperative Information Extraction  Toolkit for (Event) Knowledge Graph Construction</h3>                    <strong>Authors:</strong> Xiang Wei, Yufeng Chen, Ning Cheng, Xingyu Cui, Jinan Xu, Wenjuan Han<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00769'>https://arxiv.org/abs/2307.00769</a><br>                    <strong>TLDR:</strong> In order to construct or extend entity-centric and event-centric knowledge graphs (KG and EKG), the information extraction (IE) annotation toolkit is essential. However, existing IE toolkits have several non-trivial problems, such as not supporting multi-tasks, not supporting automatic updates. In this work, we present CollabKG, a learnable human-machine-cooperative IE toolkit for KG and eKG construction.<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Towards Suicide Prevention from Bipolar Disorder with Temporal  Symptom-Aware Multitask Learning</h3>                    <strong>Authors:</strong> Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, Jinyoung Han<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00995'>https://arxiv.org/abs/2307.00995</a><br>                    <strong>TLDR:</strong> Bipolar disorder (BD) is closely associated with an increased risk of suicide. However, while the prior work has revealed valuable insight into understanding the behavior of BD patients on social media, little attention has been paid to developing a model that can predict the future suicidality of a BD patient. Therefore, this study proposes a multi-task learning model for predicting the past suicidsality of patients by jointly learning current symptoms. We build a novel BD dataset clinically validated by psychiatrists<br>                    <strong>Repo:  <a href='https://github.com/leedaeuni/Temporal-Symptom-Aware-Multitask-Learning-KDD23'>https://github.com/leedaeuni/Temporal-Symptom-Aware-Multitask-Learning-KDD23</a><strong><br><h3>Visual Instruction Tuning with Polite Flamingo</h3>                    <strong>Authors:</strong> Delong Chen, Jianfeng Liu, Wenliang Dai, Baoyuan Wang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01003'>https://arxiv.org/abs/2307.01003</a><br>                    <strong>TLDR:</strong> Recent research has demonstrated that the multi-task fine-tuning of multi-modal Large Language Models (LLMs) using an assortment of annotated downstream vision-language datasets significantly enhances their performance. Yet, during this process, a side effect, which we termed as the "multi-modality alignment tax", surfaces. This side effect negatively impacts the model's ability to format responses appropriately -- for instance, its "politeness" -- due to the overly succinct and unform<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: robustness</h2><details><h3>RObotic MAnipulation Network (ROMAN) -- Hybrid Hierarchical Learning for  Solving Complex Sequential Tasks</h3>                    <strong>Authors:</strong> Eleftherios Triantafyllidis, Fernando Acero, Zhaocheng Liu, Zhibin Li<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00125'>https://arxiv.org/abs/2307.00125</a><br>                    <strong>TLDR:</strong> Solving long sequential tasks poses a significant challenge in embodied artificial intelligence. Enabling a robotic system to perform diverse sequential tasks with a broad range of manipulation skills is an active area of research. In this work, we present a Hybrid Hierarchical Learning framework, the Robotic Manipulation Network (ROMAN), to address the challenge of solving multiple complex tasks over long time horizons in robotic manipulation. ROMAN achieves task versatility and robust failure recovery by integrating behavioural cloning, imitation learning,<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Unsupervised Coordinate-Based Video Denoising</h3>                    <strong>Authors:</strong> Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Nianyi Li<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00179'>https://arxiv.org/abs/2307.00179</a><br>                    <strong>TLDR:</strong> In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating features maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>More for Less: Compact Convolutional Transformers Enable Robust Medical  Image Classification with Limited Data</h3>                    <strong>Authors:</strong> Andrew Kean Gao<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00213'>https://arxiv.org/abs/2307.00213</a><br>                    <strong>TLDR:</strong> Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings, where high quality labeled data can be challenging or expensive to obtain. This study investigates the efficacy of Compact Convolutional Transformers (CCT) for robust medical image classification with limited data, addressing a key issue faced by conventional Vision Transformers - their requirement for large datasets. A hybrid<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>SysNoise: Exploring and Benchmarking Training-Deployment System  Inconsistency</h3>                    <strong>Authors:</strong> Yan Wang, Yuhang Li, Ruihao Gong, Aishan Liu, Yanfei Wang, Jian Hu, Yongqiang Yao, Yunchen Zhang, Tianzi Xiao, Fengwei Yu, Xianglong Liu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00280'>https://arxiv.org/abs/2307.00280</a><br>                    <strong>TLDR:</strong> Extensive studies have shown that deep learning models are vulnerable to adversarial and natural noises, yet little is known about model robustness on noises caused by different system implementations. In this paper, we for the first time introduce SysNoise, a frequently occurred but often overlooked noise in the deep learning training-deployment cycle. In particular, Sys noise happens when the source training system switches to a disparate target system in deployments, where various tiny system mismatch adds up to<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Launcher Attitude Control based on Incremental Nonlinear Dynamic  Inversion: A Feasibility Study Towards Fast and Robust Design Approaches</h3>                    <strong>Authors:</strong> Pedro Simplício, Paul Acquatella, Samir Bennani<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00372'>https://arxiv.org/abs/2307.00372</a><br>                    <strong>TLDR:</strong> The so-called ``New Space era'' has seen a disruptive change in the business models and manufacturing technologies of launch vehicle companies. However, limited consideration has been given to the benefits that innovation in control theory can bring; not only in terms of increasing the limits of performance but also reducing mission preparation or ``missionisation'' efforts. Moreover, there is a gap between the current state-of-practice that still relies on linear controls and other modern control techniques that could bring relevant improvements in<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>CasTGAN: Cascaded Generative Adversarial Network for Realistic Tabular  Data Synthesis</h3>                    <strong>Authors:</strong> Abdallah Alshantti, Damiano Varagnolo, Adil Rasheed, Aria Rahmati, Frank Westad<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00384'>https://arxiv.org/abs/2307.00384</a><br>                    <strong>TLDR:</strong> Generative adversarial networks (GANs) have drawn considerable attention in recent years for their proven capability in generating synthetic data which can be utilized for multiple purposes. While GANs have demonstrated tremendous successes in producing synthetic data samples that replicate the dynamics of the original datasets, the validity of the synthetic data and the underlying privacy concerns represent major challenges which are not sufficiently addressed. In this work, we design a cascaded tabular GAN framework (CasTGAN) for generating realistic tab<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Brightness-Restricted Adversarial Attack Patch</h3>                    <strong>Authors:</strong> Mingzhen Shao<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00421'>https://arxiv.org/abs/2307.00421</a><br>                    <strong>TLDR:</strong> Adversarial attack patches have gained increasing attention due to their practical applicability in physical-world scenarios. However, the bright colors used in attack patches represent a significant drawback, as they can be easily identified by human observers. Moreover, even though these attacks have been highly successful in deceiving target networks, which specific features of the attack patch contribute to its success are still unknown. Our paper introduces a brightness-restricted patch (BrPatch) that uses optical characteristics to effectively reduce conspicuousness<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Query-Efficient Decision-based Black-Box Patch Attack</h3>                    <strong>Authors:</strong> Zhaoyu Chen, Bo Li, Shuang Wu, Shouhong Ding, Wenqiang Zhang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00477'>https://arxiv.org/abs/2307.00477</a><br>                    <strong>TLDR:</strong> Deep neural networks (DNNs) have been showed to be highly vulnerable to imperceptible adversarial perturbations. As a complementary type of adversary, patch attacks that introduce perceptible perturbatives to the images have attracted the interest of researchers. Existing patch attacks rely on the architecture of the model or the probabilities of predictions and perform poorly in the decision-based setting, which can still construct a perturbation with the minimal information exposed -- the top-1 predicted label<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static  Environments</h3>                    <strong>Authors:</strong> Jingxing Qian, Veronica Chatrath, James Servos, Aaron Mavrinac, Wolfram Burgard, Steven L. Waslander, Angela P. Schoellig<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00488'>https://arxiv.org/abs/2307.00488</a><br>                    <strong>TLDR:</strong> Simultaneous localization and mapping (SLAM) in slowly varying scenes is important for long-term robot task completion. Failing to detect scene changes may lead to inaccurate maps and, ultimately, lost robots. Classical SLAM algorithms assume static scenes, and recent works take dynamics into account, but require scene changes to be observed in consecutive frames. Semi-static scenes, wherein objects appear, disappear, or move slowly over time, are often overlooked, yet are critical for long theterm<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Variational Autoencoding Molecular Graphs with Denoising Diffusion  Probabilistic Model</h3>                    <strong>Authors:</strong> Daiki Koge, Naoaki Ono, Shigehiko Kanaya<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00623'>https://arxiv.org/abs/2307.00623</a><br>                    <strong>TLDR:</strong> In data-driven drug discovery, designing molecular descriptors is a very important task. Deep generative models such as variational autoencoders (VAEs) offer a potential solution by designing descriptors as probabilistic latent vectors derived from molecular structures. These models can be trained on large datasets, which have only molecular structures, and applied to transfer learning. Nevertheless, the approximate posterior distribution of the latent vectors of the usual VAE assumes a simple multivariate Gaussian distribution with<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>SDC-HSDD-NDSA: Structure Detecting Cluster by Hierarchical Secondary  Directed Differential with Normalized Density and Self-Adaption</h3>                    <strong>Authors:</strong> Hao Shu<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00677'>https://arxiv.org/abs/2307.00677</a><br>                    <strong>TLDR:</strong> Density-based clustering could be the most popular clustering algorithm since it can identify clusters of arbitrary shape as long as different (high-density) clusters are separated by low-density regions. However, the requirement of the separateness of clusters by low‐density regions is not trivial since a high-density region might have different structures which should be clustered into different groups. Such a situation demonstrates the main flaw of all previous density-based cluster clustering algorithms we have known--<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Scratch Team of Single-Rotor Robots and Decentralized Cooperative  Transportation with Robot Failure</h3>                    <strong>Authors:</strong> Koshi Oishi, Yasushi Amano, Jimbo Tomohiko<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00705'>https://arxiv.org/abs/2307.00705</a><br>                    <strong>TLDR:</strong> Achieving cooperative transportation by teams of aerial robots has been attracting attention owing to its flexibility with respect to payloads and robustness against failures. In this paper, we propose a flexible decentralized controller for the number of robots and the shapes of payloads in a cooperative transport task using multiple single-rotor robots. Our controller is robust to mass and center of mass fluctuations and robot failures. Moreover, asymptotic stability against dynamics errors is guaranteed. Additionally, the controller supports heter<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Feasibility of Universal Anomaly Detection without Knowing the  Abnormality in Medical Images</h3>                    <strong>Authors:</strong> Can Cui, Yaohong Wang, Shunxing Bao, Yucheng Tang, Ruining Deng, Lucas W. Remedios, Zuhayr Asad, Joseph T. Roland, Ken S. Lau, Qi Liu, Lori A. Coburn, Keith T. Wilson, Bennett A. Landman, Yuankai Huo<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00750'>https://arxiv.org/abs/2307.00750</a><br>                    <strong>TLDR:</strong> Many anomaly detection approaches, especially deep learning methods, have been recently developed to identify abnormal image morphology by only employing normal images during training. Unfortunately, many prior anomaly detection methods were optimized for a specific "known" abnormality (e.g., brain tumor, bone fraction, cell types). Moreover, even though only the normal images were used in the training process, the abnormal images were often insufficiently employed during the validation process (e-g., epoch selection, hyper-parameter<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>ImDiffusion: Imputed Diffusion Models for Multivariate Time Series  Anomaly Detection</h3>                    <strong>Authors:</strong> Yuhang Chen, Chaoyun Zhang, Minghua Ma, Yudong Liu, Ruomeng Ding, Bowen Li, Shilin He, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00754'>https://arxiv.org/abs/2307.00754</a><br>                    <strong>TLDR:</strong> Anomaly detection in multivariate time series data is of paramount importance for ensuring the efficient operation of large-scale systems across diverse domains. However, accurately detecting anomalies in such data poses significant challenges. Existing approaches, including forecasting and reconstruction-based methods, struggle to address these challenges effectively. To overcome these limitations, we propose a novel anomaly detection framework named ImDiffusion, which combines time series imputation and diffusion models to achieve accurate and robust anomaly detection. The imputation-based<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Learning Noise-Resistant Image Representation by Aligning Clean and  Noisy Domains</h3>                    <strong>Authors:</strong> Yanhui Guo, Xiaolin Wu, Fangzhou Luo<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00761'>https://arxiv.org/abs/2307.00761</a><br>                    <strong>TLDR:</strong> Recent supervised and unsupervised image representation learning algorithms have achieved quantum leaps. However, these techniques do not account for representation resilience against noise in their design paradigms. Consequently, these effective methods suffer failure when confronted with noise outside the training distribution, such as complicated real-world noise that is usually opaque to model training. To address this issue, dual domains are optimized to separately model a canonical space for noisy representations, namely the Noise-Robust (NR) domain, and a<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Cryptography and Key Management Schemes for Wireless Sensor Networks</h3>                    <strong>Authors:</strong> Jaydip Sen<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00872'>https://arxiv.org/abs/2307.00872</a><br>                    <strong>TLDR:</strong> Wireless sensor networks (WSNs) are made up of a large number of tiny sensors, which can sense, analyze, and communicate information about the outside world. These networks play a significant role in a broad range of fields, from crucial military surveillance applications to monitoring building security. Key management in WSNs is a critical task. While the security and integrity of messages communicated through these networks and the authenticity of the nodes are dependent on the robustness of the key management schemes,<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Enhancing the Robustness of QMIX against State-adversarial Attacks</h3>                    <strong>Authors:</strong> Weiran Guo, Guanjun Liu, Ziyuan Zhou, Ling Wang, Jiacun Wang<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00907'>https://arxiv.org/abs/2307.00907</a><br>                    <strong>TLDR:</strong> Deep reinforcement learning (DRL) performance is generally impacted by state-adversarial attacks, a perturbation applied to an agent's observation. Most recent research has concentrated on robust single-agent reinforcement learning or multi-agent learning (SARL) algorithms, and on the use of SARL algorithms to improve the robustness of the SARL algorithm. Still, there has yet to be much work on robust multi-Agent reinforcement learning. Using QMIX, one of<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Towards Building Self-Aware Object Detectors via Reliable Uncertainty  Quantification and Calibration</h3>                    <strong>Authors:</strong> Kemal Oksuz, Tom Joy, Puneet K. Dokania<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00934'>https://arxiv.org/abs/2307.00934</a><br>                    <strong>TLDR:</strong> The current approach for testing the robustness of object detectors suffers from serious deficiencies such as improper methods of performing out-of-distribution detection and using calibration metrics which do not consider both localisation and classification quality. In this work, we address these issues, and introduce the Self-Aware Object Detection (SAOD) task, a unified testing framework which respects and adheres to the challenges that object detectors face in safety-critical environments such as autonomous driving. Specifically, the SA<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation</h3>                    <strong>Authors:</strong> Liangliang Yao, Haobo Zuo, Guangze Zheng, Changhong Fu, Jia Pan<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01024'>https://arxiv.org/abs/2307.01024</a><br>                    <strong>TLDR:</strong> Domain adaptation (DA) has demonstrated significant promise for real-time nighttime unmanned aerial vehicle (UAV) tracking. However, the state-of-the-art (SOTA) DA still lacks the potential object with accurate pixel-level location and boundary to generate the high-quality target domain training sample. This key issue constrains the transfer of data from raw nighttime images to SAM-DA, i.e., DA-DA. Therefore, it is necessary to optimize the<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Anonymous and Copy-Robust Delegations for Liquid Democracy</h3>                    <strong>Authors:</strong> Markus Utke, Ulrike Schmidt-Kraepelin<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.01174'>https://arxiv.org/abs/2307.01174</a><br>                    <strong>TLDR:</strong> Liquid democracy with ranked delegations is a novel voting scheme that unites the practicability of representative democracy with the idealistic appeal of direct democracy: Every voter decides between casting their vote on a question at hand or delegating their voting weight to some other, trusted agent. Delegations are transitive, and since voters may end up in a delegation cycle, they are encouraged to indicate not only a single delegate, but a set of potential delegates and a ranking among them. Based<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: scholarly</h2><details><h3>Numerical Association Rule Mining: A Systematic Literature Review</h3>                    <strong>Authors:</strong> Minakshi Kaushik, Rahul Sharma, Iztok Fister Jr., Dirk Draheim<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00662'>https://arxiv.org/abs/2307.00662</a><br>                    <strong>TLDR:</strong> Numerical association rule mining is a widely used variant of the association rule hunting technique, and it has been extensively used in discovering patterns and relationships in numerical data. Initially, researchers and scientists integrated numerical attributes in association rule mines using various discretization approaches; however, over time, a plethora of alternative methods have emerged in this field. Unfortunately, the increase of alternative Methods has resulted into a significant knowledge gap in understanding diverse techniques employed in numerical association rule Mining -- this paper attempts to<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: semantic similarity</h2><details><h3>Automatic Design of Semantic Similarity Ensembles Using Grammatical  Evolution</h3>                    <strong>Authors:</strong> Jorge Martinez-Gil<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00925'>https://arxiv.org/abs/2307.00925</a><br>                    <strong>TLDR:</strong> Semantic similarity measures are widely used in natural language processing to catalyze various computer-related tasks. However, no single semantic similarity measure is the most appropriate for all tasks, and researchers often use ensemble strategies to ensure performance. This research work proposes a method for automatically designing semantic similarity ensembles. In fact, our proposed method uses grammatical evolution, for the first time, to automatically select and aggregate measures from a pool of candidates to create an ensemble that maximizes correlation to human<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: similarity measure</h2><details><h3>Applications of Binary Similarity and Distance Measures</h3>                    <strong>Authors:</strong> Manoj Muniswamaiah, Tilak Agerwala, Charles C. Tappert<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00411'>https://arxiv.org/abs/2307.00411</a><br>                    <strong>TLDR:</strong> In the recent past, binary similarity measures have been applied in solving biometric identification problems, including fingerprint, handwritten character detection, and in iris image recognition. The application of the relevant measurements has also resulted in more accurate data analysis. This paper surveys the applicability of binary similarity and distance measures in various fields.<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>Automatic Design of Semantic Similarity Ensembles Using Grammatical  Evolution</h3>                    <strong>Authors:</strong> Jorge Martinez-Gil<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00925'>https://arxiv.org/abs/2307.00925</a><br>                    <strong>TLDR:</strong> Semantic similarity measures are widely used in natural language processing to catalyze various computer-related tasks. However, no single semantic similarity measure is the most appropriate for all tasks, and researchers often use ensemble strategies to ensure performance. This research work proposes a method for automatically designing semantic similarity ensembles. In fact, our proposed method uses grammatical evolution, for the first time, to automatically select and aggregate measures from a pool of candidates to create an ensemble that maximizes correlation to human<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: summarization</h2><details><h3>Challenges in Domain-Specific Abstractive Summarization and How to  Overcome them</h3>                    <strong>Authors:</strong> Anum Afzal, Juraj Vladika, Daniel Braun, Florian Matthes<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00963'>https://arxiv.org/abs/2307.00963</a><br>                    <strong>TLDR:</strong> Large Language Models work quite well with general-purpose data and many tasks in Natural Language Processing. However, they show several limitations when used for a task such as domain-specific abstractive text summarization. This paper identifies three of those limitations as research problems in the context of abstractive Text summarization: 1) Quadratic complexity of transformer-based models with respect to the input text length; 2) Model Hallucination, which is a model's ability to generate factually<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details><h2>Keyword: text generation</h2><details><h3>More for Less: Compact Convolutional Transformers Enable Robust Medical  Image Classification with Limited Data</h3>                    <strong>Authors:</strong> Andrew Kean Gao<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00213'>https://arxiv.org/abs/2307.00213</a><br>                    <strong>TLDR:</strong> Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings, where high quality labeled data can be challenging or expensive to obtain. This study investigates the efficacy of Compact Convolutional Transformers (CCT) for robust medical image classification with limited data, addressing a key issue faced by conventional Vision Transformers - their requirement for large datasets. A hybrid<br>                    <strong>Repo:  <a href='None'>None</a><strong><br><h3>PatternGPT :A Pattern-Driven Framework for Large Language Model Text  Generation</h3>                    <strong>Authors:</strong> Le Xiao, Xin Shan<br>                    <strong>Arxiv:</strong> <a href='https://arxiv.org/abs/2307.00470'>https://arxiv.org/abs/2307.00470</a><br>                    <strong>TLDR:</strong> Large language models(LLMs) have shown excellent text generation capabilities, but there is still much space for improvement in accuracy, sometimes with grammatical errors, semantic inaccuracies, and contextual incoherence, which seriously affect the reliability of the models. These problems may originate from the difficulties and limitations encountered in the pattern extraction stage of large language models. How to utilize the generative power of large languages models to generate as many possible patterns that help solve problems and find the optimal patterns from them<br>                    <strong>Repo:  <a href='None'>None</a><strong><br></details>

New submissions for Tue, 4 Jul 23 #388

Description

Keyword: contrastive

Improving the Transferability of Time Series Forecasting with Decomposition Adaptation

Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation

Hierarchical Pretraining for Biomedical Term Embeddings

BioCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval

CLIMAX: An exploration of Classifier-Based Contrastive Explanations

Resistive memory-based zero-shot liquid state machine for multimodal event data learning

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Autism Spectrum Disorder Classification in Children based on Structural MRI Features Extracted using Contrastive Variational Autoencoder

ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning

Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis

Don't freeze: Finetune encoders for better Self-Supervised HAR

Keyword: data augmentation

Investigating Masking-based Data Generation in Language Models

Counterfactual Collaborative Reasoning

Unsupervised Coordinate-Based Video Denoising

Long-Tailed Continual Learning For Visual Food Recognition

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

CNN-BiLSTM model for English Handwriting Recognition: Comprehensive Evaluation on the IAM Dataset

ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning

Keyword: knowledge graph

Hierarchical Pretraining for Biomedical Term Embeddings

PersonaGen: A Tool for Generating Personas from User Feedback

Semi-automated Thermal Envelope Model Setup for Adaptive Model Predictive Control with Event-triggered System Identification

CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction

Iterative Zero-Shot LLM Prompting for Knowledge Graph Construction

Keyword: legal

From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and Privacy

Keyword: mixup

Augmenting Deep Learning Adaptation for Wearable Sensor Data through Combined Temporal-Frequency Image Encoding

Keyword: multi-task

CollabKG: A Learnable Human-Machine-Cooperative Information Extraction Toolkit for (Event) Knowledge Graph Construction

Towards Suicide Prevention from Bipolar Disorder with Temporal Symptom-Aware Multitask Learning

Visual Instruction Tuning with Polite Flamingo

Keyword: robustness

RObotic MAnipulation Network (ROMAN) -- Hybrid Hierarchical Learning for Solving Complex Sequential Tasks

Unsupervised Coordinate-Based Video Denoising

More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data

SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency

Launcher Attitude Control based on Incremental Nonlinear Dynamic Inversion: A Feasibility Study Towards Fast and Robust Design Approaches

CasTGAN: Cascaded Generative Adversarial Network for Realistic Tabular Data Synthesis

Brightness-Restricted Adversarial Attack Patch

Query-Efficient Decision-based Black-Box Patch Attack

POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static Environments

Variational Autoencoding Molecular Graphs with Denoising Diffusion Probabilistic Model

SDC-HSDD-NDSA: Structure Detecting Cluster by Hierarchical Secondary Directed Differential with Normalized Density and Self-Adaption

Scratch Team of Single-Rotor Robots and Decentralized Cooperative Transportation with Robot Failure

Feasibility of Universal Anomaly Detection without Knowing the Abnormality in Medical Images

ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection

Learning Noise-Resistant Image Representation by Aligning Clean and Noisy Domains

Cryptography and Key Management Schemes for Wireless Sensor Networks

Enhancing the Robustness of QMIX against State-adversarial Attacks

Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration

SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation

Anonymous and Copy-Robust Delegations for Liquid Democracy

Keyword: scholarly

Numerical Association Rule Mining: A Systematic Literature Review

Keyword: semantic similarity

Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution

Keyword: similarity measure

Applications of Binary Similarity and Distance Measures

Automatic Design of Semantic Similarity Ensembles Using Grammatical Evolution

Keyword: summarization

Challenges in Domain-Specific Abstractive Summarization and How to Overcome them

Keyword: text generation

More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data

PatternGPT :A Pattern-Driven Framework for Large Language Model Text Generation

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions