Skip to content

Commit ce2f26e

Browse files
OjaswinMtytso
authored andcommitted
ext4: avoid journaling sb update on error if journal is destroying
Presently we always BUG_ON if trying to start a transaction on a journal marked with JBD2_UNMOUNT, since this should never happen. However, while ltp running stress tests, it was observed that in case of some error handling paths, it is possible for update_super_work to start a transaction after the journal is destroyed eg: (umount) ext4_kill_sb kill_block_super generic_shutdown_super sync_filesystem /* commits all txns */ evict_inodes /* might start a new txn */ ext4_put_super flush_work(&sbi->s_sb_upd_work) /* flush the workqueue */ jbd2_journal_destroy journal_kill_thread journal->j_flags |= JBD2_UNMOUNT; jbd2_journal_commit_transaction jbd2_journal_get_descriptor_buffer jbd2_journal_bmap ext4_journal_bmap ext4_map_blocks ... ext4_inode_error ext4_handle_error schedule_work(&sbi->s_sb_upd_work) /* work queue kicks in */ update_super_work jbd2_journal_start start_this_handle BUG_ON(journal->j_flags & JBD2_UNMOUNT) Hence, introduce a new mount flag to indicate journal is destroying and only do a journaled (and deferred) update of sb if this flag is not set. Otherwise, just fallback to an un-journaled commit. Further, in the journal destroy path, we have the following sequence: 1. Set mount flag indicating journal is destroying 2. force a commit and wait for it 3. flush pending sb updates This sequence is important as it ensures that, after this point, there is no sb update that might be journaled so it is safe to update the sb outside the journal. (To avoid race discussed in 2d01ddc) Also, we don't need a similar check in ext4_grp_locked_error since it is only called from mballoc and AFAICT it would be always valid to schedule work here. Fixes: 2d01ddc ("ext4: save error info to sb through journal if available") Reported-by: Mahesh Kumar <maheshkumar657g@gmail.com> Signed-off-by: Ojaswin Mujoo <ojaswin@linux.ibm.com> Reviewed-by: Jan Kara <jack@suse.cz> Link: https://patch.msgid.link/9613c465d6ff00cd315602f99283d5f24018c3f7.1742279837.git.ojaswin@linux.ibm.com Signed-off-by: Theodore Ts'o <tytso@mit.edu>
1 parent 5a02a62 commit ce2f26e

File tree

3 files changed

+25
-9
lines changed

3 files changed

+25
-9
lines changed

fs/ext4/ext4.h

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1824,7 +1824,8 @@ static inline int ext4_valid_inum(struct super_block *sb, unsigned long ino)
18241824
*/
18251825
enum {
18261826
EXT4_MF_MNTDIR_SAMPLED,
1827-
EXT4_MF_FC_INELIGIBLE /* Fast commit ineligible */
1827+
EXT4_MF_FC_INELIGIBLE, /* Fast commit ineligible */
1828+
EXT4_MF_JOURNAL_DESTROY /* Journal is in process of destroying */
18281829
};
18291830

18301831
static inline void ext4_set_mount_flag(struct super_block *sb, int bit)

fs/ext4/ext4_jbd2.h

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -437,6 +437,21 @@ static inline int ext4_journal_destroy(struct ext4_sb_info *sbi, journal_t *jour
437437
{
438438
int err = 0;
439439

440+
/*
441+
* At this point only two things can be operating on the journal.
442+
* JBD2 thread performing transaction commit and s_sb_upd_work
443+
* issuing sb update through the journal. Once we set
444+
* EXT4_JOURNAL_DESTROY, new ext4_handle_error() calls will not
445+
* queue s_sb_upd_work and ext4_force_commit() makes sure any
446+
* ext4_handle_error() calls from the running transaction commit are
447+
* finished. Hence no new s_sb_upd_work can be queued after we
448+
* flush it here.
449+
*/
450+
ext4_set_mount_flag(sbi->s_sb, EXT4_MF_JOURNAL_DESTROY);
451+
452+
ext4_force_commit(sbi->s_sb);
453+
flush_work(&sbi->s_sb_upd_work);
454+
440455
err = jbd2_journal_destroy(journal);
441456
sbi->s_journal = NULL;
442457

fs/ext4/super.c

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -702,9 +702,13 @@ static void ext4_handle_error(struct super_block *sb, bool force_ro, int error,
702702
* In case the fs should keep running, we need to writeout
703703
* superblock through the journal. Due to lock ordering
704704
* constraints, it may not be safe to do it right here so we
705-
* defer superblock flushing to a workqueue.
705+
* defer superblock flushing to a workqueue. We just need to be
706+
* careful when the journal is already shutting down. If we get
707+
* here in that case, just update the sb directly as the last
708+
* transaction won't commit anyway.
706709
*/
707-
if (continue_fs && journal)
710+
if (continue_fs && journal &&
711+
!ext4_test_mount_flag(sb, EXT4_MF_JOURNAL_DESTROY))
708712
schedule_work(&EXT4_SB(sb)->s_sb_upd_work);
709713
else
710714
ext4_commit_super(sb);
@@ -1290,7 +1294,6 @@ static void ext4_put_super(struct super_block *sb)
12901294
ext4_unregister_li_request(sb);
12911295
ext4_quotas_off(sb, EXT4_MAXQUOTAS);
12921296

1293-
flush_work(&sbi->s_sb_upd_work);
12941297
destroy_workqueue(sbi->rsv_conversion_wq);
12951298
ext4_release_orphan_info(sb);
12961299

@@ -1300,7 +1303,8 @@ static void ext4_put_super(struct super_block *sb)
13001303
if ((err < 0) && !aborted) {
13011304
ext4_abort(sb, -err, "Couldn't clean up the journal");
13021305
}
1303-
}
1306+
} else
1307+
flush_work(&sbi->s_sb_upd_work);
13041308

13051309
ext4_es_unregister_shrinker(sbi);
13061310
timer_shutdown_sync(&sbi->s_err_report);
@@ -4971,8 +4975,6 @@ static int ext4_load_and_init_journal(struct super_block *sb,
49714975
return 0;
49724976

49734977
out:
4974-
/* flush s_sb_upd_work before destroying the journal. */
4975-
flush_work(&sbi->s_sb_upd_work);
49764978
ext4_journal_destroy(sbi, sbi->s_journal);
49774979
return -EINVAL;
49784980
}
@@ -5669,8 +5671,6 @@ failed_mount8: __maybe_unused
56695671
sbi->s_ea_block_cache = NULL;
56705672

56715673
if (sbi->s_journal) {
5672-
/* flush s_sb_upd_work before journal destroy. */
5673-
flush_work(&sbi->s_sb_upd_work);
56745674
ext4_journal_destroy(sbi, sbi->s_journal);
56755675
}
56765676
failed_mount3a:

0 commit comments

Comments
 (0)