Skip to content

Commit fd8c35a

Browse files
x-y-zakpm00
authored andcommitted
mm/numa: no task_numa_fault() call if PMD is changed
When handling a numa page fault, task_numa_fault() should be called by a process that restores the page table of the faulted folio to avoid duplicated stats counting. Commit c5b5a3d ("mm: thp: refactor NUMA fault handling") restructured do_huge_pmd_numa_page() and did not avoid task_numa_fault() call in the second page table check after a numa migration failure. Fix it by making all !pmd_same() return immediately. This issue can cause task_numa_fault() being called more than necessary and lead to unexpected numa balancing results (It is hard to tell whether the issue will cause positive or negative performance impact due to duplicated numa fault counting). Link: https://lkml.kernel.org/r/20240809145906.1513458-3-ziy@nvidia.com Fixes: c5b5a3d ("mm: thp: refactor NUMA fault handling") Reported-by: "Huang, Ying" <ying.huang@intel.com> Closes: https://lore.kernel.org/linux-mm/87zfqfw0yw.fsf@yhuang6-desk2.ccr.corp.intel.com/ Signed-off-by: Zi Yan <ziy@nvidia.com> Acked-by: David Hildenbrand <david@redhat.com> Cc: Baolin Wang <baolin.wang@linux.alibaba.com> Cc: "Huang, Ying" <ying.huang@intel.com> Cc: Kefeng Wang <wangkefeng.wang@huawei.com> Cc: Mel Gorman <mgorman@suse.de> Cc: Yang Shi <shy828301@gmail.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
1 parent 40b760c commit fd8c35a

File tree

1 file changed

+13
-16
lines changed

1 file changed

+13
-16
lines changed

mm/huge_memory.c

Lines changed: 13 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1685,7 +1685,7 @@ vm_fault_t do_huge_pmd_numa_page(struct vm_fault *vmf)
16851685
vmf->ptl = pmd_lock(vma->vm_mm, vmf->pmd);
16861686
if (unlikely(!pmd_same(oldpmd, *vmf->pmd))) {
16871687
spin_unlock(vmf->ptl);
1688-
goto out;
1688+
return 0;
16891689
}
16901690

16911691
pmd = pmd_modify(oldpmd, vma->vm_page_prot);
@@ -1728,22 +1728,16 @@ vm_fault_t do_huge_pmd_numa_page(struct vm_fault *vmf)
17281728
if (!migrate_misplaced_folio(folio, vma, target_nid)) {
17291729
flags |= TNF_MIGRATED;
17301730
nid = target_nid;
1731-
} else {
1732-
flags |= TNF_MIGRATE_FAIL;
1733-
vmf->ptl = pmd_lock(vma->vm_mm, vmf->pmd);
1734-
if (unlikely(!pmd_same(oldpmd, *vmf->pmd))) {
1735-
spin_unlock(vmf->ptl);
1736-
goto out;
1737-
}
1738-
goto out_map;
1739-
}
1740-
1741-
out:
1742-
if (nid != NUMA_NO_NODE)
17431731
task_numa_fault(last_cpupid, nid, HPAGE_PMD_NR, flags);
1732+
return 0;
1733+
}
17441734

1745-
return 0;
1746-
1735+
flags |= TNF_MIGRATE_FAIL;
1736+
vmf->ptl = pmd_lock(vma->vm_mm, vmf->pmd);
1737+
if (unlikely(!pmd_same(oldpmd, *vmf->pmd))) {
1738+
spin_unlock(vmf->ptl);
1739+
return 0;
1740+
}
17471741
out_map:
17481742
/* Restore the PMD */
17491743
pmd = pmd_modify(oldpmd, vma->vm_page_prot);
@@ -1753,7 +1747,10 @@ vm_fault_t do_huge_pmd_numa_page(struct vm_fault *vmf)
17531747
set_pmd_at(vma->vm_mm, haddr, vmf->pmd, pmd);
17541748
update_mmu_cache_pmd(vma, vmf->address, vmf->pmd);
17551749
spin_unlock(vmf->ptl);
1756-
goto out;
1750+
1751+
if (nid != NUMA_NO_NODE)
1752+
task_numa_fault(last_cpupid, nid, HPAGE_PMD_NR, flags);
1753+
return 0;
17571754
}
17581755

17591756
/*

0 commit comments

Comments
 (0)