Changelog in Linux kernel 6.16.2

Linux: (powerpc/512) Fix possible `dma_unmap_single()` on uninitialized pointer [+ + +]

Author: Thomas Fourier <fourier.thomas@gmail.com>
Date:   Tue Jun 10 16:29:11 2025 +0200

    (powerpc/512) Fix possible `dma_unmap_single()` on uninitialized pointer
    
    [ Upstream commit 760b9b4f6de9a33ca56a05f950cabe82138d25bd ]
    
    If the device configuration fails (if `dma_dev->device_config()`),
    `sg_dma_address(&sg)` is not initialized and the jump to `err_dma_prep`
    leads to calling `dma_unmap_single()` on `sg_dma_address(&sg)`.
    
    Signed-off-by: Thomas Fourier <fourier.thomas@gmail.com>
    Reviewed-by: Christophe Leroy <christophe.leroy@csgroup.eu>
    Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Link: https://patch.msgid.link/20250610142918.169540-2-fourier.thomas@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ACPI: APEI: GHES: add TAINT_MACHINE_CHECK on GHES panic path [+ + +]

Author: Breno Leitao <leitao@debian.org>
Date:   Wed Jul 2 08:39:51 2025 -0700

    ACPI: APEI: GHES: add TAINT_MACHINE_CHECK on GHES panic path
    
    [ Upstream commit 4734c8b46b901cff2feda8b82abc710b65dc31c1 ]
    
    When a GHES (Generic Hardware Error Source) triggers a panic, add the
    TAINT_MACHINE_CHECK taint flag to the kernel. This explicitly marks the
    kernel as tainted due to a machine check event, improving diagnostics
    and post-mortem analysis. The taint is set with LOCKDEP_STILL_OK to
    indicate lockdep remains valid.
    
    At large scale deployment, this helps to quickly determine panics that
    are coming due to hardware failures.
    
    Signed-off-by: Breno Leitao <leitao@debian.org>
    Reviewed-by: Tony Luck <tony.luck@intel.com>
    Link: https://patch.msgid.link/20250702-add_tain-v1-1-9187b10914b9@debian.org
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ACPI: APEI: send SIGBUS to current task if synchronous memory error not recovered [+ + +]

Author: Shuai Xue <xueshuai@linux.alibaba.com>
Date:   Mon Jul 14 19:42:11 2025 +0800

    ACPI: APEI: send SIGBUS to current task if synchronous memory error not recovered
    
    [ Upstream commit 79a5ae3c4c5eb7e38e0ebe4d6bf602d296080060 ]
    
    If a synchronous error is detected as a result of user-space process
    triggering a 2-bit uncorrected error, the CPU will take a synchronous
    error exception such as Synchronous External Abort (SEA) on Arm64. The
    kernel will queue a memory_failure() work which poisons the related
    page, unmaps the page, and then sends a SIGBUS to the process, so that
    a system wide panic can be avoided.
    
    However, no memory_failure() work will be queued when abnormal
    synchronous errors occur. These errors can include situations like
    invalid PA, unexpected severity, no memory failure config support,
    invalid GUID section, etc. In such a case, the user-space process will
    trigger SEA again.  This loop can potentially exceed the platform
    firmware threshold or even trigger a kernel hard lockup, leading to a
    system reboot.
    
    Fix it by performing a force kill if no memory_failure() work is queued
    for synchronous errors.
    
    Signed-off-by: Shuai Xue <xueshuai@linux.alibaba.com>
    Reviewed-by: Jarkko Sakkinen <jarkko@kernel.org>
    Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Reviewed-by: Yazen Ghannam <yazen.ghannam@amd.com>
    Reviewed-by: Jane Chu <jane.chu@oracle.com>
    Reviewed-by: Hanjun Guo <guohanjun@huawei.com>
    Link: https://patch.msgid.link/20250714114212.31660-2-xueshuai@linux.alibaba.com
    [ rjw: Changelog edits ]
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ACPI: EC: Relax sanity check of the ECDT ID string [+ + +]

Author: Armin Wolf <W_Armin@gmx.de>
Date:   Tue Jul 29 08:20:38 2025 +0200

    ACPI: EC: Relax sanity check of the ECDT ID string
    
    commit 963e22c084c2b6097e1e635d29c6336881f67708 upstream.
    
    It turns out that the ECDT table inside the ThinkBook 14 G7 IML
    contains a valid EC description but an invalid ID string
    ("_SB.PC00.LPCB.EC0"). Ignoring this ECDT based on the invalid
    ID string prevents the kernel from detecting the built-in touchpad,
    so relax the sanity check of the ID string and only reject ECDTs
    with empty ID strings.
    
    Reported-by: Ilya K <me@0upti.me>
    Fixes: 7a0d59f6a913 ("ACPI: EC: Ignore ECDT tables with an invalid ID string")
    Signed-off-by: Armin Wolf <W_Armin@gmx.de>
    Tested-by: Ilya K <me@0upti.me>
    Link: https://patch.msgid.link/20250729062038.303734-1-W_Armin@gmx.de
    Cc: 6.16+ <stable@vger.kernel.org> # 6.16+
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ACPI: PRM: Reduce unnecessary printing to avoid user confusion [+ + +]

Author: Zhu Qiyu <qiyuzhu2@amd.com>
Date:   Fri Jul 4 01:41:04 2025 +0000

    ACPI: PRM: Reduce unnecessary printing to avoid user confusion
    
    [ Upstream commit 3db5648c4d608b5483470efc1da9780b081242dd ]
    
    Commit 088984c8d54c ("ACPI: PRM: Find EFI_MEMORY_RUNTIME block for PRM
    handler and context") introduced non-essential printing "Failed to find
    VA for GUID: xxxx, PA: 0x0" which may confuse users to think that
    something wrong is going on while it is not the case.
    
    According to the PRM Spec Section 4.1.2 [1], both static data buffer
    address and ACPI parameter buffer address may be NULL if they are not
    needed, so there is no need to print out the "Failed to find VA ... "
    in those cases.
    
    Link: https://uefi.org/sites/default/files/resources/Platform%20Runtime%20Mechanism%20-%20with%20legal%20notice.pdf # [1]
    Signed-off-by: Zhu Qiyu <qiyuzhu2@amd.com>
    Link: https://patch.msgid.link/20250704014104.82524-1-qiyuzhu2@amd.com
    [ rjw: Edits in new comments, subject and changelog ]
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ACPI: processor: fix acpi_object initialization [+ + +]

Author: Sebastian Ott <sebott@redhat.com>
Date:   Thu Jul 3 14:42:15 2025 +0200

    ACPI: processor: fix acpi_object initialization
    
    [ Upstream commit 13edf7539211d8f7d0068ce3ed143005f1da3547 ]
    
    Initialization of the local acpi_object in acpi_processor_get_info()
    only sets the first 4 bytes to zero and is thus incomplete. This is
    indicated by messages like:
            acpi ACPI0007:be: Invalid PBLK length [166288104]
    
    Fix this by initializing all 16 bytes of the processor member of that
    union.
    
    Signed-off-by: Sebastian Ott <sebott@redhat.com>
    Link: https://patch.msgid.link/20250703124215.12522-1-sebott@redhat.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ACPI: processor: perflib: Fix initial _PPC limit application [+ + +]

Author: Jiayi Li <lijiayi@kylinos.cn>
Date:   Mon Jul 21 11:26:06 2025 +0800

    ACPI: processor: perflib: Fix initial _PPC limit application
    
    commit d33bd88ac0ebb49e7f7c8f29a8c7ee9eae85d765 upstream.
    
    If the BIOS sets a _PPC frequency limit upfront, it will fail to take
    effect due to a call ordering issue.  Namely, freq_qos_update_request()
    is called before freq_qos_add_request() for the given request causing
    the constraint update to be ignored.  The call sequence in question is
    as follows:
    
    cpufreq_policy_online()
      acpi_cpufreq_cpu_init()
        acpi_processor_register_performance()
          acpi_processor_get_performance_info()
            acpi_processor_get_platform_limit()
             freq_qos_update_request(&perflib_req) <- inactive QoS request
      blocking_notifier_call_chain(&cpufreq_policy_notifier_list,
                                   CPUFREQ_CREATE_POLICY)
        acpi_processor_notifier()
          acpi_processor_ppc_init()
            freq_qos_add_request(&perflib_req) <- QoS request activation
    
    Address this by adding an acpi_processor_get_platform_limit() call
    to acpi_processor_ppc_init(), after the perflib_req activation via
    freq_qos_add_request(), which causes the initial _PPC limit to be
    picked up as appropriate.  However, also ensure that the _PPC limit
    will not be picked up in the cases when the cpufreq driver does not
    call acpi_processor_register_performance() by adding a pr->performance
    check to the related_cpus loop in acpi_processor_ppc_init().
    
    Fixes: d15ce412737a ("ACPI: cpufreq: Switch to QoS requests instead of cpufreq notifier")
    Signed-off-by: Jiayi Li <lijiayi@kylinos.cn>
    Link: https://patch.msgid.link/20250721032606.3459369-1-lijiayi@kylinos.cn
    [ rjw: Consolidate pr-related checks in acpi_processor_ppc_init() ]
    [ rjw: Subject and changelog adjustments ]
    Cc: 5.4+ <stable@vger.kernel.org> # 5.4+: 2d8b39a62a5d ACPI: processor: Avoid NULL pointer dereferences at init time
    Cc: 5.4+ <stable@vger.kernel.org> # 5.4+: 3000ce3c52f8 cpufreq: Use per-policy frequency QoS
    Cc: 5.4+ <stable@vger.kernel.org> # 5.4+: a1bb46c36ce3 ACPI: processor: Add QoS requests for all CPUs
    Cc: 5.4+ <stable@vger.kernel.org> # 5.4+
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ACPI: processor: perflib: Move problematic pr->performance check [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue Aug 12 14:57:06 2025 +0200

    ACPI: processor: perflib: Move problematic pr->performance check
    
    commit d405ec23df13e6df599f5bd965a55d13420366b8 upstream.
    
    Commit d33bd88ac0eb ("ACPI: processor: perflib: Fix initial _PPC limit
    application") added a pr->performance check that prevents the frequency
    QoS request from being added when the given processor has no performance
    object.  Unfortunately, this causes a WARN() in freq_qos_remove_request()
    to trigger on an attempt to take the given CPU offline later because the
    frequency QoS object has not been added for it due to the missing
    performance object.
    
    Address this by moving the pr->performance check before calling
    acpi_processor_get_platform_limit() so it only prevents a limit from
    being set for the CPU if the performance object is not present.  This
    way, the frequency QoS request is added as it was before the above
    commit and it is present all the time along with the CPU's cpufreq
    policy regardless of whether or not the CPU is online.
    
    Fixes: d33bd88ac0eb ("ACPI: processor: perflib: Fix initial _PPC limit application")
    Tested-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Cc: 5.4+ <stable@vger.kernel.org> # 5.4+
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Link: https://patch.msgid.link/2801421.mvXUDI8C0e@rafael.j.wysocki
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ACPI: Return -ENODEV from acpi_parse_spcr() when SPCR support is disabled [+ + +]

Author: Li Chen <chenl311@chinatelecom.cn>
Date:   Fri Jun 20 21:13:07 2025 +0800

    ACPI: Return -ENODEV from acpi_parse_spcr() when SPCR support is disabled
    
    commit b9f58d3572a8e1ef707b941eae58ec4014b9269d upstream.
    
    If CONFIG_ACPI_SPCR_TABLE is disabled, acpi_parse_spcr()
    currently returns 0, which may incorrectly suggest that
    SPCR parsing was successful. This patch changes the behavior
    to return -ENODEV to clearly indicate that SPCR support
    is not available.
    
    This prepares the codebase for future changes that depend
    on acpi_parse_spcr() failure detection, such as suppressing
    misleading console messages.
    
    Signed-off-by: Li Chen <chenl311@chinatelecom.cn>
    Acked-by: Hanjun Guo <guohanjun@huawei.com>
    Link: https://lore.kernel.org/r/20250620131309.126555-2-me@linux.beauty
    Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ACPI: Suppress misleading SPCR console message when SPCR table is absent [+ + +]

Author: Li Chen <chenl311@chinatelecom.cn>
Date:   Fri Jun 20 21:13:08 2025 +0800

    ACPI: Suppress misleading SPCR console message when SPCR table is absent
    
    [ Upstream commit bad3fa2fb9206f4dcec6ddef094ec2fbf6e8dcb2 ]
    
    The kernel currently alway prints:
    "Use ACPI SPCR as default console: No/Yes "
    
    even on systems that lack an SPCR table. This can
    mislead users into thinking the SPCR table exists
    on the machines without SPCR.
    
    With this change, the "Yes" is only printed if
    the SPCR table is present, parsed and !param_acpi_nospcr.
    This avoids user confusion on SPCR-less systems.
    
    Signed-off-by: Li Chen <chenl311@chinatelecom.cn>
    Acked-by: Hanjun Guo <guohanjun@huawei.com>
    Link: https://lore.kernel.org/r/20250620131309.126555-3-me@linux.beauty
    Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda/ca0132: Fix buffer overflow in add_tuning_control [+ + +]

Author: Lucy Thrun <lucy.thrun@digital-rabbithole.de>
Date:   Tue Jun 10 19:50:12 2025 +0200

    ALSA: hda/ca0132: Fix buffer overflow in add_tuning_control
    
    [ Upstream commit a409c60111e6bb98fcabab2aeaa069daa9434ca0 ]
    
    The 'sprintf' call in 'add_tuning_control' may exceed the 44-byte
    buffer if either string argument is too long. This triggers a compiler
    warning.
    Replaced 'sprintf' with 'snprintf' to limit string lengths to prevent
    overflow.
    
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202506100642.95jpuMY1-lkp@intel.com/
    Signed-off-by: Lucy Thrun <lucy.thrun@digital-rabbithole.de>
    Link: https://patch.msgid.link/20250610175012.918-3-lucy.thrun@digital-rabbithole.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda/realtek: Add Framework Laptop 13 (AMD Ryzen AI 300) to quirks [+ + +]

Author: Christopher Eby <kreed@kreed.org>
Date:   Sat Aug 9 20:00:06 2025 -0700

    ALSA: hda/realtek: Add Framework Laptop 13 (AMD Ryzen AI 300) to quirks
    
    commit 0db77eccd964b11ab2b757031d1354fcc5a025ea upstream.
    
    Framework Laptop 13 (AMD Ryzen AI 300) requires the same quirk for
    headset detection as other Framework 13 models.
    
    Signed-off-by: Christopher Eby <kreed@kreed.org>
    Cc: <stable@vger.kernel.org>
    Link: https://patch.msgid.link/20250810030006.9060-1-kreed@kreed.org
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: hda/realtek: add LG gram 16Z90R-A to alc269 fixup table [+ + +]

Author: Thomas Croft <thomasmcft@gmail.com>
Date:   Mon Aug 4 09:12:07 2025 -0600

    ALSA: hda/realtek: add LG gram 16Z90R-A to alc269 fixup table
    
    [ Upstream commit dbe05428c4e54068a86e7e02405f3b30b1d2b3dd ]
    
    Several months ago, Joshua Grisham submitted a patch [1]
    for several ALC298 based sound cards.
    
    The entry for the LG gram 16 in the alc269_fixup_tbl only matches the
    Subsystem ID for the 16Z90R-Q and 16Z90R-K models [2]. My 16Z90R-A has a
    different Subsystem ID [3]. I'm not sure why these IDs differ, but I
    speculate it's due to the NVIDIA GPU included in the 16Z90R-A model that
    isn't present in the other models.
    
    I applied the patch to the latest Arch Linux kernel and the card was
    initialized as expected.
    
    [1]: https://lore.kernel.org/linux-sound/20240909193000.838815-1-josh@joshuagrisham.com/
    [2]: https://linux-hardware.org/?id=pci:8086-51ca-1854-0488
    [3]: https://linux-hardware.org/?id=pci:8086-51ca-1854-0489
    
    Signed-off-by: Thomas Croft <thomasmcft@gmail.com>
    Link: https://patch.msgid.link/20250804151457.134761-2-thomasmcft@gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda/realtek: Fix headset mic on HONOR BRB-X [+ + +]

Author: Vasiliy Kovalev <kovalev@altlinux.org>
Date:   Mon Aug 11 16:27:16 2025 +0300

    ALSA: hda/realtek: Fix headset mic on HONOR BRB-X
    
    commit b26e2afb3834d4a61ce54c8484ff6014bef0b4b7 upstream.
    
    Add a PCI quirk to enable microphone input on the headphone jack on
    the HONOR BRB-X M1010 laptop.
    
    Signed-off-by: Vasiliy Kovalev <kovalev@altlinux.org>
    Cc: <stable@vger.kernel.org>
    Link: https://patch.msgid.link/20250811132716.45076-1-kovalev@altlinux.org
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: hda: add MODULE_FIRMWARE for cs35l41/cs35l56 [+ + +]

Author: GalaxySnail <me@glxys.nl>
Date:   Tue Jun 24 18:17:17 2025 +0800

    ALSA: hda: add MODULE_FIRMWARE for cs35l41/cs35l56
    
    [ Upstream commit 6eda9429501508196001845998bb8c73307d311a ]
    
    add firmware information in the .modinfo section, so that userspace
    tools can find out firmware required by cs35l41/cs35l56 kernel module
    
    Signed-off-by: GalaxySnail <me@glxys.nl>
    Reviewed-by: Richard Fitzgerald <rf@opensource.cirrus.com>
    Link: https://patch.msgid.link/20250624101716.2365302-2-me@glxys.nl
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda: Disable jack polling at shutdown [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Mon Jun 23 15:14:30 2025 +0200

    ALSA: hda: Disable jack polling at shutdown
    
    [ Upstream commit 1adcbdf54f76e1004bdf71df4eb1888c26e7ad06 ]
    
    Although the jack polling is canceled at shutdown in
    snd_hda_codec_shutdown(), it might be still re-triggered when the work
    is being processed at cancel_delayed_work_sync() call.  This may
    result in the unexpected hardware access that should have been already
    disabled.
    
    For assuring to stop the jack polling, clear codec->jackpoll_interval
    at shutdown.
    
    Reported-by: Joakim Zhang <joakim.zhang@cixtech.com>
    Closes: https://lore.kernel.org/20250619020844.2974160-4-joakim.zhang@cixtech.com
    Tested-by: Joakim Zhang <joakim.zhang@cixtech.com>
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Link: https://patch.msgid.link/20250623131437.10670-2-tiwai@suse.de
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda: Handle the jack polling always via a work [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Mon Jun 23 15:14:32 2025 +0200

    ALSA: hda: Handle the jack polling always via a work
    
    [ Upstream commit 5f7e54b23e4d253eff3b10b12d6fa92d28d7dddc ]
    
    We used to call directly hda_jackpoll_work() from a couple of places
    for updating the jack and notify to user-space, but this makes rather
    the code flow fragile.  Namely, because of those direct calls,
    hda_jackpoll_work() uses snd_hda_power_up_pm() and *_down_pm() calls
    instead of the standard snd_hda_power_up() and *_down() calls.  The
    latter pair assures the runtime PM resume sync, so it can avoid the
    race against the PM callbacks gracefully, while the former pair may
    continue if called concurrently, hence it may race (by design).
    
    In this patch, we change the call pattern of hda_jackpoll_work(); now
    all callers are replaced with the standard snd_hda_jack_report_sync()
    and the additional schedule_delayed_work().
    
    Since hda_jackpoll_work() is called only from the associated work,
    it's always outside the PM code path, and we can safely use
    snd_hda_power_up() and *_down() there instead.  This allows us to
    remove the racy check of power-state in hda_jackpoll_work(), as well
    as the tricky cancel_delayed_work() and rescheduling at
    hda_codec_runtime_suspend().
    
    Reported-by: Joakim Zhang <joakim.zhang@cixtech.com>
    Closes: https://lore.kernel.org/20250619020844.2974160-1-joakim.zhang@cixtech.com
    Tested-by: Joakim Zhang <joakim.zhang@cixtech.com>
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Link: https://patch.msgid.link/20250623131437.10670-4-tiwai@suse.de
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: intel8x0: Fix incorrect codec index usage in mixer for ICH4 [+ + +]

Author: Alok Tiwari <alok.a.tiwari@oracle.com>
Date:   Sat Jun 21 11:52:24 2025 -0700

    ALSA: intel8x0: Fix incorrect codec index usage in mixer for ICH4
    
    [ Upstream commit 87aafc8580acf87fcaf1a7e30ed858d8c8d37d81 ]
    
    code mistakenly used a hardcoded index (codec[1]) instead of
    iterating, over the codec array using the loop variable i.
    Use codec[i] instead of codec[1] to match the loop iteration.
    
    Signed-off-by: Alok Tiwari <alok.a.tiwari@oracle.com>
    Link: https://patch.msgid.link/20250621185233.4081094-1-alok.a.tiwari@oracle.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: pcm: Rewrite recalculate_boundary() to avoid costly loop [+ + +]

Author: Christophe Leroy <christophe.leroy@csgroup.eu>
Date:   Fri Jun 6 11:44:02 2025 +0200

    ALSA: pcm: Rewrite recalculate_boundary() to avoid costly loop
    
    [ Upstream commit 92f59aeb13252265c20e7aef1379a8080c57e0a2 ]
    
    At the time being recalculate_boundary() is implemented with a
    loop which shows up as costly in a perf profile, as depicted by
    the annotate below:
    
        0.00 :   c057e934:       3d 40 7f ff     lis     r10,32767
        0.03 :   c057e938:       61 4a ff ff     ori     r10,r10,65535
        0.21 :   c057e93c:       7d 49 50 50     subf    r10,r9,r10
        5.39 :   c057e940:       7d 3c 4b 78     mr      r28,r9
        2.11 :   c057e944:       55 29 08 3c     slwi    r9,r9,1
        3.04 :   c057e948:       7c 09 50 40     cmplw   r9,r10
        2.47 :   c057e94c:       40 81 ff f4     ble     c057e940 <snd_pcm_ioctl+0xee0>
    
    Total: 13.2% on that simple loop.
    
    But what the loop does is to multiply the boundary by 2 until it is
    over the wanted border. This can be avoided by using fls() to get the
    boundary value order and shift it by the appropriate number of bits at
    once.
    
    This change provides the following profile:
    
        0.04 :   c057f6e8:       3d 20 7f ff     lis     r9,32767
        0.02 :   c057f6ec:       61 29 ff ff     ori     r9,r9,65535
        0.34 :   c057f6f0:       7d 5a 48 50     subf    r10,r26,r9
        0.23 :   c057f6f4:       7c 1a 50 40     cmplw   r26,r10
        0.02 :   c057f6f8:       41 81 00 20     bgt     c057f718 <snd_pcm_ioctl+0xf08>
        0.26 :   c057f6fc:       7f 47 00 34     cntlzw  r7,r26
        0.09 :   c057f700:       7d 48 00 34     cntlzw  r8,r10
        0.22 :   c057f704:       7d 08 38 50     subf    r8,r8,r7
        0.04 :   c057f708:       7f 5a 40 30     slw     r26,r26,r8
        0.35 :   c057f70c:       7c 0a d0 40     cmplw   r10,r26
        0.13 :   c057f710:       40 80 05 f8     bge     c057fd08 <snd_pcm_ioctl+0x14f8>
        0.00 :   c057f714:       57 5a f8 7e     srwi    r26,r26,1
    
    Total: 1.7% with that loopless alternative.
    
    Signed-off-by: Christophe Leroy <christophe.leroy@csgroup.eu>
    Link: https://patch.msgid.link/4836e2cde653eebaf2709ebe30eec736bb8c67fd.1749202237.git.christophe.leroy@csgroup.eu
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: usb-audio: Avoid precedence issues in mixer_quirks macros [+ + +]

Author: Cristian Ciocaltea <cristian.ciocaltea@collabora.com>
Date:   Mon May 26 17:07:42 2025 +0300

    ALSA: usb-audio: Avoid precedence issues in mixer_quirks macros
    
    [ Upstream commit fd3ab72e42e9871a9902b945a2bf8bb87b49c718 ]
    
    Fix all macro related issues identified by checkpatch.pl:
    
      CHECK: Macro argument 'x' may be better as '(x)' to avoid precedence issues
    
    Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com>
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Link: https://patch.msgid.link/20250526-dualsense-alsa-jack-v1-3-1a821463b632@collabora.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: usb-audio: Validate UAC3 cluster segment descriptors [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Thu Aug 14 10:12:43 2025 +0200

    ALSA: usb-audio: Validate UAC3 cluster segment descriptors
    
    commit ecfd41166b72b67d3bdeb88d224ff445f6163869 upstream.
    
    UAC3 class segment descriptors need to be verified whether their sizes
    match with the declared lengths and whether they fit with the
    allocated buffer sizes, too.  Otherwise malicious firmware may lead to
    the unexpected OOB accesses.
    
    Fixes: 11785ef53228 ("ALSA: usb-audio: Initial Power Domain support")
    Reported-and-tested-by: Youngjun Lee <yjjuny.lee@samsung.com>
    Cc: <stable@vger.kernel.org>
    Link: https://patch.msgid.link/20250814081245.8902-2-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: usb-audio: Validate UAC3 power domain descriptors, too [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Thu Aug 14 10:12:42 2025 +0200

    ALSA: usb-audio: Validate UAC3 power domain descriptors, too
    
    commit d832ccbc301fbd9e5a1d691bdcf461cdb514595f upstream.
    
    UAC3 power domain descriptors need to be verified with its variable
    bLength for avoiding the unexpected OOB accesses by malicious
    firmware, too.
    
    Fixes: 9a2fe9b801f5 ("ALSA: usb: initial USB Audio Device Class 3.0 support")
    Reported-and-tested-by: Youngjun Lee <yjjuny.lee@samsung.com>
    Cc: <stable@vger.kernel.org>
    Link: https://patch.msgid.link/20250814081245.8902-1-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

apparmor: fix x_table_lookup when stacking is not the first entry [+ + +]

Author: John Johansen <john.johansen@canonical.com>
Date:   Tue Jan 2 21:54:30 2024 -0800

    apparmor: fix x_table_lookup when stacking is not the first entry
    
    [ Upstream commit a9eb185be84e998aa9a99c7760534ccc06216705 ]
    
    x_table_lookup currently does stacking during label_parse() if the
    target specifies a stack but its only caller ensures that it will
    never be used with stacking.
    
    Refactor to slightly simplify the code in x_to_label(), this
    also fixes a long standing problem where x_to_labels check on stacking
    is only on the first element to the table option list, instead of
    the element that is found and used.
    
    Signed-off-by: John Johansen <john.johansen@canonical.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

apparmor: shift ouid when mediating hard links in userns [+ + +]

Author: Gabriel Totev <gabriel.totev@zetier.com>
Date:   Wed Apr 16 18:42:08 2025 -0400

    apparmor: shift ouid when mediating hard links in userns
    
    [ Upstream commit c5bf96d20fd787e4909b755de4705d52f3458836 ]
    
    When using AppArmor profiles inside an unprivileged container,
    the link operation observes an unshifted ouid.
    (tested with LXD and Incus)
    
    For example, root inside container and uid 1000000 outside, with
    `owner /root/link l,` profile entry for ln:
    
    /root$ touch chain && ln chain link
    ==> dmesg
    apparmor="DENIED" operation="link" class="file"
    namespace="root//lxd-feet_<var-snap-lxd-common-lxd>" profile="linkit"
    name="/root/link" pid=1655 comm="ln" requested_mask="l" denied_mask="l"
    fsuid=1000000 ouid=0 [<== should be 1000000] target="/root/chain"
    
    Fix by mapping inode uid of old_dentry in aa_path_link() rather than
    using it directly, similarly to how it's mapped in __file_path_perm()
    later in the file.
    
    Signed-off-by: Gabriel Totev <gabriel.totev@zetier.com>
    Signed-off-by: John Johansen <john.johansen@canonical.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

apparmor: use the condition in AA_BUG_FMT even with debug disabled [+ + +]

Author: Mateusz Guzik <mjguzik@gmail.com>
Date:   Mon Jan 27 21:54:04 2025 +0100

    apparmor: use the condition in AA_BUG_FMT even with debug disabled
    
    [ Upstream commit 67e370aa7f968f6a4f3573ed61a77b36d1b26475 ]
    
    This follows the established practice and fixes a build failure for me:
    security/apparmor/file.c: In function ‘__file_sock_perm’:
    security/apparmor/file.c:544:24: error: unused variable ‘sock’ [-Werror=unused-variable]
      544 |         struct socket *sock = (struct socket *) file->private_data;
          |                        ^~~~
    
    Signed-off-by: Mateusz Guzik <mjguzik@gmail.com>
    Signed-off-by: John Johansen <john.johansen@canonical.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

arm64: dts: ti: k3-j722s-evm: Fix USB gpio-hog level for Type-C [+ + +]

Author: Siddharth Vadapalli <s-vadapalli@ti.com>
Date:   Mon Jun 23 15:36:57 2025 +0530

    arm64: dts: ti: k3-j722s-evm: Fix USB gpio-hog level for Type-C
    
    commit 65ba2a6e77e9e5c843a591055789050e77b5c65e upstream.
    
    According to the "GPIO Expander Map / Table" section of the J722S EVM
    Schematic within the Evaluation Module Design Files package [0], the
    GPIO Pin P05 located on the GPIO Expander 1 (I2C0/0x23) has to be pulled
    down to select the Type-C interface. Since commit under Fixes claims to
    enable the Type-C interface, update the property within "p05-hog" from
    "output-high" to "output-low", thereby switching from the Type-A
    interface to the Type-C interface.
    
    [0]: https://www.ti.com/lit/zip/sprr495
    
    Cc: stable@vger.kernel.org
    Fixes: 485705df5d5f ("arm64: dts: ti: k3-j722s: Enable PCIe and USB support on J722S-EVM")
    Signed-off-by: Siddharth Vadapalli <s-vadapalli@ti.com>
    Link: https://lore.kernel.org/r/20250623100657.4082031-1-s-vadapalli@ti.com
    Signed-off-by: Vignesh Raghavendra <vigneshr@ti.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: Handle KCOV __init vs inline mismatches [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Wed Jul 23 22:50:25 2025 -0700

    arm64: Handle KCOV __init vs inline mismatches
    
    [ Upstream commit 65c430906efffee9bd7551d474f01a6b1197df90 ]
    
    GCC appears to have kind of fragile inlining heuristics, in the
    sense that it can change whether or not it inlines something based on
    optimizations. It looks like the kcov instrumentation being added (or in
    this case, removed) from a function changes the optimization results,
    and some functions marked "inline" are _not_ inlined. In that case,
    we end up with __init code calling a function not marked __init, and we
    get the build warnings I'm trying to eliminate in the coming patch that
    adds __no_sanitize_coverage to __init functions:
    
    WARNING: modpost: vmlinux: section mismatch in reference: acpi_get_enable_method+0x1c (section: .text.unlikely) -> acpi_psci_present (section: .init.text)
    
    This problem is somewhat fragile (though using either __always_inline
    or __init will deterministically solve it), but we've tripped over
    this before with GCC and the solution has usually been to just use
    __always_inline and move on.
    
    For arm64 this requires forcing one ACPI function to be inlined with
    __always_inline.
    
    Link: https://lore.kernel.org/r/20250724055029.3623499-1-kees@kernel.org
    Signed-off-by: Kees Cook <kees@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

arm64: Mark kernel as tainted on SAE and SError panic [+ + +]

Author: Breno Leitao <leitao@debian.org>
Date:   Wed Jul 16 02:42:01 2025 -0700

    arm64: Mark kernel as tainted on SAE and SError panic
    
    [ Upstream commit d7ce7e3a84642aadf7c4787f7ec4f58eb163d129 ]
    
    Set TAINT_MACHINE_CHECK when SError or Synchronous External Abort (SEA)
    interrupts trigger a panic to flag potential hardware faults. This
    tainting mechanism aids in debugging and enables correlation of
    hardware-related crashes in large-scale deployments.
    
    This change aligns with similar patches[1] that mark machine check
    events when the system crashes due to hardware errors.
    
    Link: https://lore.kernel.org/all/20250702-add_tain-v1-1-9187b10914b9@debian.org/ [1]
    Signed-off-by: Breno Leitao <leitao@debian.org>
    Acked-by: Mark Rutland <mark.rutland@arm.com>
    Link: https://lore.kernel.org/r/20250716-vmcore_hw_error-v2-1-f187f7d62aba@debian.org
    Signed-off-by: Will Deacon <will@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

arm64: stacktrace: Check kretprobe_find_ret_addr() return value [+ + +]

Author: Mark Rutland <mark.rutland@arm.com>
Date:   Wed May 21 12:09:59 2025 +0100

    arm64: stacktrace: Check kretprobe_find_ret_addr() return value
    
    [ Upstream commit beecfd6a88a675e20987e70ec532ba734b230fa4 ]
    
    If kretprobe_find_ret_addr() fails to find the original return address,
    it returns 0. Check for this case so that a reliable stacktrace won't
    silently ignore it.
    
    Signed-off-by: Mark Rutland <mark.rutland@arm.com>
    Cc: Andrea della Porta <andrea.porta@suse.com>
    Cc: Breno Leitao <leitao@debian.org>
    Cc: Josh Poimboeuf <jpoimboe@kernel.org>
    Cc: Miroslav Benes <mbenes@suse.cz>
    Cc: Petr Mladek <pmladek@suse.com>
    Cc: Song Liu <song@kernel.org>
    Cc: Will Deacon <will@kernel.org>
    Reviewed-and-tested-by: Song Liu <song@kernel.org>
    Link: https://lore.kernel.org/r/20250521111000.2237470-2-mark.rutland@arm.com
    Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ARM: rockchip: fix kernel hang during smp initialization [+ + +]

Author: Alexander Kochetkov <al.kochet@gmail.com>
Date:   Thu Jul 3 17:04:53 2025 +0300

    ARM: rockchip: fix kernel hang during smp initialization
    
    [ Upstream commit 7cdb433bb44cdc87dc5260cdf15bf03cc1cd1814 ]
    
    In order to bring up secondary CPUs main CPU write trampoline
    code to SRAM. The trampoline code is written while secondary
    CPUs are powered on (at least that true for RK3188 CPU).
    Sometimes that leads to kernel hang. Probably because secondary
    CPU execute trampoline code while kernel doesn't expect.
    
    The patch moves SRAM initialization step to the point where all
    secondary CPUs are powered down.
    
    That fixes rarely hangs on RK3188:
    [    0.091568] CPU0: thread -1, cpu 0, socket 0, mpidr 80000000
    [    0.091996] rockchip_smp_prepare_cpus: ncores 4
    
    Signed-off-by: Alexander Kochetkov <al.kochet@gmail.com>
    Link: https://lore.kernel.org/r/20250703140453.1273027-1-al.kochet@gmail.com
    Signed-off-by: Heiko Stuebner <heiko@sntech.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ARM: tegra: Use I/O memcpy to write to IRAM [+ + +]

Author: Aaron Kling <webgeek1234@gmail.com>
Date:   Thu May 22 11:11:24 2025 -0500

    ARM: tegra: Use I/O memcpy to write to IRAM
    
    [ Upstream commit 398e67e0f5ae04b29bcc9cbf342e339fe9d3f6f1 ]
    
    Kasan crashes the kernel trying to check boundaries when using the
    normal memcpy.
    
    Signed-off-by: Aaron Kling <webgeek1234@gmail.com>
    Link: https://lore.kernel.org/r/20250522-mach-tegra-kasan-v1-1-419041b8addb@gmail.com
    Signed-off-by: Thierry Reding <treding@nvidia.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: codecs: rt5640: Retry DEVICE_ID verification [+ + +]

Author: Xinxin Wan <xinxin.wan@intel.com>
Date:   Fri May 30 16:21:19 2025 +0200

    ASoC: codecs: rt5640: Retry DEVICE_ID verification
    
    [ Upstream commit 19f971057b2d7b99c80530ec1052b45de236a8da ]
    
    To be more resilient to codec-detection failures when the hardware
    powers on slowly, add retry mechanism to the device verification check.
    Similar pattern is found throughout a number of Realtek codecs. Our
    tests show that 60ms delay is sufficient to address readiness issues on
    rt5640 chip.
    
    Reviewed-by: Amadeusz Sławiński <amadeuszx.slawinski@linux.intel.com>
    Reviewed-by: Cezary Rojewski <cezary.rojewski@intel.com>
    Signed-off-by: Xinxin Wan <xinxin.wan@intel.com>
    Signed-off-by: Cezary Rojewski <cezary.rojewski@intel.com>
    Link: https://patch.msgid.link/20250530142120.2944095-3-cezary.rojewski@intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: core: Check for rtd == NULL in snd_soc_remove_pcm_runtime() [+ + +]

Author: Peter Ujfalusi <peter.ujfalusi@linux.intel.com>
Date:   Thu Jun 19 11:42:20 2025 +0300

    ASoC: core: Check for rtd == NULL in snd_soc_remove_pcm_runtime()
    
    [ Upstream commit 2d91cb261cac6d885954b8f5da28b5c176c18131 ]
    
    snd_soc_remove_pcm_runtime() might be called with rtd == NULL which will
    leads to null pointer dereference.
    This was reproduced with topology loading and marking a link as ignore
    due to missing hardware component on the system.
    On module removal the soc_tplg_remove_link() would call
    snd_soc_remove_pcm_runtime() with rtd == NULL since the link was ignored,
    no runtime was created.
    
    Signed-off-by: Peter Ujfalusi <peter.ujfalusi@linux.intel.com>
    Reviewed-by: Bard Liao <yung-chuan.liao@linux.intel.com>
    Reviewed-by: Ranjani Sridharan <ranjani.sridharan@linux.intel.com>
    Reviewed-by: Liam Girdwood <liam.r.girdwood@intel.com>
    Reviewed-by: Kai Vehmanen <kai.vehmanen@linux.intel.com>
    Link: https://patch.msgid.link/20250619084222.559-3-peter.ujfalusi@linux.intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: fsl_sai: replace regmap_write with regmap_update_bits [+ + +]

Author: Shengjiu Wang <shengjiu.wang@nxp.com>
Date:   Thu Aug 7 10:03:18 2025 +0800

    ASoC: fsl_sai: replace regmap_write with regmap_update_bits
    
    [ Upstream commit 0e270f32975fd21874185ba53653630dd40bf560 ]
    
    Use the regmap_write() for software reset in fsl_sai_config_disable would
    cause the FSL_SAI_CSR_BCE bit to be cleared. Refer to
    commit 197c53c8ecb34 ("ASoC: fsl_sai: Don't disable bitclock for i.MX8MP")
    FSL_SAI_CSR_BCE should not be cleared. So need to use regmap_update_bits()
    instead of regmap_write() for these bit operations.
    
    Fixes: dc78f7e59169d ("ASoC: fsl_sai: Force a software reset when starting in consumer mode")
    Signed-off-by: Shengjiu Wang <shengjiu.wang@nxp.com>
    Link: https://patch.msgid.link/20250807020318.2143219-1-shengjiu.wang@nxp.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: hdac_hdmi: Rate limit logging on connection and disconnection [+ + +]

Author: Mark Brown <broonie@kernel.org>
Date:   Fri Jun 13 17:41:04 2025 +0100

    ASoC: hdac_hdmi: Rate limit logging on connection and disconnection
    
    [ Upstream commit c4ca928a6db1593802cd945f075a7e21dd0430c1 ]
    
    We currently log parse failures for ELD data and some disconnection events
    as errors without rate limiting. These log messages can be triggered very
    frequently in some situations, especially ELD parsing when there is nothing
    connected to a HDMI port which will generate:
    
    hdmi-audio-codec hdmi-audio-codec.1.auto: HDMI: Unknown ELD version 0
    
    While there's doubtless work that could be done on reducing the number of
    connection notification callbacks it's possible these may be legitimately
    generated by poor quality physical connections so let's use rate limiting
    to mitigate the log spam for the parse errors and lower the severity for
    disconnect logging to debug level.
    
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Link: https://patch.msgid.link/20250613-asoc-hdmi-eld-logging-v1-1-76d64154d969@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: Intel: avs: Fix uninitialized pointer error in probe() [+ + +]

Author: Cezary Rojewski <cezary.rojewski@intel.com>
Date:   Wed Jul 30 14:49:06 2025 +0200

    ASoC: Intel: avs: Fix uninitialized pointer error in probe()
    
    [ Upstream commit 11f74f48c14c1f4fe16541900ea5944c42e30ccf ]
    
    If pcim_request_all_regions() fails, error path operates on
    uninitialized 'bus' pointer. Found out by Coverity static analyzer.
    
    Reviewed-by: Amadeusz Sławiński <amadeuszx.slawinski@linux.intel.com>
    Signed-off-by: Cezary Rojewski <cezary.rojewski@intel.com>
    Link: https://patch.msgid.link/20250730124906.351798-1-cezary.rojewski@intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: Intel: sof_sdw: Add quirk for Alienware Area 51 (2025) 0CCC SKU [+ + +]

Author: Peter Jakubek <peterjakubek@gmail.com>
Date:   Thu Jul 31 18:21:04 2025 +0100

    ASoC: Intel: sof_sdw: Add quirk for Alienware Area 51 (2025) 0CCC SKU
    
    [ Upstream commit 1b03391d073dad748636a1ad9668b837cce58265 ]
    
    Add DMI quirk entry for Alienware systems with SKU "0CCC" to enable
    proper speaker codec configuration (SOC_SDW_CODEC_SPKR).
    
    This system requires the same audio configuration as some existing Dell systems.
    Without this patch, the laptop's speakers and microphone will not work.
    
    Signed-off-by: Peter Jakubek <peterjakubek@gmail.com>
    Link: https://patch.msgid.link/20250731172104.2009007-1-peterjakubek@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: qcom: use drvdata instead of component to keep id [+ + +]

Author: Srinivas Kandagatla <srini@kernel.org>
Date:   Wed Jun 4 02:06:48 2025 +0000

    ASoC: qcom: use drvdata instead of component to keep id
    
    [ Upstream commit 8167f4f42572818fa8153be2b03e4c2120846603 ]
    
    Qcom lpass is using component->id to keep DAI ID (A).
    
    (S)     static int lpass_platform_pcmops_open(
                                    sruct snd_soc_component *component,
                                    struct snd_pcm_substream *substream)
            {                                                 ^^^^^^^^^(B0)
                    ...
    (B1)            struct snd_soc_pcm_runtime *soc_runtime = snd_soc_substream_to_rtd(substream);
    (B2)            struct snd_soc_dai *cpu_dai = snd_soc_rtd_to_cpu(soc_runtime, 0);
                    ...
    (B3)            unsigned int dai_id = cpu_dai->driver->id;
    
    (A)             component->id = dai_id;
                    ...
            }
    
    This driver can get dai_id from substream (B0 - B3).
    In this driver, below functions get dai_id from component->id (A).
    
    (X)     lpass_platform_pcmops_suspend()
    (Y)     lpass_platform_pcmops_resume()
    (Z)     lpass_platform_copy()
    
    Here, (Z) can get it from substream (B0 - B3), don't need to use
    component->id (A). On suspend/resume (X)(Y), dai_id can only be obtained
    from component->id (A), because there is no substream (B0) in function
    parameter.
    
    But, component->id (A) itself should not be used for such purpose.
    It is intilialized at snd_soc_component_initialize(), and parsed its ID
    (= component->id) from device name (a).
    
            int snd_soc_component_initialize(...)
            {
                    ...
                    if (!component->name) {
    (a)                     component->name = fmt_single_name(dev, &component->id);
                            ...                                     ^^^^^^^^^^^^^
                    }
                    ...
            }
    
    Unfortunately, current code is broken to start with.
    
    There are many regmaps that the driver cares about, however its only
    managing one (either dp or i2s) in component suspend/resume path.
    
    I2S regmap is mandatory however other regmaps are setup based on flags
    like "hdmi_port_enable" and "codec_dma_enable".
    
    Correct thing for suspend/resume path to handle is by checking these
    flags, instead of using component->id.
    
    Signed-off-by: Srinivas Kandagatla <srini@kernel.org>
    Suggested-by: Kuninori Morimoto <kuninori.morimoto.gx@renesas.com>
    Signed-off-by: Kuninori Morimoto <kuninori.morimoto.gx@renesas.com>
    Link: https://patch.msgid.link/87a56ouuob.wl-kuninori.morimoto.gx@renesas.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: SDCA: Add flag for unused IRQs [+ + +]

Author: Charles Keepax <ckeepax@opensource.cirrus.com>
Date:   Tue Jun 24 13:28:42 2025 +0100

    ASoC: SDCA: Add flag for unused IRQs
    
    [ Upstream commit 775f5729b47d8737f4f98e0141f61b3358245398 ]
    
    Zero is a valid SDCA IRQ interrupt position so add a special value to
    indicate that the IRQ is not used.
    
    Reviewed-by: Bard Liao <yung-chuan.liao@linux.intel.com>
    Signed-off-by: Charles Keepax <ckeepax@opensource.cirrus.com>
    Reviewed-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.dev>
    Link: https://patch.msgid.link/20250624122844.2761627-6-ckeepax@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: soc-dapm: set bias_level if snd_soc_dapm_set_bias_level() was successed [+ + +]

Author: Kuninori Morimoto <kuninori.morimoto.gx@renesas.com>
Date:   Fri Jul 11 02:26:39 2025 +0000

    ASoC: soc-dapm: set bias_level if snd_soc_dapm_set_bias_level() was successed
    
    [ Upstream commit f40ecc2743652c0b0f19935f81baf57c601eb7f0 ]
    
    ASoC has 2 functions to set bias level.
            (A) snd_soc_dapm_force_bias_level()
            (B) snd_soc_dapm_set_bias_level()
    
    snd_soc_dapm_force_bias_level() (A) will set dapm->bias_level (a) if
    successed.
    
    (A)     int snd_soc_dapm_force_bias_level(...)
            {
                    ...
                    if (ret == 0)
    (a)                     dapm->bias_level = level;
                    ...
            }
    
    snd_soc_dapm_set_bias_level() (B) is also a function that sets bias_level.
    It will call snd_soc_dapm_force_bias_level() (A) inside, but doesn't
    set dapm->bias_level by itself. One note is that (A) might not be called.
    
    (B)     static int snd_soc_dapm_set_bias_level(...)
            {
                    ...
                    ret = snd_soc_card_set_bias_level(...);
                    ...
                    if (dapm != &card->dapm)
    (A)                     ret = snd_soc_dapm_force_bias_level(...);
                    ...
                    ret = snd_soc_card_set_bias_level_post(...);
                    ...
            }
    
    dapm->bias_level will be set if (A) was called, but might not be set
    if (B) was called, even though it calles set_bias_level() function.
    
    We should set dapm->bias_level if we calls
    snd_soc_dapm_set_bias_level() (B), too.
    
    Signed-off-by: Kuninori Morimoto <kuninori.morimoto.gx@renesas.com>
    Link: https://patch.msgid.link/87qzyn4g4h.wl-kuninori.morimoto.gx@renesas.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: SOF: topology: Parse the dapm_widget_tokens in case of DSPless mode [+ + +]

Author: Peter Ujfalusi <peter.ujfalusi@linux.intel.com>
Date:   Thu Jun 19 13:26:40 2025 +0300

    ASoC: SOF: topology: Parse the dapm_widget_tokens in case of DSPless mode
    
    [ Upstream commit 6b3cb7f4341cbf62d41ccf6ea906dbe66be8aa3d ]
    
    Parsing the dapm_widget_tokens is also needed for DSPless mode as it is
    setting the snd_soc_dapm_widget.no_wname_in_kcontrol_name flag for the
    kcontrol creation from DAPM widgets.
    Without that flag set, the following warnings might appear because of long
    control names:
    ALSA: Control name 'eqiir.2.1 Post Mixer Analog Playback IIR Eq bytes' truncated to 'eqiir.2.1 Post Mixer Analog Playback IIR Eq'
    ALSA: Control name 'eqfir.2.1 Post Mixer Analog Playback FIR Eq bytes' truncated to 'eqfir.2.1 Post Mixer Analog Playback FIR Eq'
    ALSA: Control name 'drc.2.1 Post Mixer Analog Playback DRC bytes' truncated to 'drc.2.1 Post Mixer Analog Playback DRC byte'
    ALSA: Control name 'drc.2.1 Post Mixer Analog Playback DRC switch' truncated to 'drc.2.1 Post Mixer Analog Playback DRC swit'
    ALSA: Control name 'gain.15.1 Pre Mixer Deepbuffer HDA Analog Volume' truncated to 'gain.15.1 Pre Mixer Deepbuffer HDA Analog V'
    
    Signed-off-by: Peter Ujfalusi <peter.ujfalusi@linux.intel.com>
    Reviewed-by: Liam Girdwood <liam.r.girdwood@intel.com>
    Reviewed-by: Ranjani Sridharan <ranjani.sridharan@linux.intel.com>
    Link: https://patch.msgid.link/20250619102640.12068-1-peter.ujfalusi@linux.intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ata: ahci: Disable DIPM if host lacks support [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Tue Jul 1 21:53:17 2025 +0900

    ata: ahci: Disable DIPM if host lacks support
    
    [ Upstream commit f7870e8d345cdabfb94bcbdcba6a07e050f8705e ]
    
    The AHCI specification version 1.3.1 section 8.3.1.4 (Software
    Requirements and Precedence) states that:
    
    If CAP.SSC or CAP.PSC is cleared to ‘0’, software should disable
    device-initiated power management by issuing the appropriate SET
    FEATURES command to the device.
    
    To satisfy this constraint and force ata_dev_configure to disable the
    device DIPM feature, modify ahci_update_initial_lpm_policy() to set the
    ATA_FLAG_NO_DIPM flag on ports that have a host with either the
    ATA_HOST_NO_PART flag set or the ATA_HOST_NO_SSC flag set.
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Niklas Cassel <cassel@kernel.org>
    Reviewed-by: Hannes Reinecke <hare@suse.de>
    Reviewed-by: Hannes Reinecke <hare@suse.de.>
    Link: https://lore.kernel.org/r/20250701125321.69496-7-dlemoal@kernel.org
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ata: ahci: Disallow LPM policy control if not supported [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Tue Jul 1 21:53:19 2025 +0900

    ata: ahci: Disallow LPM policy control if not supported
    
    [ Upstream commit 65b2c92f69d3df81422d27e5be012e357e733241 ]
    
    Commit fa997b0576c9 ("ata: ahci: Do not enable LPM if no LPM states are
    supported by the HBA") introduced an early return in
    ahci_update_initial_lpm_policy() to ensure that the target_lpm_policy
    of ports belonging to a host that does not support the Partial, Slumber
    and DevSleep power states is unchanged and remains set to
    ATA_LPM_UNKNOWN and thus prevents the execution of
    ata_eh_link_set_lpm().
    
    However, a user or a system daemon (e.g. systemd-udevd) may still
    attempt changing the LPM policy through the sysfs
    link_power_management_policy of the host.
    
    Improve this to prevent sysfs LPM policy changes by setting the flag
    ATA_FLAG_NO_LPM for the port of such host, and initialize the port
    target_lpm_policy to ATA_LPM_MAX_POWER to guarantee that no unsupported
    low power state is being used on the port and its link.
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Hannes Reinecke <hare@suse.de>
    Reviewed-by: Niklas Cassel <cassel@kernel.org>
    Link: https://lore.kernel.org/r/20250701125321.69496-9-dlemoal@kernel.org
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ata: libata-sata: Add link_power_management_supported sysfs attribute [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Mon Jul 28 13:04:29 2025 +0900

    ata: libata-sata: Add link_power_management_supported sysfs attribute
    
    commit 0060beec0bfa647c4b510df188b1c4673a197839 upstream.
    
    A port link power management (LPM) policy can be controlled using the
    link_power_management_policy sysfs host attribute. However, this
    attribute exists also for hosts that do not support LPM and in such
    case, attempting to change the LPM policy for the host (port) will fail
    with -EOPNOTSUPP.
    
    Introduce the new sysfs link_power_management_supported host attribute
    to indicate to the user if a the port and the devices connected to the
    port for the host support LPM, which implies that the
    link_power_management_policy attribute can be used.
    
    Since checking that a port and its devices support LPM is common between
    the new ata_scsi_lpm_supported_show() function and the existing
    ata_scsi_lpm_store() function, the new helper ata_scsi_lpm_supported()
    is introduced.
    
    Fixes: 413e800cadbf ("ata: libata-sata: Disallow changing LPM state if not supported")
    Reported-by: Borah, Chaitanya Kumar <chaitanya.kumar.borah@intel.com>
    Reported-by: kernel test robot <oliver.sang@intel.com>
    Closes: https://lore.kernel.org/oe-lkp/202507251014.a5becc3b-lkp@intel.com
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ata: libata-sata: Disallow changing LPM state if not supported [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Tue Jul 1 21:53:16 2025 +0900

    ata: libata-sata: Disallow changing LPM state if not supported
    
    [ Upstream commit 413e800cadbf67550d76c77c230b2ecd96bce83a ]
    
    Modify ata_scsi_lpm_store() to return an error if a user attempts to set
    a link power management policy for a port that does not support LPM,
    that is, ports flagged with ATA_FLAG_NO_LPM.
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Niklas Cassel <cassel@kernel.org>
    Reviewed-by: Hannes Reinecke <hare@suse.de>
    Link: https://lore.kernel.org/r/20250701125321.69496-6-dlemoal@kernel.org
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

be2net: Use correct byte order and format string for TCP seq and ack_seq [+ + +]

Author: Alok Tiwari <alok.a.tiwari@oracle.com>
Date:   Thu Jul 17 12:35:47 2025 -0700

    be2net: Use correct byte order and format string for TCP seq and ack_seq
    
    [ Upstream commit 4701ee5044fb3992f1c910630a9673c2dc600ce5 ]
    
    The TCP header fields seq and ack_seq are 32-bit values in network
    byte order as (__be32). these fields were earlier printed using
    ntohs(), which converts only 16-bit values and produces incorrect
    results for 32-bit fields. This patch is changeing the conversion
    to ntohl(), ensuring correct interpretation of these sequence numbers.
    
    Notably, the format specifier is updated from %d to %u to reflect the
    unsigned nature of these fields.
    
    improves the accuracy of debug log messages for TCP sequence and
    acknowledgment numbers during TX timeouts.
    
    Signed-off-by: Alok Tiwari <alok.a.tiwari@oracle.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250717193552.3648791-1-alok.a.tiwari@oracle.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Linux: better lockdep annotations for simple_recursive_removal() [+ + +]

Author: Al Viro <viro@zeniv.linux.org.uk>
Date:   Wed Jul 2 22:30:32 2025 -0400

    better lockdep annotations for simple_recursive_removal()
    
    [ Upstream commit 2a8061ee5e41034eb14170ec4517b5583dbeff9f ]
    
    We want a class that nests outside of I_MUTEX_NORMAL (for the sake of
    callbacks that might want to lock the victim) and inside I_MUTEX_PARENT
    (so that a variant of that could be used with parent of the victim
    held locked by the caller).
    
    In reality, simple_recursive_removal()
            * never holds two locks at once
            * holds the lock on parent of dentry passed to callback
            * is used only on the trees with fixed topology, so the depths
    are not changing.
    
    So the locking order is actually fine.
    
    AFAICS, the best solution is to assign I_MUTEX_CHILD to the locks
    grabbed by that thing.
    
    Reported-by: syzbot+169de184e9defe7fe709@syzkaller.appspotmail.com
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

binder: Fix selftest page indexing [+ + +]

Author: Tiffany Yang <ynaffit@google.com>
Date:   Mon Jul 14 11:53:14 2025 -0700

    binder: Fix selftest page indexing
    
    [ Upstream commit bea3e7bfa2957d986683543cbf57092715f9a91b ]
    
    The binder allocator selftest was only checking the last page of buffers
    that ended on a page boundary. Correct the page indexing to account for
    buffers that are not page-aligned.
    
    Signed-off-by: Tiffany Yang <ynaffit@google.com>
    Acked-by: Carlos Llamas <cmllamas@google.com>
    Link: https://lore.kernel.org/r/20250714185321.2417234-2-ynaffit@google.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: avoid possible overflow for chunk_sectors check in blk_stack_limits() [+ + +]

Author: John Garry <john.g.garry@oracle.com>
Date:   Tue Jul 29 09:14:47 2025 +0000

    block: avoid possible overflow for chunk_sectors check in blk_stack_limits()
    
    [ Upstream commit 448dfecc7ff807822ecd47a5c052acedca7d09e8 ]
    
    In blk_stack_limits(), we check that the t->chunk_sectors value is a
    multiple of the t->physical_block_size value.
    
    However, by finding the chunk_sectors value in bytes, we may overflow
    the unsigned int which holds chunk_sectors, so change the check to be
    based on sectors.
    
    Reviewed-by: Hannes Reinecke <hare@suse.de>
    Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: John Garry <john.g.garry@oracle.com>
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Link: https://lore.kernel.org/r/20250729091448.1691334-2-john.g.garry@oracle.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: fix kobject double initialization in add_disk [+ + +]

Author: Zheng Qixing <zhengqixing@huawei.com>
Date:   Fri Aug 8 13:36:09 2025 +0800

    block: fix kobject double initialization in add_disk
    
    [ Upstream commit 343dc5423bfe876c12bb80c56f5e44286e442a07 ]
    
    Device-mapper can call add_disk() multiple times for the same gendisk
    due to its two-phase creation process (dm create + dm load). This leads
    to kobject double initialization errors when the underlying iSCSI devices
    become temporarily unavailable and then reappear.
    
    However, if the first add_disk() call fails and is retried, the queue_kobj
    gets initialized twice, causing:
    
    kobject: kobject (ffff88810c27bb90): tried to init an initialized object,
    something is seriously wrong.
     Call Trace:
      <TASK>
      dump_stack_lvl+0x5b/0x80
      kobject_init.cold+0x43/0x51
      blk_register_queue+0x46/0x280
      add_disk_fwnode+0xb5/0x280
      dm_setup_md_queue+0x194/0x1c0
      table_load+0x297/0x2d0
      ctl_ioctl+0x2a2/0x480
      dm_ctl_ioctl+0xe/0x20
      __x64_sys_ioctl+0xc7/0x110
      do_syscall_64+0x72/0x390
      entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    Fix this by separating kobject initialization from sysfs registration:
     - Initialize queue_kobj early during gendisk allocation
     - add_disk() only adds the already-initialized kobject to sysfs
     - del_gendisk() removes from sysfs but doesn't destroy the kobject
     - Final cleanup happens when the disk is released
    
    Fixes: 2bd85221a625 ("block: untangle request_queue refcounting from sysfs")
    Reported-by: Li Lingfeng <lilingfeng3@huawei.com>
    Closes: https://lore.kernel.org/all/83591d0b-2467-433c-bce0-5581298eb161@huawei.com/
    Signed-off-by: Zheng Qixing <zhengqixing@huawei.com>
    Reviewed-by: Ming Lei <ming.lei@redhat.com>
    Reviewed-by: Yu Kuai <yukuai3@huawei.com>
    Reviewed-by: Nilay Shroff <nilay@linux.ibm.com>
    Link: https://lore.kernel.org/r/20250808053609.3237836-1-zhengqixing@huaweicloud.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: Introduce bio_needs_zone_write_plugging() [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Wed Jun 25 18:33:24 2025 +0900

    block: Introduce bio_needs_zone_write_plugging()
    
    commit f70291411ba20d50008db90a6f0731efac27872c upstream.
    
    In preparation for fixing device mapper zone write handling, introduce
    the inline helper function bio_needs_zone_write_plugging() to test if a
    BIO requires handling through zone write plugging using the function
    blk_zone_plug_bio(). This function returns true for any write
    (op_is_write(bio) == true) operation directed at a zoned block device
    using zone write plugging, that is, a block device with a disk that has
    a zone write plug hash table.
    
    This helper allows simplifying the check on entry to blk_zone_plug_bio()
    and used in to protect calls to it for blk-mq devices and DM devices.
    
    Fixes: f211268ed1f9 ("dm: Use the block layer zone append emulation")
    Cc: stable@vger.kernel.org
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://lore.kernel.org/r/20250625093327.548866-3-dlemoal@kernel.org
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

block: Make REQ_OP_ZONE_FINISH a write operation [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Wed Jun 25 18:33:23 2025 +0900

    block: Make REQ_OP_ZONE_FINISH a write operation
    
    commit 3f66ccbaaef3a0c5bd844eab04e3207b4061c546 upstream.
    
    REQ_OP_ZONE_FINISH is defined as "12", which makes
    op_is_write(REQ_OP_ZONE_FINISH) return false, despite the fact that a
    zone finish operation is an operation that modifies a zone (transition
    it to full) and so should be considered as a write operation (albeit
    one that does not transfer any data to the device).
    
    Fix this by redefining REQ_OP_ZONE_FINISH to be an odd number (13), and
    redefine REQ_OP_ZONE_RESET and REQ_OP_ZONE_RESET_ALL using sequential
    odd numbers from that new value.
    
    Fixes: 6c1b1da58f8c ("block: add zone open, close and finish operations")
    Cc: stable@vger.kernel.org
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Bart Van Assche <bvanassche@acm.org>
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://lore.kernel.org/r/20250625093327.548866-2-dlemoal@kernel.org
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: btusb: Add new VID/PID 0489/e14e for MT7925 [+ + +]

Author: En-Wei Wu <en-wei.wu@canonical.com>
Date:   Wed Jul 9 14:36:06 2025 +0800

    Bluetooth: btusb: Add new VID/PID 0489/e14e for MT7925
    
    [ Upstream commit 942873c8137fe0015ab37f62f159d88079859c5e ]
    
    Add VID 0489 & PID e14e for MediaTek MT7925 USB Bluetooth chip.
    
    The information in /sys/kernel/debug/usb/devices about the Bluetooth
    device is listed as the below.
    
    T:  Bus=01 Lev=01 Prnt=01 Port=03 Cnt=03 Dev#=  4 Spd=480  MxCh= 0
    D:  Ver= 2.10 Cls=ef(misc ) Sub=02 Prot=01 MxPS=64 #Cfgs=  1
    P:  Vendor=0489 ProdID=e14e Rev= 1.00
    S:  Manufacturer=MediaTek Inc.
    S:  Product=Wireless_Device
    S:  SerialNumber=000000000
    C:* #Ifs= 3 Cfg#= 1 Atr=e0 MxPwr=100mA
    A:  FirstIf#= 0 IfCount= 3 Cls=e0(wlcon) Sub=01 Prot=01
    I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=125us
    E:  Ad=82(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    I:  If#= 1 Alt= 6 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    I:  If#= 2 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=8a(I) Atr=03(Int.) MxPS=  64 Ivl=125us
    E:  Ad=0a(O) Atr=03(Int.) MxPS=  64 Ivl=125us
    I:* If#= 2 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=8a(I) Atr=03(Int.) MxPS= 512 Ivl=125us
    E:  Ad=0a(O) Atr=03(Int.) MxPS= 512 Ivl=125us
    
    Signed-off-by: En-Wei Wu <en-wei.wu@canonical.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: btusb: Add support for variant of RTL8851BE (USB ID 13d3:3601) [+ + +]

Author: Uwe Kleine-König <ukleinek@debian.org>
Date:   Thu May 29 18:23:32 2025 +0200

    Bluetooth: btusb: Add support for variant of RTL8851BE (USB ID 13d3:3601)
    
    [ Upstream commit 65b0dca6f9f2c912a77a6ad6cf56f60a895a496b ]
    
    Teach the btusb driver to recognize another variant of the RTL8851BE
    bluetooth radio.
    
    /sys/kernel/debug/usb/devices reports for that device:
    
            T:  Bus=03 Lev=01 Prnt=01 Port=02 Cnt=01 Dev#=  2 Spd=12   MxCh= 0
            D:  Ver= 1.00 Cls=e0(wlcon) Sub=01 Prot=01 MxPS=64 #Cfgs=  1
            P:  Vendor=13d3 ProdID=3601 Rev= 0.00
            S:  Manufacturer=Realtek
            S:  Product=Bluetooth Radio
            S:  SerialNumber=00e04c000001
            C:* #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=500mA
            I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=1ms
            E:  Ad=02(O) Atr=02(Bulk) MxPS=  64 Ivl=0ms
            E:  Ad=82(I) Atr=02(Bulk) MxPS=  64 Ivl=0ms
            I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
            I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
            I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
            I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
            I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
            I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
            I:  If#= 1 Alt= 6 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
            E:  Ad=03(O) Atr=01(Isoc) MxPS=  63 Ivl=1ms
            E:  Ad=83(I) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    
    Reported-by: shdeb <shdeb3000000@gmail.com>
    Link: https://bugs.debian.org/1106386
    Signed-off-by: Uwe Kleine-König <ukleinek@debian.org>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: hci_event: Add support for handling LE BIG Sync Lost event [+ + +]

Author: Yang Li <yang.li@amlogic.com>
Date:   Tue Jul 1 15:56:22 2025 +0800

    Bluetooth: hci_event: Add support for handling LE BIG Sync Lost event
    
    [ Upstream commit b2a5f2e1c127cb431df22e114998ff72eb4578c8 ]
    
    When the BIS source stops, the controller sends an LE BIG Sync Lost
    event (subevent 0x1E). Currently, this event is not handled, causing
    the BIS stream to remain active in BlueZ and preventing recovery.
    
    Signed-off-by: Yang Li <yang.li@amlogic.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: hci_sock: Reset cookie to zero in hci_sock_free_cookie() [+ + +]

Author: Zijun Hu <zijun.hu@oss.qualcomm.com>
Date:   Mon Jun 23 20:31:16 2025 +0800

    Bluetooth: hci_sock: Reset cookie to zero in hci_sock_free_cookie()
    
    [ Upstream commit 4d7936e8a5b1fa803f4a631d2da4a80fa4f0f37f ]
    
    Reset cookie value to 0 instead of 0xffffffff in hci_sock_free_cookie()
    since:
    0         :  means cookie has not been assigned yet
    0xffffffff:  means cookie assignment failure
    
    Also fix generating cookie failure with usage shown below:
    hci_sock_gen_cookie(sk)   // generate cookie
    hci_sock_free_cookie(sk)  // free cookie
    hci_sock_gen_cookie(sk)   // Can't generate cookie any more
    
    Signed-off-by: Zijun Hu <zijun.hu@oss.qualcomm.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bnxt: fill data page pool with frags if PAGE_SIZE > BNXT_RX_PAGE_SIZE [+ + +]

Author: David Wei <dw@davidwei.uk>
Date:   Tue Aug 12 11:29:07 2025 -0700

    bnxt: fill data page pool with frags if PAGE_SIZE > BNXT_RX_PAGE_SIZE
    
    [ Upstream commit 39f8fcda2088382a4aa70b258d6f7225aa386f11 ]
    
    The data page pool always fills the HW rx ring with pages. On arm64 with
    64K pages, this will waste _at least_ 32K of memory per entry in the rx
    ring.
    
    Fix by fragmenting the pages if PAGE_SIZE > BNXT_RX_PAGE_SIZE. This
    makes the data page pool the same as the header pool.
    
    Tested with iperf3 with a small (64 entries) rx ring to encourage buffer
    circulation.
    
    Fixes: cd1fafe7da1f ("eth: bnxt: add support rx side device memory TCP")
    Reviewed-by: Michael Chan <michael.chan@broadcom.com>
    Signed-off-by: David Wei <dw@davidwei.uk>
    Link: https://patch.msgid.link/20250812182907.1540755-1-dw@davidwei.uk
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bpf: Forget ranges when refining tnum after JSET [+ + +]

Author: Paul Chaignon <paul.chaignon@gmail.com>
Date:   Thu Jul 10 20:20:53 2025 +0200

    bpf: Forget ranges when refining tnum after JSET
    
    [ Upstream commit 6279846b9b2532e1b04559ef8bd0dec049f29383 ]
    
    Syzbot reported a kernel warning due to a range invariant violation on
    the following BPF program.
    
      0: call bpf_get_netns_cookie
      1: if r0 == 0 goto <exit>
      2: if r0 & Oxffffffff goto <exit>
    
    The issue is on the path where we fall through both jumps.
    
    That path is unreachable at runtime: after insn 1, we know r0 != 0, but
    with the sign extension on the jset, we would only fallthrough insn 2
    if r0 == 0. Unfortunately, is_branch_taken() isn't currently able to
    figure this out, so the verifier walks all branches. The verifier then
    refines the register bounds using the second condition and we end
    up with inconsistent bounds on this unreachable path:
    
      1: if r0 == 0 goto <exit>
        r0: u64=[0x1, 0xffffffffffffffff] var_off=(0, 0xffffffffffffffff)
      2: if r0 & 0xffffffff goto <exit>
        r0 before reg_bounds_sync: u64=[0x1, 0xffffffffffffffff] var_off=(0, 0)
        r0 after reg_bounds_sync:  u64=[0x1, 0] var_off=(0, 0)
    
    Improving the range refinement for JSET to cover all cases is tricky. We
    also don't expect many users to rely on JSET given LLVM doesn't generate
    those instructions. So instead of improving the range refinement for
    JSETs, Eduard suggested we forget the ranges whenever we're narrowing
    tnums after a JSET. This patch implements that approach.
    
    Reported-by: syzbot+c711ce17dd78e5d4fdcf@syzkaller.appspotmail.com
    Suggested-by: Eduard Zingerman <eddyz87@gmail.com>
    Acked-by: Yonghong Song <yonghong.song@linux.dev>
    Acked-by: Eduard Zingerman <eddyz87@gmail.com>
    Signed-off-by: Paul Chaignon <paul.chaignon@gmail.com>
    Link: https://lore.kernel.org/r/9d4fd6432a095d281f815770608fdcd16028ce0b.1752171365.git.paul.chaignon@gmail.com
    Signed-off-by: Alexei Starovoitov <ast@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bpf: Make reg_not_null() true for CONST_PTR_TO_MAP [+ + +]

Author: Ihor Solodrai <isolodrai@meta.com>
Date:   Mon Jun 9 11:30:22 2025 -0700

    bpf: Make reg_not_null() true for CONST_PTR_TO_MAP
    
    [ Upstream commit 5534e58f2e9bd72b253d033ee0af6e68eb8ac96b ]
    
    When reg->type is CONST_PTR_TO_MAP, it can not be null. However the
    verifier explores the branches under rX == 0 in check_cond_jmp_op()
    even if reg->type is CONST_PTR_TO_MAP, because it was not checked for
    in reg_not_null().
    
    Fix this by adding CONST_PTR_TO_MAP to the set of types that are
    considered non nullable in reg_not_null().
    
    An old "unpriv: cmp map pointer with zero" selftest fails with this
    change, because now early out correctly triggers in
    check_cond_jmp_op(), making the verification to pass.
    
    In practice verifier may allow pointer to null comparison in unpriv,
    since in many cases the relevant branch and comparison op are removed
    as dead code. So change the expected test result to __success_unpriv.
    
    Signed-off-by: Ihor Solodrai <isolodrai@meta.com>
    Signed-off-by: Andrii Nakryiko <andrii@kernel.org>
    Acked-by: Andrii Nakryiko <andrii@kernel.org>
    Link: https://lore.kernel.org/bpf/20250609183024.359974-2-isolodrai@meta.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bpftool: Fix JSON writer resource leak in version command [+ + +]

Author: Yuan Chen <chenyuan@kylinos.cn>
Date:   Tue Jun 17 09:24:42 2025 -0400

    bpftool: Fix JSON writer resource leak in version command
    
    [ Upstream commit 85cd83fed8267cde0dd1cea719808aad95ae4de7 ]
    
    When using `bpftool --version -j/-p`, the JSON writer object
    created in do_version() was not properly destroyed after use.
    This caused a memory leak each time the version command was
    executed with JSON output.
    
    Fix: 004b45c0e51a (tools: bpftool: provide JSON output for all possible commands)
    
    Suggested-by: Quentin Monnet <qmo@kernel.org>
    Signed-off-by: Yuan Chen <chenyuan@kylinos.cn>
    Signed-off-by: Andrii Nakryiko <andrii@kernel.org>
    Reviewed-by: Quentin Monnet <qmo@kernel.org>
    Link: https://lore.kernel.org/bpf/20250617132442.9998-1-chenyuan_fl@163.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

btrfs: abort transaction during log replay if walk_log_tree() failed [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Wed May 21 17:41:18 2025 +0100

    btrfs: abort transaction during log replay if walk_log_tree() failed
    
    commit 2a5898c4aac67494c2f0f7fe38373c95c371c930 upstream.
    
    If we failed walking a log tree during replay, we have a missing
    transaction abort to prevent committing a transaction where we didn't
    fully replay all the changes from a log tree and therefore can leave the
    respective subvolume tree in some inconsistent state. So add the missing
    transaction abort.
    
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: clear dirty status from extent buffer on error at insert_new_root() [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Mon Jun 30 10:50:46 2025 +0100

    btrfs: clear dirty status from extent buffer on error at insert_new_root()
    
    commit c0d013495a80cbb53e2288af7ae0ec4170aafd7c upstream.
    
    If we failed to insert the tree mod log operation, we are not removing the
    dirty status from the allocated and dirtied extent buffer before we free
    it. Removing the dirty status is needed for several reasons such as to
    adjust the fs_info->dirty_metadata_bytes counter and remove the dirty
    status from the respective folios. So add the missing call to
    btrfs_clear_buffer_dirty().
    
    Fixes: f61aa7ba08ab ("btrfs: do not BUG_ON() on tree mod log failure at insert_new_root()")
    CC: stable@vger.kernel.org # 6.6+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: do not allow relocation of partially dropped subvolumes [+ + +]

Author: Qu Wenruo <wqu@suse.com>
Date:   Fri Jul 25 20:33:25 2025 +0930

    btrfs: do not allow relocation of partially dropped subvolumes
    
    commit 4289b494ac553e74e86fed1c66b2bf9530bc1082 upstream.
    
    [BUG]
    There is an internal report that balance triggered transaction abort,
    with the following call trace:
    
      item 85 key (594509824 169 0) itemoff 12599 itemsize 33
              extent refs 1 gen 197740 flags 2
              ref#0: tree block backref root 7
      item 86 key (594558976 169 0) itemoff 12566 itemsize 33
              extent refs 1 gen 197522 flags 2
              ref#0: tree block backref root 7
     ...
     BTRFS error (device loop0): extent item not found for insert, bytenr 594526208 num_bytes 16384 parent 449921024 root_objectid 934 owner 1 offset 0
     BTRFS error (device loop0): failed to run delayed ref for logical 594526208 num_bytes 16384 type 182 action 1 ref_mod 1: -117
     ------------[ cut here ]------------
     BTRFS: Transaction aborted (error -117)
     WARNING: CPU: 1 PID: 6963 at ../fs/btrfs/extent-tree.c:2168 btrfs_run_delayed_refs+0xfa/0x110 [btrfs]
    
    And btrfs check doesn't report anything wrong related to the extent
    tree.
    
    [CAUSE]
    The cause is a little complex, firstly the extent tree indeed doesn't
    have the backref for 594526208.
    
    The extent tree only have the following two backrefs around that bytenr
    on-disk:
    
            item 65 key (594509824 METADATA_ITEM 0) itemoff 13880 itemsize 33
                    refs 1 gen 197740 flags TREE_BLOCK
                    tree block skinny level 0
                    (176 0x7) tree block backref root CSUM_TREE
            item 66 key (594558976 METADATA_ITEM 0) itemoff 13847 itemsize 33
                    refs 1 gen 197522 flags TREE_BLOCK
                    tree block skinny level 0
                    (176 0x7) tree block backref root CSUM_TREE
    
    But the such missing backref item is not an corruption on disk, as the
    offending delayed ref belongs to subvolume 934, and that subvolume is
    being dropped:
    
            item 0 key (934 ROOT_ITEM 198229) itemoff 15844 itemsize 439
                    generation 198229 root_dirid 256 bytenr 10741039104 byte_limit 0 bytes_used 345571328
                    last_snapshot 198229 flags 0x1000000000001(RDONLY) refs 0
                    drop_progress key (206324 EXTENT_DATA 2711650304) drop_level 2
                    level 2 generation_v2 198229
    
    And that offending tree block 594526208 is inside the dropped range of
    that subvolume.  That explains why there is no backref item for that
    bytenr and why btrfs check is not reporting anything wrong.
    
    But this also shows another problem, as btrfs will do all the orphan
    subvolume cleanup at a read-write mount.
    
    So half-dropped subvolume should not exist after an RW mount, and
    balance itself is also exclusive to subvolume cleanup, meaning we
    shouldn't hit a subvolume half-dropped during relocation.
    
    The root cause is, there is no orphan item for this subvolume.
    In fact there are 5 subvolumes from around 2021 that have the same
    problem.
    
    It looks like the original report has some older kernels running, and
    caused those zombie subvolumes.
    
    Thankfully upstream commit 8d488a8c7ba2 ("btrfs: fix subvolume/snapshot
    deletion not triggered on mount") has long fixed the bug.
    
    [ENHANCEMENT]
    For repairing such old fs, btrfs-progs will be enhanced.
    
    Considering how delayed the problem will show up (at run delayed ref
    time) and at that time we have to abort transaction already, it is too
    late.
    
    Instead here we reject any half-dropped subvolume for reloc tree at the
    earliest time, preventing confusion and extra time wasted on debugging
    similar bugs.
    
    CC: stable@vger.kernel.org # 5.15+
    Reviewed-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: don't ignore inode missing when replaying log tree [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Fri Jul 11 20:21:28 2025 +0100

    btrfs: don't ignore inode missing when replaying log tree
    
    commit 7ebf381a69421a88265d3c49cd0f007ba7336c9d upstream.
    
    During log replay, at add_inode_ref(), we return -ENOENT if our current
    inode isn't found on the subvolume tree or if a parent directory isn't
    found. The error comes from btrfs_iget_logging() <- btrfs_iget() <-
    btrfs_read_locked_inode().
    
    The single caller of add_inode_ref(), replay_one_buffer(), ignores an
    -ENOENT error because it expects that error to mean only that a parent
    directory wasn't found and that is ok.
    
    Before commit 5f61b961599a ("btrfs: fix inode lookup error handling during
    log replay") we were converting any error when getting a parent directory
    to -ENOENT and any error when getting the current inode to -EIO, so our
    caller would fail log replay in case we can't find the current inode.
    After that commit however in case the current inode is not found we return
    -ENOENT to the caller and therefore it ignores the critical fact that the
    current inode was not found in the subvolume tree.
    
    Fix this by converting -ENOENT to 0 when we don't find a parent directory,
    returning -ENOENT when we don't find the current inode and making the
    caller, replay_one_buffer(), not ignore -ENOENT anymore.
    
    Fixes: 5f61b961599a ("btrfs: fix inode lookup error handling during log replay")
    CC: stable@vger.kernel.org # 6.16
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: don't skip accounting in early ENOTTY return in btrfs_uring_encoded_read() [+ + +]

Author: Caleb Sander Mateos <csander@purestorage.com>
Date:   Thu Jun 19 13:27:45 2025 -0600

    btrfs: don't skip accounting in early ENOTTY return in btrfs_uring_encoded_read()
    
    commit ea124ec327086325fc096abf42837dac471ac7ae upstream.
    
    btrfs_uring_encoded_read() returns early with -ENOTTY if the uring_cmd
    is issued with IO_URING_F_COMPAT but the kernel doesn't support compat
    syscalls. However, this early return bypasses the syscall accounting.
    Go to out_acct instead to ensure the syscall is counted.
    
    Fixes: 34310c442e17 ("btrfs: add io_uring command for encoded reads (ENCODED_READ ioctl)")
    CC: stable@vger.kernel.org # 6.15+
    Signed-off-by: Caleb Sander Mateos <csander@purestorage.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: don't skip remaining extrefs if dir not found during log replay [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Fri Jul 11 20:48:23 2025 +0100

    btrfs: don't skip remaining extrefs if dir not found during log replay
    
    commit 24e066ded45b8147b79c7455ac43a5bff7b5f378 upstream.
    
    During log replay, at add_inode_ref(), if we have an extref item that
    contains multiple extrefs and one of them points to a directory that does
    not exist in the subvolume tree, we are supposed to ignore it and process
    the remaining extrefs encoded in the extref item, since each extref can
    point to a different parent inode. However when that happens we just
    return from the function and ignore the remaining extrefs.
    
    The problem has been around since extrefs were introduced, in commit
    f186373fef00 ("btrfs: extended inode refs"), but it's hard to hit in
    practice because getting extref items encoding multiple extref requires
    getting a hash collision when computing the offset of the extref's
    key. The offset if computed like this:
    
      key.offset = btrfs_extref_hash(dir_ino, name->name, name->len);
    
    and btrfs_extref_hash() is just a wrapper around crc32c().
    
    Fix this by moving to next iteration of the loop when we don't find
    the parent directory that an extref points to.
    
    Fixes: f186373fef00 ("btrfs: extended inode refs")
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: error on missing block group when unaccounting log tree extent buffers [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Wed Jul 16 11:41:21 2025 +0100

    btrfs: error on missing block group when unaccounting log tree extent buffers
    
    commit fc5799986fbca957e2e3c0480027f249951b7bcf upstream.
    
    Currently we only log an error message if we can't find the block group
    for a log tree extent buffer when unaccounting it (while freeing a log
    tree). A missing block group means something is seriously wrong and we
    end up leaking space from the metadata space info. So return -ENOENT in
    case we don't find the block group.
    
    CC: stable@vger.kernel.org # 6.12+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: fix -ENOSPC mmap write failure on NOCOW files/extents [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Tue Jul 8 16:01:19 2025 +0100

    btrfs: fix -ENOSPC mmap write failure on NOCOW files/extents
    
    [ Upstream commit 6599716de2d68ab7b3c0429221deca306dbda878 ]
    
    If we attempt a mmap write into a NOCOW file or a prealloc extent when
    there is no more available data space (or unallocated space to allocate a
    new data block group) and we can do a NOCOW write (there are no reflinks
    for the target extent or snapshots), we always fail due to -ENOSPC, unlike
    for the regular buffered write and direct IO paths where we check that we
    can do a NOCOW write in case we can't reserve data space.
    
    Simple reproducer:
    
      $ cat test.sh
      #!/bin/bash
    
      DEV=/dev/sdi
      MNT=/mnt/sdi
    
      umount $DEV &> /dev/null
      mkfs.btrfs -f -b $((512 * 1024 * 1024)) $DEV
      mount $DEV $MNT
    
      touch $MNT/foobar
      # Make it a NOCOW file.
      chattr +C $MNT/foobar
    
      # Add initial data to file.
      xfs_io -c "pwrite -S 0xab 0 1M" $MNT/foobar
    
      # Fill all the remaining data space and unallocated space with data.
      dd if=/dev/zero of=$MNT/filler bs=4K &> /dev/null
    
      # Overwrite the file with a mmap write. Should succeed.
      xfs_io -c "mmap -w 0 1M"        \
             -c "mwrite -S 0xcd 0 1M" \
             -c "munmap"              \
             $MNT/foobar
    
      # Unmount, mount again and verify the new data was persisted.
      umount $MNT
      mount $DEV $MNT
    
      od -A d -t x1 $MNT/foobar
    
      umount $MNT
    
    Running this:
    
      $ ./test.sh
      (...)
      wrote 1048576/1048576 bytes at offset 0
      1 MiB, 256 ops; 0.0008 sec (1.188 GiB/sec and 311435.5231 ops/sec)
      ./test.sh: line 24: 234865 Bus error               xfs_io -c "mmap -w 0 1M" -c "mwrite -S 0xcd 0 1M" -c "munmap" $MNT/foobar
      0000000 ab ab ab ab ab ab ab ab ab ab ab ab ab ab ab ab
      *
      1048576
    
    Fix this by not failing in case we can't allocate data space and we can
    NOCOW into the target extent - reserving only metadata space in this case.
    
    After this change the test passes:
    
      $ ./test.sh
      (...)
      wrote 1048576/1048576 bytes at offset 0
      1 MiB, 256 ops; 0.0007 sec (1.262 GiB/sec and 330749.3540 ops/sec)
      0000000 cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd cd
      *
      1048576
    
    A test case for fstests will be added soon.
    
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

btrfs: fix iteration bug in __qgroup_excl_accounting() [+ + +]

Author: Boris Burkov <boris@bur.io>
Date:   Wed Jul 30 09:29:23 2025 -0700

    btrfs: fix iteration bug in __qgroup_excl_accounting()
    
    commit 7b632596188e1973c6b3ac1c9f8252f735e1039f upstream.
    
    __qgroup_excl_accounting() uses the qgroup iterator machinery to
    update the account of one qgroups usage for all its parent hierarchy,
    when we either add or remove a relation and have only exclusive usage.
    
    However, there is a small bug there: we loop with an extra iteration
    temporary qgroup called `cur` but never actually refer to that in the
    body of the loop. As a result, we redundantly account the same usage to
    the first qgroup in the list.
    
    This can be reproduced in the following way:
    
      mkfs.btrfs -f -O squota <dev>
      mount <dev> <mnt>
      btrfs subvol create <mnt>/sv
      dd if=/dev/zero of=<mnt>/sv/f bs=1M count=1
      sync
      btrfs qgroup create 1/100 <mnt>
      btrfs qgroup create 2/200 <mnt>
      btrfs qgroup assign 1/100 2/200 <mnt>
      btrfs qgroup assign 0/256 1/100 <mnt>
      btrfs qgroup show <mnt>
    
    and the broken result is (note the 2MiB on 1/100 and 0Mib on 2/100):
    
      Qgroupid    Referenced    Exclusive   Path
      --------    ----------    ---------   ----
      0/5           16.00KiB     16.00KiB   <toplevel>
      0/256          1.02MiB      1.02MiB   sv
    
      Qgroupid    Referenced    Exclusive   Path
      --------    ----------    ---------   ----
      0/5           16.00KiB     16.00KiB   <toplevel>
      0/256          1.02MiB      1.02MiB   sv
      1/100          2.03MiB      2.03MiB   2/100<1 member qgroup>
      2/100            0.00B        0.00B   <0 member qgroups>
    
    With this fix, which simply re-uses `qgroup` as the iteration variable,
    we see the expected result:
    
      Qgroupid    Referenced    Exclusive   Path
      --------    ----------    ---------   ----
      0/5           16.00KiB     16.00KiB   <toplevel>
      0/256          1.02MiB      1.02MiB   sv
    
      Qgroupid    Referenced    Exclusive   Path
      --------    ----------    ---------   ----
      0/5           16.00KiB     16.00KiB   <toplevel>
      0/256          1.02MiB      1.02MiB   sv
      1/100          1.02MiB      1.02MiB   2/100<1 member qgroup>
      2/100          1.02MiB      1.02MiB   <0 member qgroups>
    
    The existing fstests did not exercise two layer inheritance so this bug
    was missed. I intend to add that testing there, as well.
    
    Fixes: a0bdc04b0732 ("btrfs: qgroup: use qgroup_iterator in __qgroup_excl_accounting()")
    CC: stable@vger.kernel.org # 6.12+
    Reviewed-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: Boris Burkov <boris@bur.io>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: fix log tree replay failure due to file with 0 links and extents [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Wed Jul 30 19:18:37 2025 +0100

    btrfs: fix log tree replay failure due to file with 0 links and extents
    
    commit 0a32e4f0025a74c70dcab4478e9b29c22f5ecf2f upstream.
    
    If we log a new inode (not persisted in a past transaction) that has 0
    links and extents, then log another inode with an higher inode number, we
    end up with failing to replay the log tree with -EINVAL. The steps for
    this are:
    
    1) create new file A
    2) write some data to file A
    3) open an fd on file A
    4) unlink file A
    5) fsync file A using the previously open fd
    6) create file B (has higher inode number than file A)
    7) fsync file B
    8) power fail before current transaction commits
    
    Now when attempting to mount the fs, the log replay will fail with
    -ENOENT at replay_one_extent() when attempting to replay the first
    extent of file A. The failure comes when trying to open the inode for
    file A in the subvolume tree, since it doesn't exist.
    
    Before commit 5f61b961599a ("btrfs: fix inode lookup error handling
    during log replay"), the returned error was -EIO instead of -ENOENT,
    since we converted any errors when attempting to read an inode during
    log replay to -EIO.
    
    The reason for this is that the log replay procedure fails to ignore
    the current inode when we are at the stage LOG_WALK_REPLAY_ALL, our
    current inode has 0 links and last inode we processed in the previous
    stage has a non 0 link count. In other words, the issue is that at
    replay_one_extent() we only update wc->ignore_cur_inode if the current
    replay stage is LOG_WALK_REPLAY_INODES.
    
    Fix this by updating wc->ignore_cur_inode whenever we find an inode item
    regardless of the current replay stage. This is a simple solution and easy
    to backport, but later we can do other alternatives like avoid logging
    extents or inode items other than the inode item for inodes with a link
    count of 0.
    
    The problem with the wc->ignore_cur_inode logic has been around since
    commit f2d72f42d5fa ("Btrfs: fix warning when replaying log after fsync
    of a tmpfile") but it only became frequent to hit since the more recent
    commit 5e85262e542d ("btrfs: fix fsync of files with no hard links not
    persisting deletion"), because we stopped skipping inodes with a link
    count of 0 when logging, while before the problem would only be triggered
    if trying to replay a log tree created with an older kernel which has a
    logged inode with 0 links.
    
    A test case for fstests will be submitted soon.
    
    Reported-by: Peter Jung <ptr1337@cachyos.org>
    Link: https://lore.kernel.org/linux-btrfs/fce139db-4458-4788-bb97-c29acf6cb1df@cachyos.org/
    Reported-by: burneddi <burneddi@protonmail.com>
    Link: https://lore.kernel.org/linux-btrfs/lh4W-Lwc0Mbk-QvBhhQyZxf6VbM3E8VtIvU3fPIQgweP_Q1n7wtlUZQc33sYlCKYd-o6rryJQfhHaNAOWWRKxpAXhM8NZPojzsJPyHMf2qY=@protonmail.com/#t
    Reported-by: Russell Haley <yumpusamongus@gmail.com>
    Link: https://lore.kernel.org/linux-btrfs/598ecc75-eb80-41b3-83c2-f2317fbb9864@gmail.com/
    Fixes: f2d72f42d5fa ("Btrfs: fix warning when replaying log after fsync of a tmpfile")
    CC: stable@vger.kernel.org # 5.4+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: fix ssd_spread overallocation [+ + +]

Author: Boris Burkov <boris@bur.io>
Date:   Mon Jul 14 16:44:28 2025 -0700

    btrfs: fix ssd_spread overallocation
    
    commit 807d9023e75fc20bfd6dd2ac0408ce4af53f1648 upstream.
    
    If the ssd_spread mount option is enabled, then we run the so called
    clustered allocator for data block groups. In practice, this results in
    creating a btrfs_free_cluster which caches a block_group and borrows its
    free extents for allocation.
    
    Since the introduction of allocation size classes in 6.1, there has been
    a bug in the interaction between that feature and ssd_spread.
    find_free_extent() has a number of nested loops. The loop going over the
    allocation stages, stored in ffe_ctl->loop and managed by
    find_free_extent_update_loop(), the loop over the raid levels, and the
    loop over all the block_groups in a space_info. The size class feature
    relies on the block_group loop to ensure it gets a chance to see a
    block_group of a given size class.  However, the clustered allocator
    uses the cached cluster block_group and breaks that loop. Each call to
    do_allocation() will really just go back to the same cached block_group.
    Normally, this is OK, as the allocation either succeeds and we don't
    want to loop any more or it fails, and we clear the cluster and return
    its space to the block_group.
    
    But with size classes, the allocation can succeed, then later fail,
    outside of do_allocation() due to size class mismatch. That latter
    failure is not properly handled due to the highly complex multi loop
    logic. The result is a painful loop where we continue to allocate the
    same num_bytes from the cluster in a tight loop until it fails and
    releases the cluster and lets us try a new block_group. But by then, we
    have skipped great swaths of the available block_groups and are likely
    to fail to allocate, looping the outer loop. In pathological cases like
    the reproducer below, the cached block_group is often the very last one,
    in which case we don't perform this tight bg loop but instead rip
    through the ffe stages to LOOP_CHUNK_ALLOC and allocate a chunk, which
    is now the last one, and we enter the tight inner loop until an
    allocation failure. Then allocation succeeds on the final block_group
    and if the next allocation is a size mismatch, the exact same thing
    happens again.
    
    Triggering this is as easy as mounting with -o ssd_spread and then
    running:
    
      mount -o ssd_spread $dev $mnt
      dd if=/dev/zero of=$mnt/big bs=16M count=1 &>/dev/null
      dd if=/dev/zero of=$mnt/med bs=4M count=1 &>/dev/null
      sync
    
    if you do the two writes + sync in a loop, you can force btrfs to spin
    an excessive amount on semi-successful clustered allocations, before
    ultimately failing and advancing to the stage where we force a chunk
    allocation. This results in 2G of data allocated per iteration, despite
    only using ~20M of data. By using a small size classed extent, the inner
    loop takes longer and we can spin for longer.
    
    The simplest, shortest term fix to unbreak this is to make the clustered
    allocator size_class aware in the dumbest way, where it fails on size
    class mismatch. This may hinder the operation of the clustered
    allocator, but better hindered than completely broken and terribly
    overallocating.
    
    Further re-design improvements are also in the works.
    
    Fixes: 52bb7a2166af ("btrfs: introduce size class to block group allocator")
    CC: stable@vger.kernel.org # 6.1+
    Reported-by: David Sterba <dsterba@suse.com>
    Reviewed-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: Boris Burkov <boris@bur.io>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: fix wrong length parameter for btrfs_cleanup_ordered_extents() [+ + +]

Author: Qu Wenruo <wqu@suse.com>
Date:   Sun Jul 20 15:01:39 2025 +0930

    btrfs: fix wrong length parameter for btrfs_cleanup_ordered_extents()
    
    commit deaf895212da74635a7f0a420e1ecf8f5eca1fe5 upstream.
    
    Inside nocow_one_range(), if the checksum cloning for data reloc inode
    failed, we call btrfs_cleanup_ordered_extents() to cleanup the just
    allocated ordered extents.
    
    But unlike extent_clear_unlock_delalloc(),
    btrfs_cleanup_ordered_extents() requires a length, not an inclusive end
    bytenr.
    
    This can be problematic, as the @end is normally way larger than @len.
    
    This means btrfs_cleanup_ordered_extents() can be called on folios
    out of the correct range, and if the out-of-range folio is under
    writeback, we can incorrectly clear the ordered flag of the folio, and
    trigger the DEBUG_WARN() inside btrfs_writepage_cow_fixup().
    
    Fix the wrong parameter with correct length instead.
    
    Fixes: 94f6c5c17e52 ("btrfs: move ordered extent cleanup to where they are allocated")
    CC: stable@vger.kernel.org # 6.15+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Qu Wenruo <wqu@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: populate otime when logging an inode item [+ + +]

Author: Qu Wenruo <wqu@suse.com>
Date:   Wed Jul 2 15:08:13 2025 +0930

    btrfs: populate otime when logging an inode item
    
    commit 1ef94169db0958d6de39f9ea6e063ce887342e2d upstream.
    
    [TEST FAILURE WITH EXPERIMENTAL FEATURES]
    When running test case generic/508, the test case will fail with the new
    btrfs shutdown support:
    
    generic/508       - output mismatch (see /home/adam/xfstests/results//generic/508.out.bad)
    #    --- tests/generic/508.out  2022-05-11 11:25:30.806666664 +0930
    #    +++ /home/adam/xfstests/results//generic/508.out.bad       2025-07-02 14:53:22.401824212 +0930
    #    @@ -1,2 +1,6 @@
    #     QA output created by 508
    #     Silence is golden
    #    +Before:
    #    +After : stat.btime = Thu Jan  1 09:30:00 1970
    #    +Before:
    #    +After : stat.btime = Wed Jul  2 14:53:22 2025
    #    ...
    #    (Run 'diff -u /home/adam/xfstests/tests/generic/508.out /home/adam/xfstests/results//generic/508.out.bad'  to see the entire diff)
    Ran: generic/508
    Failures: generic/508
    Failed 1 of 1 tests
    
    Please note that the test case requires shutdown support, thus the test
    case will be skipped using the current upstream kernel, as it doesn't
    have shutdown ioctl support.
    
    [CAUSE]
    The direct cause the 0 time stamp in the log tree:
    
    leaf 30507008 items 2 free space 16057 generation 9 owner TREE_LOG
    leaf 30507008 flags 0x1(WRITTEN) backref revision 1
    checksum stored e522548d
    checksum calced e522548d
    fs uuid 57d45451-481e-43e4-aa93-289ad707a3a0
    chunk uuid d52bd3fd-5163-4337-98a7-7986993ad398
            item 0 key (257 INODE_ITEM 0) itemoff 16123 itemsize 160
                    generation 9 transid 9 size 0 nbytes 0
                    block group 0 mode 100644 links 1 uid 0 gid 0 rdev 0
                    sequence 1 flags 0x0(none)
                    atime 1751432947.492000000 (2025-07-02 14:39:07)
                    ctime 1751432947.492000000 (2025-07-02 14:39:07)
                    mtime 1751432947.492000000 (2025-07-02 14:39:07)
                    otime 0.0 (1970-01-01 09:30:00) <<<
    
    But the old fs tree has all the correct time stamp:
    
    btrfs-progs v6.12
    fs tree key (FS_TREE ROOT_ITEM 0)
    leaf 30425088 items 2 free space 16061 generation 5 owner FS_TREE
    leaf 30425088 flags 0x1(WRITTEN) backref revision 1
    checksum stored 48f6c57e
    checksum calced 48f6c57e
    fs uuid 57d45451-481e-43e4-aa93-289ad707a3a0
    chunk uuid d52bd3fd-5163-4337-98a7-7986993ad398
            item 0 key (256 INODE_ITEM 0) itemoff 16123 itemsize 160
                    generation 3 transid 0 size 0 nbytes 16384
                    block group 0 mode 40755 links 1 uid 0 gid 0 rdev 0
                    sequence 0 flags 0x0(none)
                    atime 1751432947.0 (2025-07-02 14:39:07)
                    ctime 1751432947.0 (2025-07-02 14:39:07)
                    mtime 1751432947.0 (2025-07-02 14:39:07)
                    otime 1751432947.0 (2025-07-02 14:39:07) <<<
    
    The root cause is that fill_inode_item() in tree-log.c is only
    populating a/c/m time, not the otime (or btime in statx output).
    
    Part of the reason is that, the vfs inode only has a/c/m time, no native
    btime support yet.
    
    [FIX]
    Thankfully btrfs has its otime stored in btrfs_inode::i_otime_sec and
    btrfs_inode::i_otime_nsec.
    
    So what we really need is just fill the otime time stamp in
    fill_inode_item() of tree-log.c
    
    There is another fill_inode_item() in inode.c, which is doing the proper
    otime population.
    
    Fixes: 94edf4ae43a5 ("Btrfs: don't bother committing delayed inode updates when fsyncing")
    CC: stable@vger.kernel.org
    Reviewed-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: Qu Wenruo <wqu@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: qgroup: fix qgroup create ioctl returning success after quotas disabled [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Tue Jul 1 15:44:16 2025 +0100

    btrfs: qgroup: fix qgroup create ioctl returning success after quotas disabled
    
    commit 08530d6e638427e7e1344bd67bacc03882ba95b9 upstream.
    
    When quotas are disabled qgroup ioctls are supposed to return -ENOTCONN,
    but the qgroup create ioctl stopped doing that when it races with a quota
    disable operation, returning 0 instead. This change of behaviour happened
    in commit 6ed05643ddb1 ("btrfs: create qgroup earlier in snapshot
    creation").
    
    The issue happens as follows:
    
    1) Task A enters btrfs_ioctl_qgroup_create(), qgroups are enabled and so
       qgroup_enabled() returns true since fs_info->quota_root is not NULL;
    
    2) Task B enters btrfs_ioctl_quota_ctl() -> btrfs_quota_disable() and
       disables qgroups, so now fs_info->quota_root is NULL;
    
    3) Task A enters btrfs_create_qgroup() and calls btrfs_qgroup_mode(),
       which returns BTRFS_QGROUP_MODE_DISABLED since quotas are disabled,
       and then btrfs_create_qgroup() returns 0 to the caller, which makes
       the ioctl return 0 instead of -ENOTCONN.
    
       The check for fs_info->quota_root and returning -ENOTCONN if it's NULL
       is made only after the call btrfs_qgroup_mode().
    
    Fix this by moving the check for disabled quotas with btrfs_qgroup_mode()
    into transaction.c:create_pending_snapshot(), so that we don't abort the
    transaction if btrfs_create_qgroup() returns -ENOTCONN and quotas are
    disabled.
    
    Fixes: 6ed05643ddb1 ("btrfs: create qgroup earlier in snapshot creation")
    CC: stable@vger.kernel.org # 6.12+
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: qgroup: fix race between quota disable and quota rescan ioctl [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Mon Jun 30 13:19:20 2025 +0100

    btrfs: qgroup: fix race between quota disable and quota rescan ioctl
    
    commit e1249667750399a48cafcf5945761d39fa584edf upstream.
    
    There's a race between a task disabling quotas and another running the
    rescan ioctl that can result in a use-after-free of qgroup records from
    the fs_info->qgroup_tree rbtree.
    
    This happens as follows:
    
    1) Task A enters btrfs_ioctl_quota_rescan() -> btrfs_qgroup_rescan();
    
    2) Task B enters btrfs_quota_disable() and calls
       btrfs_qgroup_wait_for_completion(), which does nothing because at that
       point fs_info->qgroup_rescan_running is false (it wasn't set yet by
       task A);
    
    3) Task B calls btrfs_free_qgroup_config() which starts freeing qgroups
       from fs_info->qgroup_tree without taking the lock fs_info->qgroup_lock;
    
    4) Task A enters qgroup_rescan_zero_tracking() which starts iterating
       the fs_info->qgroup_tree tree while holding fs_info->qgroup_lock,
       but task B is freeing qgroup records from that tree without holding
       the lock, resulting in a use-after-free.
    
    Fix this by taking fs_info->qgroup_lock at btrfs_free_qgroup_config().
    Also at btrfs_qgroup_rescan() don't start the rescan worker if quotas
    were already disabled.
    
    Reported-by: cen zhang <zzzccc427@gmail.com>
    Link: https://lore.kernel.org/linux-btrfs/CAFRLqsV+cMDETFuzqdKSHk_FDm6tneea45krsHqPD6B3FetLpQ@mail.gmail.com/
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: qgroup: set quota enabled bit if quota disable fails flushing reservations [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Tue Jul 1 11:39:44 2025 +0100

    btrfs: qgroup: set quota enabled bit if quota disable fails flushing reservations
    
    commit e41c75ca3189341e76e6af64b857c05b68a1d7db upstream.
    
    Before waiting for the rescan worker to finish and flushing reservations,
    we clear the BTRFS_FS_QUOTA_ENABLED flag from fs_info. If we fail flushing
    reservations we leave with the flag not set which is not correct since
    quotas are still enabled - we must set back the flag on error paths, such
    as when we fail to start a transaction, except for error paths that abort
    a transaction. The reservation flushing happens very early before we do
    any operation that actually disables quotas and before we start a
    transaction, so set back BTRFS_FS_QUOTA_ENABLED if it fails.
    
    Fixes: af0e2aab3b70 ("btrfs: qgroup: flush reservations during quota disable")
    CC: stable@vger.kernel.org # 6.12+
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: send: use fallocate for hole punching with send stream v2 [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Fri Jul 18 13:07:29 2025 +0100

    btrfs: send: use fallocate for hole punching with send stream v2
    
    commit 005b0a0c24e1628313e951516b675109a92cacfe upstream.
    
    Currently holes are sent as writes full of zeroes, which results in
    unnecessarily using disk space at the receiving end and increasing the
    stream size.
    
    In some cases we avoid sending writes of zeroes, like during a full
    send operation where we just skip writes for holes.
    
    But for some cases we fill previous holes with writes of zeroes too, like
    in this scenario:
    
    1) We have a file with a hole in the range [2M, 3M), we snapshot the
       subvolume and do a full send. The range [2M, 3M) stays as a hole at
       the receiver since we skip sending write commands full of zeroes;
    
    2) We punch a hole for the range [3M, 4M) in our file, so that now it
       has a 2M hole in the range [2M, 4M), and snapshot the subvolume.
       Now if we do an incremental send, we will send write commands full
       of zeroes for the range [2M, 4M), removing the hole for [2M, 3M) at
       the receiver.
    
    We could improve cases such as this last one by doing additional
    comparisons of file extent items (or their absence) between the parent
    and send snapshots, but that's a lot of code to add plus additional CPU
    and IO costs.
    
    Since the send stream v2 already has a fallocate command and btrfs-progs
    implements a callback to execute fallocate since the send stream v2
    support was added to it, update the kernel to use fallocate for punching
    holes for V2+ streams.
    
    Test coverage is provided by btrfs/284 which is a version of btrfs/007
    that exercises send stream v2 instead of v1, using fsstress with random
    operations and fssum to verify file contents.
    
    Link: https://github.com/kdave/btrfs-progs/issues/1001
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: zoned: do not remove unwritten non-data block group [+ + +]

Author: Naohiro Aota <naohiro.aota@wdc.com>
Date:   Sun Jun 29 23:07:42 2025 +0900

    btrfs: zoned: do not remove unwritten non-data block group
    
    commit 3061801420469610c8fa6080a950e56770773ef1 upstream.
    
    There are some reports of "unable to find chunk map for logical 2147483648
    length 16384" error message appears in dmesg. This means some IOs are
    occurring after a block group is removed.
    
    When a metadata tree node is cleaned on a zoned setup, we keep that node
    still dirty and write it out not to create a write hole. However, this can
    make a block group's used bytes == 0 while there is a dirty region left.
    
    Such an unused block group is moved into the unused_bg list and processed
    for removal. When the removal succeeds, the block group is removed from the
    transaction->dirty_bgs list, so the unused dirty nodes in the block group
    are not sent at the transaction commit time. It will be written at some
    later time e.g, sync or umount, and causes "unable to find chunk map"
    errors.
    
    This can happen relatively easy on SMR whose zone size is 256MB. However,
    calling do_zone_finish() on such block group returns -EAGAIN and keep that
    block group intact, which is why the issue is hidden until now.
    
    Fixes: afba2bc036b0 ("btrfs: zoned: implement active zone tracking")
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: zoned: do not select metadata BG as finish target [+ + +]

Author: Naohiro Aota <naohiro.aota@wdc.com>
Date:   Wed Jul 16 16:59:52 2025 +0900

    btrfs: zoned: do not select metadata BG as finish target
    
    commit 3a931e9b39c7ff8066657042f5f00d3b7e6ad315 upstream.
    
    We call btrfs_zone_finish_one_bg() to zone finish one block group and make
    room to activate another block group. Currently, we can choose a metadata
    block group as a target. But, as we reserve an active metadata block group,
    we no longer want to select a metadata block group. So, skip it in the
    loop.
    
    CC: stable@vger.kernel.org # 6.6+
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: zoned: requeue to unused block group list if zone finish failed [+ + +]

Author: Naohiro Aota <naohiro.aota@wdc.com>
Date:   Sun Jun 29 23:18:29 2025 +0900

    btrfs: zoned: requeue to unused block group list if zone finish failed
    
    commit 62be7afcc13b2727bdc6a4c91aefed6b452e6ecc upstream.
    
    btrfs_zone_finish() can fail for several reason. If it is -EAGAIN, we need
    to try it again later. So, put the block group to the retry list properly.
    
    Failing to do so will keep the removable block group intact until remount
    and can causes unnecessary ENOSPC.
    
    Fixes: 74e91b12b115 ("btrfs: zoned: zone finish unused block group")
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: Naohiro Aota <naohiro.aota@wdc.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: zoned: reserve data_reloc block group on mount [+ + +]

Author: Johannes Thumshirn <johannes.thumshirn@wdc.com>
Date:   Tue Jun 3 08:14:01 2025 +0200

    btrfs: zoned: reserve data_reloc block group on mount
    
    commit 694ce5e143d67267ad26b04463e790a597500b00 upstream.
    
    Create a block group dedicated for data relocation on mount of a zoned
    filesystem.
    
    If there is already more than one empty DATA block group on mount, this
    one is picked for the data relocation block group, instead of a newly
    created one.
    
    This is done to ensure, there is always space for performing garbage
    collection and the filesystem is not hitting ENOSPC under heavy overwrite
    workloads.
    
    CC: stable@vger.kernel.org # 6.6+
    Reviewed-by: Filipe Manana <fdmanana@suse.com>
    Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

btrfs: zoned: use filesystem size not disk size for reclaim decision [+ + +]

Author: Johannes Thumshirn <johannes.thumshirn@wdc.com>
Date:   Tue May 20 09:20:47 2025 +0200

    btrfs: zoned: use filesystem size not disk size for reclaim decision
    
    commit 55f7c65b2f69c7e4cb7aa7c1654a228ccf734fd8 upstream.
    
    When deciding if a zoned filesystem is reaching the threshold to reclaim
    data block groups, look at the size of the filesystem not to potentially
    total available size of all drives in the filesystem.
    
    Especially if a filesystem was created with mkfs' -b option, constraining
    it to only a portion of the block device, the numbers won't match and
    potentially garbage collection is kicking in too late.
    
    Fixes: 3687fcb0752a ("btrfs: zoned: make auto-reclaim less aggressive")
    CC: stable@vger.kernel.org # 6.1+
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Tested-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

bus: mhi: host: pci_generic: Add Telit FN990B40 modem support [+ + +]

Author: Daniele Palmas <dnlplm@gmail.com>
Date:   Wed Jul 16 11:18:36 2025 +0200

    bus: mhi: host: pci_generic: Add Telit FN990B40 modem support
    
    [ Upstream commit 00559ba3ae740e7544b48fb509b2b97f56615892 ]
    
    Add SDX72 based modem Telit FN990B40, reusing FN920C04 configuration.
    
    01:00.0 Unassigned class [ff00]: Qualcomm Device 0309
            Subsystem: Device 1c5d:201a
    
    Signed-off-by: Daniele Palmas <dnlplm@gmail.com>
    [mani: added sdx72 in the comment to identify the chipset]
    Signed-off-by: Manivannan Sadhasivam <mani@kernel.org>
    Link: https://patch.msgid.link/20250716091836.999364-1-dnlplm@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bus: mhi: host: pci_generic: Disable runtime PM for QDU100 [+ + +]

Author: Vivek Pernamitta <quic_vpernami@quicinc.com>
Date:   Fri Apr 25 12:49:31 2025 +0530

    bus: mhi: host: pci_generic: Disable runtime PM for QDU100
    
    [ Upstream commit 0494cf9793b7c250f63fdb2cb6b648473e9d4ae6 ]
    
    The QDU100 device does not support the MHI M3 state, necessitating the
    disabling of runtime PM for this device. It is essential to disable
    runtime PM if the device does not support M3 state.
    
    Signed-off-by: Vivek Pernamitta <quic_vpernami@quicinc.com>
    [mani: Fixed the kdoc comment for no_m3]
    Signed-off-by: Manivannan Sadhasivam <mani@kernel.org>
    Reviewed-by: Krishna Chaitanya Chundru <krishna.chundru@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250425-vdev_next-20250411_pm_disable-v4-1-d4870a73ebf9@quicinc.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

can: ti_hecc: fix -Woverflow compiler warning [+ + +]

Author: Vincent Mailhol <mailhol.vincent@wanadoo.fr>
Date:   Tue Jul 15 20:28:11 2025 +0900

    can: ti_hecc: fix -Woverflow compiler warning
    
    [ Upstream commit 7cae4d04717b002cffe41169da3f239c845a0723 ]
    
    Fix below default (W=0) warning:
    
      drivers/net/can/ti_hecc.c: In function 'ti_hecc_start':
      drivers/net/can/ti_hecc.c:386:20: warning: conversion from 'long unsigned int' to 'u32' {aka 'unsigned int'} changes value from '18446744073709551599' to '4294967279' [-Woverflow]
        386 |         mbx_mask = ~BIT(HECC_RX_LAST_MBOX);
            |                    ^
    
    Signed-off-by: Vincent Mailhol <mailhol.vincent@wanadoo.fr>
    Link: https://patch.msgid.link/20250715-can-compile-test-v2-1-f7fd566db86f@wanadoo.fr
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cdc-acm: fix race between initial clearing halt and open [+ + +]

Author: Oliver Neukum <oneukum@suse.com>
Date:   Thu Jul 17 16:12:50 2025 +0200

    cdc-acm: fix race between initial clearing halt and open
    
    commit 64690a90cd7c6db16d3af8616be1f4bf8d492850 upstream.
    
    On the devices that need their endpoints to get an
    initial clear_halt, this needs to be done before
    the devices can be opened. That means it needs to be
    before the devices are registered.
    
    Fixes: 15bf722e6f6c0 ("cdc-acm: Add support of ATOL FPrint fiscal printers")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Oliver Neukum <oneukum@suse.com>
    Link: https://lore.kernel.org/r/20250717141259.2345605-1-oneukum@suse.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

char: misc: Fix improper and inaccurate error code returned by misc_init() [+ + +]

Author: Zijun Hu <zijun.hu@oss.qualcomm.com>
Date:   Fri Jun 20 22:35:20 2025 +0800

    char: misc: Fix improper and inaccurate error code returned by misc_init()
    
    [ Upstream commit 0ef1fe4bc38673db72e39b700b29c50dfcc5a415 ]
    
    misc_init() returns -EIO for __register_chrdev() invocation failure, but:
    
    - -EIO is for I/O error normally, but __register_chrdev() does not do I/O.
    - -EIO can not cover various error codes returned by __register_chrdev().
    
    Fix by returning error code of __register_chrdev().
    
    Signed-off-by: Zijun Hu <zijun.hu@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250620-fix_mischar-v1-3-6c2716bbf1fa@oss.qualcomm.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cifs: Fix calling CIFSFindFirst() for root path without msearch [+ + +]

Author: Pali Rohár <pali@kernel.org>
Date:   Mon Dec 30 20:54:11 2024 +0100

    cifs: Fix calling CIFSFindFirst() for root path without msearch
    
    [ Upstream commit b460249b9a1dab7a9f58483e5349d045ad6d585c ]
    
    To query root path (without msearch wildcard) it is needed to
    send pattern '\' instead of '' (empty string).
    
    This allows to use CIFSFindFirst() to query information about root path
    which is being used in followup changes.
    
    This change fixes the stat() syscall called on the root path on the mount.
    It is because stat() syscall uses the cifs_query_path_info() function and
    it can fallback to the CIFSFindFirst() usage with msearch=false.
    
    Signed-off-by: Pali Rohár <pali@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cifs: Fix collect_sample() to handle any iterator type [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Mon Aug 11 08:34:04 2025 +0100

    cifs: Fix collect_sample() to handle any iterator type
    
    [ Upstream commit b63335fb3d32579c5ff0b7038b9cc23688fff528 ]
    
    collect_sample() is used to gather samples of the data in a Write op for
    analysis to try and determine if the compression algorithm is likely to
    achieve anything more quickly than actually running the compression
    algorithm.
    
    However, collect_sample() assumes that the data it is going to be sampling
    is stored in an ITER_XARRAY-type iterator (which it now should never be)
    and doesn't actually check that it is before accessing the underlying
    xarray directly.
    
    Fix this by replacing the code with a loop that just uses the standard
    iterator functions to sample every other 2KiB block, skipping the
    intervening ones.  It's not quite the same as the previous algorithm as it
    doesn't necessarily align to the pages within an ordinary write from the
    pagecache.
    
    Note that the btrfs code from which this was derived samples the inode's
    pagecache directly rather than the iterator - but that doesn't necessarily
    work for network filesystems if O_DIRECT is in operation.
    
    Fixes: 94ae8c3fee94 ("smb: client: compress: LZ77 code improvements cleanup")
    Signed-off-by: David Howells <dhowells@redhat.com>
    Acked-by: Paulo Alcantara (Red Hat) <pc@manguebit.org>
    cc: Enzo Matsumiya <ematsumiya@suse.de>
    cc: Shyam Prasad N <sprasad@microsoft.com>
    cc: Tom Talpey <tom@talpey.com>
    cc: linux-cifs@vger.kernel.org
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cifs: reset iface weights when we cannot find a candidate [+ + +]

Author: Shyam Prasad N <sprasad@microsoft.com>
Date:   Thu Jul 17 17:36:13 2025 +0530

    cifs: reset iface weights when we cannot find a candidate
    
    commit 9d5eff7821f6d70f7d1b4d8a60680fba4de868a7 upstream.
    
    We now do a weighted selection of server interfaces when allocating
    new channels. The weights are decided based on the speed advertised.
    The fulfilled weight for an interface is a counter that is used to
    track the interface selection. It should be reset back to zero once
    all interfaces fulfilling their weight.
    
    In cifs_chan_update_iface, this reset logic was missing. As a result
    when the server interface list changes, the client may not be able
    to find a new candidate for other channels after all interfaces have
    been fulfilled.
    
    Fixes: a6d8fb54a515 ("cifs: distribute channels across interfaces based on speed")
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Shyam Prasad N <sprasad@microsoft.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

clk: qcom: dispcc-sm8750: Fix setting rate byte and pixel clocks [+ + +]

Author: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
Date:   Tue May 20 11:07:42 2025 +0200

    clk: qcom: dispcc-sm8750: Fix setting rate byte and pixel clocks
    
    commit 0acf9e65a47d1e489c8b24c45a64436e30bcccf4 upstream.
    
    On SM8750 the setting rate of pixel and byte clocks, while the parent
    DSI PHY PLL, fails with:
    
      disp_cc_mdss_byte0_clk_src: rcg didn't update its configuration.
    
    DSI PHY PLL has to be unprepared and its "PLL Power Down" bits in
    CMN_CTRL_0 asserted.
    
    Mark these clocks with CLK_OPS_PARENT_ENABLE to ensure the parent is
    enabled during rate changes.
    
    Cc: stable@vger.kernel.org
    Fixes: f1080d8dab0f ("clk: qcom: dispcc-sm8750: Add SM8750 Display clock controller")
    Signed-off-by: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
    Reviewed-by: Abhinav Kumar <quic_abhinavk@quicinc.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250520090741.45820-2-krzysztof.kozlowski@linaro.org
    Signed-off-by: Bjorn Andersson <andersson@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

clk: qcom: gcc-ipq8074: fix broken freq table for nss_port6_tx_clk_src [+ + +]

Author: Christian Marangi <ansuelsmth@gmail.com>
Date:   Thu May 22 22:25:55 2025 +0200

    clk: qcom: gcc-ipq8074: fix broken freq table for nss_port6_tx_clk_src
    
    commit 077ec7bcec9a8987d2a133afb7e13011878c7576 upstream.
    
    With the conversion done by commit e88f03230dc0 ("clk: qcom: gcc-ipq8074:
    rework nss_port5/6 clock to multiple conf") a Copy-Paste error was made
    for the nss_port6_tx_clk_src frequency table.
    
    This was caused by the wrong setting of the parent in
    ftbl_nss_port6_tx_clk_src that was wrongly set to P_UNIPHY1_RX instead
    of P_UNIPHY2_TX.
    
    This cause the UNIPHY2 port to malfunction when it needs to be scaled to
    higher clock. The malfunction was observed with the example scenario
    with an Aquantia 10G PHY connected and a speed higher than 1G (example
    2.5G)
    
    Fix the broken frequency table to restore original functionality.
    
    Cc: stable@vger.kernel.org
    Fixes: e88f03230dc0 ("clk: qcom: gcc-ipq8074: rework nss_port5/6 clock to multiple conf")
    Signed-off-by: Christian Marangi <ansuelsmth@gmail.com>
    Tested-by: Robert Marko <robimarko@gmail.com>
    Link: https://lore.kernel.org/r/20250522202600.4028-1-ansuelsmth@gmail.com
    Signed-off-by: Bjorn Andersson <andersson@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

clk: qcom: ipq5018: keep XO clock always on [+ + +]

Author: George Moussalem <george.moussalem@outlook.com>
Date:   Fri May 16 16:36:08 2025 +0400

    clk: qcom: ipq5018: keep XO clock always on
    
    [ Upstream commit 693a723291d0634eaea24cff2f9d807f3223f204 ]
    
    The XO clock must not be disabled to avoid the kernel trying to disable
    the it. As such, keep the XO clock always on by flagging it as critical.
    
    Signed-off-by: George Moussalem <george.moussalem@outlook.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250516-ipq5018-cmn-pll-v4-1-389a6b30e504@outlook.com
    Signed-off-by: Bjorn Andersson <andersson@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

clk: renesas: rzg2l: Postpone updating priv->clks[] [+ + +]

Author: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
Date:   Wed May 14 12:04:08 2025 +0300

    clk: renesas: rzg2l: Postpone updating priv->clks[]
    
    [ Upstream commit 2f96afdffad4ef74e3c511207058c41c54a2d014 ]
    
    Since the sibling data is filled after the priv->clks[] array entry is
    populated, the first clock that is probed and has a sibling will
    temporarily behave as its own sibling until its actual sibling is
    populated. To avoid any issues, postpone updating priv->clks[] until after
    the sibling is populated.
    
    Signed-off-by: Claudiu Beznea <claudiu.beznea.uj@bp.renesas.com>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/20250514090415.4098534-2-claudiu.beznea.uj@bp.renesas.com
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

clk: samsung: exynos850: fix a comment [+ + +]

Author: André Draszik <andre.draszik@linaro.org>
Date:   Tue Jun 3 16:43:21 2025 +0100

    clk: samsung: exynos850: fix a comment
    
    commit 320e7efce30e2613c2c7877acc46a8e71192cdcd upstream.
    
    The code below the updated comment is for CMU_CPUCL1, not CMU_CPUCL0.
    
    Fixes: dedf87341ad6 ("clk: samsung: exynos850: Add CMU_CPUCL0 and CMU_CPUCL1")
    Cc: stable@vger.kernel.org
    Signed-off-by: André Draszik <andre.draszik@linaro.org>
    Link: https://lore.kernel.org/r/20250603-samsung-clk-fixes-v1-3-49daf1ff4592@linaro.org
    Signed-off-by: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

clk: samsung: gs101: fix alternate mout_hsi0_usb20_ref parent clock [+ + +]

Author: André Draszik <andre.draszik@linaro.org>
Date:   Tue Jun 3 16:43:20 2025 +0100

    clk: samsung: gs101: fix alternate mout_hsi0_usb20_ref parent clock
    
    commit ca243e653f71d8c4724a68c9033923f945b1084d upstream.
    
    The alternate parent clock for this mux is mout_pll_usb, not the pll
    itself.
    
    Fixes: 1891e4d48755 ("clk: samsung: gs101: add support for cmu_hsi0")
    Cc: stable@vger.kernel.org
    Signed-off-by: André Draszik <andre.draszik@linaro.org>
    Link: https://lore.kernel.org/r/20250603-samsung-clk-fixes-v1-2-49daf1ff4592@linaro.org
    Signed-off-by: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

clk: samsung: gs101: fix CLK_DOUT_CMU_G3D_BUSD [+ + +]

Author: André Draszik <andre.draszik@linaro.org>
Date:   Tue Jun 3 16:43:19 2025 +0100

    clk: samsung: gs101: fix CLK_DOUT_CMU_G3D_BUSD
    
    commit 29a9361f0b50be2b16d308695e30ee030fedea2c upstream.
    
    Use the correct Linux clock ID when instantiating the G3D_BUSD
    div_clock.
    
    Fixes: 2c597bb7d66a ("clk: samsung: clk-gs101: Add cmu_top, cmu_misc and cmu_apm support")
    Cc: stable@vger.kernel.org
    Signed-off-by: André Draszik <andre.draszik@linaro.org>
    Link: https://lore.kernel.org/r/20250603-samsung-clk-fixes-v1-1-49daf1ff4592@linaro.org
    Signed-off-by: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

clk: tegra: periph: Fix error handling and resolve unsigned compare warning [+ + +]

Author: Pei Xiao <xiaopei01@kylinos.cn>
Date:   Wed Jul 9 15:37:13 2025 +0800

    clk: tegra: periph: Fix error handling and resolve unsigned compare warning
    
    [ Upstream commit 2dc2ca9000eea2eb749f658196204cb84d4306f7 ]
    
    ./drivers/clk/tegra/clk-periph.c:59:5-9: WARNING:
            Unsigned expression compared with zero: rate < 0
    
    The unsigned long 'rate' variable caused:
    - Incorrect handling of negative errors
    - Compile warning: "Unsigned expression compared with zero"
    
    Fix by changing to long type and adding req->rate cast.
    
    Signed-off-by: Pei Xiao <xiaopei01@kylinos.cn>
    Link: https://lore.kernel.org/r/79c7f01e29876c612e90d6d0157fb1572ca8b3fb.1752046270.git.xiaopei01@kylinos.cn
    Acked-by: Thierry Reding <treding@nvidia.com>
    Signed-off-by: Stephen Boyd <sboyd@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

clk: thead: Mark essential bus clocks as CLK_IGNORE_UNUSED [+ + +]

Author: Michal Wilczynski <m.wilczynski@samsung.com>
Date:   Mon Jun 23 20:08:53 2025 +0200

    clk: thead: Mark essential bus clocks as CLK_IGNORE_UNUSED
    
    [ Upstream commit 0370395d45ca6dd53bb931978f0e91ac8dd6f1c5 ]
    
    Probing peripherals in the AON and PERI domains, such as the PVT thermal
    sensor and the PWM controller, can lead to boot hangs or unresponsive
    devices on the LPi4A board. The root cause is that their parent bus
    clocks ('CLK_CPU2AON_X2H' and the 'CLK_PERISYS_APB' clocks) are
    automatically gated by the kernel's power-saving mechanisms when the bus
    is perceived as idle.
    
    Alternative solutions were investigated, including modeling the parent
    bus in the Device Tree with 'simple-pm-bus' or refactoring the clock
    driver's parentage. The 'simple-pm-bus' approach is not viable due to
    the lack of defined bus address ranges in the hardware manual and its
    creation of improper dependencies on the 'pm_runtime' API for consumer
    drivers.
    
    Therefore, applying the'`CLK_IGNORE_UNUSED' flag directly to the
    essential bus clocks is the most direct and targeted fix. This prevents
    the kernel from auto-gating these buses and ensures peripherals remain
    accessible.
    
    This change fixes the boot hang associated with the PVT sensor and
    resolves the functional issues with the PWM controller.
    
    Link: https://lore.kernel.org/all/9e8a12db-236d-474c-b110-b3be96edf057@samsung.com/ [1]
    
    Reviewed-by: Drew Fustini <drew@pdp7.com>
    Acked-by: Stephen Boyd <sboyd@kernel.org>
    Signed-off-by: Michal Wilczynski <m.wilczynski@samsung.com>
    Signed-off-by: Drew Fustini <drew@pdp7.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

comedi: fix race between polling and detaching [+ + +]

Author: Ian Abbott <abbotti@mev.co.uk>
Date:   Tue Jul 22 16:53:16 2025 +0100

    comedi: fix race between polling and detaching
    
    commit 35b6fc51c666fc96355be5cd633ed0fe4ccf68b2 upstream.
    
    syzbot reports a use-after-free in comedi in the below link, which is
    due to comedi gladly removing the allocated async area even though poll
    requests are still active on the wait_queue_head inside of it. This can
    cause a use-after-free when the poll entries are later triggered or
    removed, as the memory for the wait_queue_head has been freed.  We need
    to check there are no tasks queued on any of the subdevices' wait queues
    before allowing the device to be detached by the `COMEDI_DEVCONFIG`
    ioctl.
    
    Tasks will read-lock `dev->attach_lock` before adding themselves to the
    subdevice wait queue, so fix the problem in the `COMEDI_DEVCONFIG` ioctl
    handler by write-locking `dev->attach_lock` before checking that all of
    the subdevices are safe to be deleted.  This includes testing for any
    sleepers on the subdevices' wait queues.  It remains locked until the
    device has been detached.  This requires the `comedi_device_detach()`
    function to be refactored slightly, moving the bulk of it into new
    function `comedi_device_detach_locked()`.
    
    Note that the refactor of `comedi_device_detach()` results in
    `comedi_device_cancel_all()` now being called while `dev->attach_lock`
    is write-locked, which wasn't the case previously, but that does not
    matter.
    
    Thanks to Jens Axboe for diagnosing the problem and co-developing this
    patch.
    
    Cc: stable <stable@kernel.org>
    Fixes: 2f3fdcd7ce93 ("staging: comedi: add rw_semaphore to protect against device detachment")
    Link: https://lore.kernel.org/all/687bd5fe.a70a0220.693ce.0091.GAE@google.com/
    Reported-by: syzbot+01523a0ae5600aef5895@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=01523a0ae5600aef5895
    Co-developed-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Ian Abbott <abbotti@mev.co.uk>
    Tested-by: Jens Axboe <axboe@kernel.dk>
    Link: https://lore.kernel.org/r/20250722155316.27432-1-abbotti@mev.co.uk
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

cpufreq: CPPC: Mark driver with NEED_UPDATE_LIMITS flag [+ + +]

Author: Prashant Malani <pmalani@google.com>
Date:   Tue Jul 22 05:55:40 2025 +0000

    cpufreq: CPPC: Mark driver with NEED_UPDATE_LIMITS flag
    
    [ Upstream commit 0a1416a49e63c320f6e6c1c8d07e1b58c0d4a3f3 ]
    
    AMU counters on certain CPPC-based platforms tend to yield inaccurate
    delivered performance measurements on systems that are idle/mostly idle.
    This results in an inaccurate frequency being stored by cpufreq in its
    policy structure when the CPU is brought online. [1]
    
    Consequently, if the userspace governor tries to set the frequency to a
    new value, there is a possibility that it would be the erroneous value
    stored earlier. In such a scenario, cpufreq would assume that the
    requested frequency has already been set and return early, resulting in
    the correct/new frequency request never making it to the hardware.
    
    Since the operating frequency is liable to this sort of inconsistency,
    mark the CPPC driver with CPUFREQ_NEED_UPDATE_LIMITS so that it is always
    invoked when a target frequency update is requested.
    
    Link: https://lore.kernel.org/linux-pm/20250619000925.415528-3-pmalani@google.com/ [1]
    Suggested-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Prashant Malani <pmalani@google.com>
    Acked-by: Viresh Kumar <viresh.kumar@linaro.org>
    Link: https://patch.msgid.link/20250722055611.130574-2-pmalani@google.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cpufreq: Exit governor when failed to start old governor [+ + +]

Author: Lifeng Zheng <zhenglifeng1@huawei.com>
Date:   Wed Jul 9 18:41:45 2025 +0800

    cpufreq: Exit governor when failed to start old governor
    
    [ Upstream commit 0ae204405095abfbc2d694ee0fbb49bcbbe55c57 ]
    
    Detect the result of starting old governor in cpufreq_set_policy(). If it
    fails, exit the governor and clear policy->governor.
    
    Signed-off-by: Lifeng Zheng <zhenglifeng1@huawei.com>
    Link: https://patch.msgid.link/20250709104145.2348017-5-zhenglifeng1@huawei.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cpufreq: intel_pstate: Add Granite Rapids support in no-HWP mode [+ + +]

Author: Li RongQing <lirongqing@baidu.com>
Date:   Mon Jun 23 18:56:01 2025 +0800

    cpufreq: intel_pstate: Add Granite Rapids support in no-HWP mode
    
    [ Upstream commit fc64e0421598aaa87d61184f6777b52614a095be ]
    
    Users may disable HWP in firmware, in which case intel_pstate
    wouldn't load unless the CPU model is explicitly supported.
    
    Signed-off-by: Li RongQing <lirongqing@baidu.com>
    Link: https://patch.msgid.link/20250623105601.3924-1-lirongqing@baidu.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cpuidle: governors: menu: Avoid using invalid recent intervals data [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Mon Aug 11 17:03:11 2025 +0200

    cpuidle: governors: menu: Avoid using invalid recent intervals data
    
    [ Upstream commit fa3fa55de0d6177fdcaf6fc254f13cc8f33c3eed ]
    
    Marc has reported that commit 85975daeaa4d ("cpuidle: menu: Avoid
    discarding useful information") caused the number of wakeup interrupts
    to increase on an idle system [1], which was not expected to happen
    after merely allowing shallower idle states to be selected by the
    governor in some cases.
    
    However, on the system in question, all of the idle states deeper than
    WFI are rejected by the driver due to a firmware issue [2].  This causes
    the governor to only consider the recent interval duriation data
    corresponding to attempts to enter WFI that are successful and the
    recent invervals table is filled with values lower than the scheduler
    tick period.  Consequently, the governor predicts an idle duration
    below the scheduler tick period length and avoids stopping the tick
    more often which leads to the observed symptom.
    
    Address it by modifying the governor to update the recent intervals
    table also when entering the previously selected idle state fails, so
    it knows that the short idle intervals might have been the minority
    had the selected idle states been actually entered every time.
    
    Fixes: 85975daeaa4d ("cpuidle: menu: Avoid discarding useful information")
    Link: https://lore.kernel.org/linux-pm/86o6sv6n94.wl-maz@kernel.org/ [1]
    Link: https://lore.kernel.org/linux-pm/7ffcb716-9a1b-48c2-aaa4-469d0df7c792@arm.com/ [2]
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Tested-by: Christian Loehle <christian.loehle@arm.com>
    Tested-by: Marc Zyngier <maz@kernel.org>
    Reviewed-by: Christian Loehle <christian.loehle@arm.com>
    Link: https://patch.msgid.link/2793874.mvXUDI8C0e@rafael.j.wysocki
    Signed-off-by: Sasha Levin <sashal@kernel.org>

crypto: caam - Support iMX8QXP and variants thereof [+ + +]

Author: John Ernberg <john.ernberg@actia.se>
Date:   Wed Jun 11 11:38:08 2025 +0000

    crypto: caam - Support iMX8QXP and variants thereof
    
    [ Upstream commit ac8aff0035fa58e53b39bd565ad6422a90ccdc87 ]
    
    The iMX8QXP (and variants such as the QX, DX, DXP) all identify as iMX8QXP.
    
    They have the exact same restrictions as the supported iMX8QM introduced
    at commit 61bb8db6f682 ("crypto: caam - Add support for i.MX8QM")
    
    Loosen the check a little bit with a wildcard to also match the iMX8QXP
    and its variants.
    
    Signed-off-by: John Ernberg <john.ernberg@actia.se>
    Reviewed-by: Frank Li <Frank.Li@nxp.com>
    Reviewed-by: Peng Fan <peng.fan@nxp.com>
    Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

crypto: ccp - Add missing bootloader info reg for pspv6 [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Mon May 19 10:21:01 2025 -0500

    crypto: ccp - Add missing bootloader info reg for pspv6
    
    [ Upstream commit aaeff14688d0254b39731d9bb303c79bfd610f7d ]
    
    The bootloader info reg for pspv6 is the same as pspv4 and pspv5.
    
    Suggested-by: Tom Lendacky <thomas.lendacky@amd.com>
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

crypto: hisilicon/hpre - fix dma unmap sequence [+ + +]

Author: Zhiqi Song <songzhiqi1@huawei.com>
Date:   Fri Jul 18 18:05:01 2025 +0800

    crypto: hisilicon/hpre - fix dma unmap sequence
    
    [ Upstream commit 982fd1a74de63c388c060e4fa6f7fbd088d6d02e ]
    
    Perform DMA unmapping operations before processing data.
    Otherwise, there may be unsynchronized data accessed by
    the CPU when the SWIOTLB is enabled.
    
    Signed-off-by: Zhiqi Song <songzhiqi1@huawei.com>
    Signed-off-by: Chenghai Huang <huangchenghai2@huawei.com>
    Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

crypto: jitter - fix intermediary handling [+ + +]

Author: Markus Theil <theil.markus@gmail.com>
Date:   Sat Jun 21 13:36:43 2025 +0200

    crypto: jitter - fix intermediary handling
    
    [ Upstream commit 735b72568c73875269a6b73ab9543a70f6ac8a9f ]
    
    The intermediary value was included in the wrong
    hash state. While there, adapt to user-space by
    setting the timestamp to 0 if stuck and inserting
    the values nevertheless.
    
    Acked-by: Stephan Mueller <smueller@chronox.de>
    Signed-off-by: Markus Theil <theil.markus@gmail.com>
    Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

crypto: octeontx2 - add timeout for load_fvc completion poll [+ + +]

Author: Bharat Bhushan <bbhushan2@marvell.com>
Date:   Thu May 22 15:36:24 2025 +0530

    crypto: octeontx2 - add timeout for load_fvc completion poll
    
    [ Upstream commit 2157e50f65d2030f07ea27ef7ac4cfba772e98ac ]
    
    Adds timeout to exit from possible infinite loop, which polls
    on CPT instruction(load_fvc) completion.
    
    Signed-off-by: Srujana Challa <schalla@marvell.com>
    Signed-off-by: Bharat Bhushan <bbhushan2@marvell.com>
    Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

dm-mpath: don't print the "loaded" message if registering fails [+ + +]

Author: Mikulas Patocka <mpatocka@redhat.com>
Date:   Mon Jun 30 15:24:22 2025 +0200

    dm-mpath: don't print the "loaded" message if registering fails
    
    [ Upstream commit 6e11952a6abc4641dc8ae63f01b318b31b44e8db ]
    
    If dm_register_path_selector, don't print the "version X loaded" message.
    
    Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

dm-stripe: limit chunk_sectors to the stripe size [+ + +]

Author: John Garry <john.g.garry@oracle.com>
Date:   Fri Jul 11 10:52:57 2025 +0000

    dm-stripe: limit chunk_sectors to the stripe size
    
    [ Upstream commit 5fb9d4341b782a80eefa0dc1664d131ac3c8885d ]
    
    Same as done for raid0, set chunk_sectors limit to appropriately set the
    atomic write size limit.
    
    Setting chunk_sectors limit in this way overrides the stacked limit
    already calculated based on the bottom device limits. This is ok, as
    when any bios are sent to the bottom devices, the block layer will still
    respect the bottom device chunk_sectors.
    
    Reviewed-by: Nilay Shroff <nilay@linux.ibm.com>
    Reviewed-by: Mikulas Patocka <mpatocka@redhat.com>
    Signed-off-by: John Garry <john.g.garry@oracle.com>
    Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com>
    Link: https://lore.kernel.org/r/20250711105258.3135198-6-john.g.garry@oracle.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

dm-table: fix checking for rq stackable devices [+ + +]

Author: Benjamin Marzinski <bmarzins@redhat.com>
Date:   Fri Jun 13 19:08:52 2025 -0400

    dm-table: fix checking for rq stackable devices
    
    [ Upstream commit 8ca719b81987be690f197e82fdb030580c0a07f3 ]
    
    Due to the semantics of iterate_devices(), the current code allows a
    request-based dm table as long as it includes one request-stackable
    device. It is supposed to only allow tables where there are no
    non-request-stackable devices.
    
    Signed-off-by: Benjamin Marzinski <bmarzins@redhat.com>
    Reviewed-by: Mike Snitzer <snitzer@kernel.org>
    Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

dm: Always split write BIOs to zoned device limits [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Wed Jun 25 18:33:25 2025 +0900

    dm: Always split write BIOs to zoned device limits
    
    commit 2df7168717b7d2d32bcf017c68be16e4aae9dd13 upstream.
    
    Any zoned DM target that requires zone append emulation will use the
    block layer zone write plugging. In such case, DM target drivers must
    not split BIOs using dm_accept_partial_bio() as doing so can potentially
    lead to deadlocks with queue freeze operations. Regular write operations
    used to emulate zone append operations also cannot be split by the
    target driver as that would result in an invalid writen sector value
    return using the BIO sector.
    
    In order for zoned DM target drivers to avoid such incorrect BIO
    splitting, we must ensure that large BIOs are split before being passed
    to the map() function of the target, thus guaranteeing that the
    limits for the mapped device are not exceeded.
    
    dm-crypt and dm-flakey are the only target drivers supporting zoned
    devices and using dm_accept_partial_bio().
    
    In the case of dm-crypt, this function is used to split BIOs to the
    internal max_write_size limit (which will be suppressed in a different
    patch). However, since crypt_alloc_buffer() uses a bioset allowing only
    up to BIO_MAX_VECS (256) vectors in a BIO. The dm-crypt device
    max_segments limit, which is not set and so default to BLK_MAX_SEGMENTS
    (128), must thus be respected and write BIOs split accordingly.
    
    In the case of dm-flakey, since zone append emulation is not required,
    the block layer zone write plugging is not used and no splitting of BIOs
    required.
    
    Modify the function dm_zone_bio_needs_split() to use the block layer
    helper function bio_needs_zone_write_plugging() to force a call to
    bio_split_to_limits() in dm_split_and_process_bio(). This allows DM
    target drivers to avoid using dm_accept_partial_bio() for write
    operations on zoned DM devices.
    
    Fixes: f211268ed1f9 ("dm: Use the block layer zone append emulation")
    Cc: stable@vger.kernel.org
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Mikulas Patocka <mpatocka@redhat.com>
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Link: https://lore.kernel.org/r/20250625093327.548866-4-dlemoal@kernel.org
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dm: split write BIOs on zone boundaries when zone append is not emulated [+ + +]

Author: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com>
Date:   Thu Jul 17 19:35:39 2025 +0900

    dm: split write BIOs on zone boundaries when zone append is not emulated
    
    commit 675f940576351bb049f5677615140b9d0a7712d0 upstream.
    
    Commit 2df7168717b7 ("dm: Always split write BIOs to zoned device
    limits") updates the device-mapper driver to perform splits for the
    write BIOs. However, it did not address the cases where DM targets do
    not emulate zone append, such as in the cases of dm-linear or dm-flakey.
    For these targets, when the write BIOs span across zone boundaries, they
    trigger WARN_ON_ONCE(bio_straddles_zones(bio)) in
    blk_zone_wplug_handle_write(). This results in I/O errors. The errors
    are reproduced by running blktests test case zbd/004 using zoned
    dm-linear or dm-flakey devices.
    
    To avoid the I/O errors, handle the write BIOs regardless whether DM
    targets emulate zone append or not, so that all write BIOs are split at
    zone boundaries. For that purpose, drop the check for zone append
    emulation in dm_zone_bio_needs_split(). Its argument 'md' is no longer
    used then drop it also.
    
    Fixes: 2df7168717b7 ("dm: Always split write BIOs to zoned device limits")
    Signed-off-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com>
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Mikulas Patocka <mpatocka@redhat.com>
    Link: https://lore.kernel.org/r/20250717103539.37279-1-shinichiro.kawasaki@wdc.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dmaengine: stm32-dma: configure next sg only if there are more than 2 sgs [+ + +]

Author: Amelie Delaunay <amelie.delaunay@foss.st.com>
Date:   Tue Jun 24 09:31:37 2025 +0200

    dmaengine: stm32-dma: configure next sg only if there are more than 2 sgs
    
    [ Upstream commit e19bdbaa31082b43dab1d936e20efcebc30aa73d ]
    
    DMA operates in Double Buffer Mode (DBM) when the transfer is cyclic and
    there are at least two periods.
    When DBM is enabled, the DMA toggles between two memory targets (SxM0AR and
    SxM1AR), indicated by the SxSCR.CT bit (Current Target).
    There is no need to update the next memory address if two periods are
    configured, as SxM0AR and SxM1AR are already properly set up before the
    transfer begins in the stm32_dma_start_transfer() function.
    This avoids unnecessary updates to SxM0AR/SxM1AR, thereby preventing
    potential Transfer Errors. Specifically, when the channel is enabled,
    SxM0AR and SxM1AR can only be written if SxSCR.CT=1 and SxSCR.CT=0,
    respectively. Otherwise, a Transfer Error interrupt is triggered, and the
    stream is automatically disabled.
    
    Signed-off-by: Amelie Delaunay <amelie.delaunay@foss.st.com>
    Link: https://lore.kernel.org/r/20250624-stm32_dma_dbm_fix-v1-1-337c40d6c93e@foss.st.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Documentation: ACPI: Fix parent device references [+ + +]

Author: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
Date:   Thu Jul 10 20:00:23 2025 +0300

    Documentation: ACPI: Fix parent device references
    
    commit e65cb011349e653ded541dddd6469c2ca813edcf upstream.
    
    The _CRS resources in many cases want to have ResourceSource field
    to be a type of ACPI String. This means that to compile properly
    we need to enclosure the name path into double quotes. This will
    in practice defer the interpretation to a run-time stage, However,
    this may be interpreted differently on different OSes and ACPI
    interpreter implementations. In particular ACPICA might not correctly
    recognize the leading '^' (caret) character and will not resolve
    the relative name path properly. On top of that, this piece may be
    used in SSDTs which are loaded after the DSDT and on itself may also
    not resolve relative name paths outside of their own scopes.
    With this all said, fix documentation to use fully-qualified name
    paths always to avoid any misinterpretations, which is proven to
    work.
    
    Fixes: 8eb5c87a92c0 ("i2c: add ACPI support for I2C mux ports")
    Reported-by: Yevhen Kondrashyn <e.kondrashyn@gmail.com>
    Cc: All applicable <stable@vger.kernel.org>
    Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/20250710170225.961303-1-andriy.shevchenko@linux.intel.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dpaa_eth: don't use fixed_phy_change_carrier [+ + +]

Author: Heiner Kallweit <hkallweit1@gmail.com>
Date:   Mon Jun 16 23:24:05 2025 +0200

    dpaa_eth: don't use fixed_phy_change_carrier
    
    [ Upstream commit d8155c1df5c8b717052567b188455d41fa7a8908 ]
    
    This effectively reverts 6e8b0ff1ba4c ("dpaa_eth: Add change_carrier()
    for Fixed PHYs"). Usage of fixed_phy_change_carrier() requires that
    fixed_phy_register() has been called before, directly or indirectly.
    And that's not the case in this driver.
    
    Signed-off-by: Heiner Kallweit <hkallweit1@gmail.com>
    Reviewed-by: Jacob Keller <jacob.e.keller@intel.com>
    Link: https://patch.msgid.link/7eb189b3-d5fd-4be6-8517-a66671a4e4e3@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drbd: add missing kref_get in handle_write_conflicts [+ + +]

Author: Sarah Newman <srn@prgmr.com>
Date:   Fri Jun 27 11:57:28 2025 +0200

    drbd: add missing kref_get in handle_write_conflicts
    
    [ Upstream commit 00c9c9628b49e368d140cfa61d7df9b8922ec2a8 ]
    
    With `two-primaries` enabled, DRBD tries to detect "concurrent" writes
    and handle write conflicts, so that even if you write to the same sector
    simultaneously on both nodes, they end up with the identical data once
    the writes are completed.
    
    In handling "superseeded" writes, we forgot a kref_get,
    resulting in a premature drbd_destroy_device and use after free,
    and further to kernel crashes with symptoms.
    
    Relevance: No one should use DRBD as a random data generator, and apparently
    all users of "two-primaries" handle concurrent writes correctly on layer up.
    That is cluster file systems use some distributed lock manager,
    and live migration in virtualization environments stops writes on one node
    before starting writes on the other node.
    
    Which means that other than for "test cases",
    this code path is never taken in real life.
    
    FYI, in DRBD 9, things are handled differently nowadays.  We still detect
    "write conflicts", but no longer try to be smart about them.
    We decided to disconnect hard instead: upper layers must not submit concurrent
    writes. If they do, that's their fault.
    
    Signed-off-by: Sarah Newman <srn@prgmr.com>
    Signed-off-by: Lars Ellenberg <lars@linbit.com>
    Signed-off-by: Christoph Böhmwalder <christoph.boehmwalder@linbit.com>
    Link: https://lore.kernel.org/r/20250627095728.800688-1-christoph.boehmwalder@linbit.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: add null check [+ + +]

Author: Peichen Huang <PeiChen.Huang@amd.com>
Date:   Thu Jun 12 16:06:41 2025 +0800

    drm/amd/display: add null check
    
    [ Upstream commit 158b9201c17fc93ed4253c2f03b77fd2671669a1 ]
    
    [WHY]
    Prevents null pointer dereferences to enhance function robustness
    
    [HOW]
    Adds early null check and return false if invalid.
    
    Reviewed-by: Cruise Hung <cruise.hung@amd.com>
    Signed-off-by: Peichen Huang <PeiChen.Huang@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Allow DCN301 to clear update flags [+ + +]

Author: Ivan Lipski <ivan.lipski@amd.com>
Date:   Thu Jul 17 13:58:35 2025 -0400

    drm/amd/display: Allow DCN301 to clear update flags
    
    commit 2d418e4fd9f1eca7dfce80de86dd702d36a06a25 upstream.
    
    [Why & How]
    Not letting DCN301 to clear after surface/stream update results
    in artifacts when switching between active overlay planes. The issue
    is known and has been solved initially. See below:
    (https://gitlab.freedesktop.org/drm/amd/-/issues/3441)
    
    Fixes: f354556e29f4 ("drm/amd/display: limit clear_update_flags t dcn32 and above")
    Reviewed-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Avoid configuring PSR granularity if PSR-SU not supported [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Sun Jul 6 08:38:05 2025 -0500

    drm/amd/display: Avoid configuring PSR granularity if PSR-SU not supported
    
    [ Upstream commit a5ce8695d6d1b40d6960d2d298b579042c158f25 ]
    
    [Why]
    If PSR-SU is disabled on the link, then configuring su_y granularity in
    mod_power_calc_psr_configs() can lead to assertions in
    psr_su_set_dsc_slice_height().
    
    [How]
    Check the PSR version in amdgpu_dm_link_setup_psr() to determine whether
    or not to configure granularity.
    
    Reviewed-by: Sun peng (Leo) Li <sunpeng.li@amd.com>
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Avoid trying AUX transactions on disconnected ports [+ + +]

Author: Wayne Lin <Wayne.Lin@amd.com>
Date:   Tue May 13 16:06:50 2025 +0800

    drm/amd/display: Avoid trying AUX transactions on disconnected ports
    
    [ Upstream commit deb24e64c8881c462b29e2c69afd9e6669058be5 ]
    
    [Why & How]
    Observe that we try to access DPCD 0x600h of disconnected DP ports.
    In order not to wasting time on retrying these ports, call
    dpcd_write_rx_power_ctrl() after checking its connection status.
    
    Reviewed-by: Aurabindo Pillai <aurabindo.pillai@amd.com>
    Signed-off-by: Wayne Lin <Wayne.Lin@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Disable dsc_power_gate for dcn314 by default [+ + +]

Author: Roman Li <Roman.Li@amd.com>
Date:   Mon Jul 14 14:37:33 2025 -0400

    drm/amd/display: Disable dsc_power_gate for dcn314 by default
    
    [ Upstream commit 02f3ec53177243d32ee8b6f8ba99136d7887ee3a ]
    
    [Why]
    "REG_WAIT timeout 1us * 1000 tries - dcn314_dsc_pg_control line"
    warnings seen after resuming from s2idle.
    DCN314 has issues with DSC power gating that cause REG_WAIT timeouts
    when attempting to power down DSC blocks.
    
    [How]
    Disable dsc_power_gate for dcn314 by default.
    
    Reviewed-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
    Signed-off-by: Roman Li <Roman.Li@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Fix 'failed to blank crtc!' [+ + +]

Author: Wen Chen <Wen.Chen3@amd.com>
Date:   Mon Jun 2 16:37:08 2025 -0400

    drm/amd/display: Fix 'failed to blank crtc!'
    
    [ Upstream commit 01f60348d8fb6b3fbcdfc7bdde5d669f95b009a4 ]
    
    [why]
    DCN35 is having “DC: failed to blank crtc!” when running HPO
    test cases. It's caused by not having sufficient udelay time.
    
    [how]
    Replace the old wait_for_blank_complete function with fsleep function to
    sleep just until the next frame should come up. This way it doesn't poll
    in case the pixel clock or other clock was bugged or until vactive and
    the vblank are hit again.
    
    Reviewed-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
    Signed-off-by: Wen Chen <Wen.Chen3@amd.com>
    Signed-off-by: Fangzhi Zuo <jerry.zuo@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Initialize mode_select to 0 [+ + +]

Author: Alex Hung <alex.hung@amd.com>
Date:   Tue Jun 10 15:40:18 2025 -0600

    drm/amd/display: Initialize mode_select to 0
    
    [ Upstream commit 592ddac93f8c02e13f19175745465f8c4d0f56cd ]
    
    [WHAT]
    mode_select was supposed to be initialized in mpc_read_gamut_remap but
    is not set in default case. This can cause indeterminate
    behaviors.
    
    This is reported as an UNINIT error by Coverity.
    
    Reviewed-by: Aurabindo Pillai <aurabindo.pillai@amd.com>
    Signed-off-by: Alex Hung <alex.hung@amd.com>
    Signed-off-by: Fangzhi Zuo <jerry.zuo@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: limit clear_update_flags to dcn32 and above [+ + +]

Author: Charlene Liu <Charlene.Liu@amd.com>
Date:   Thu Jun 26 16:36:17 2025 -0400

    drm/amd/display: limit clear_update_flags to dcn32 and above
    
    [ Upstream commit f354556e29f40ef44fa8b13dc914817db3537e20 ]
    
    [why]
    dc has some code out of sync:
    dc_commit_updates_for_stream handles v1/v2/v3,
    but dc_update_planes_and_stream makes v1 asic to use v2.
    
    as a reression fix: limit clear_update_flags to dcn32 or newer asic.
    need to follow up that v1 asic using v2 issue.
    
    Reviewed-by: Syed Hassan <syed.hassan@amd.com>
    Signed-off-by: Charlene Liu <Charlene.Liu@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Only finalize atomic_obj if it was initialized [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Tue Jul 15 14:41:46 2025 -0500

    drm/amd/display: Only finalize atomic_obj if it was initialized
    
    [ Upstream commit b174084b3fe15ad1acc69530e673c1535d2e4f85 ]
    
    [Why]
    If amdgpu_dm failed to initalize before amdgpu_dm_initialize_drm_device()
    completed then freeing atomic_obj will lead to list corruption.
    
    [How]
    Check if atomic_obj state is initialized before trying to free.
    
    Reviewed-by: Harry Wentland <harry.wentland@amd.com>
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Separate set_gsl from set_gsl_source_select [+ + +]

Author: Ilya Bakoulin <Ilya.Bakoulin@amd.com>
Date:   Wed Jun 18 13:07:14 2025 -0400

    drm/amd/display: Separate set_gsl from set_gsl_source_select
    
    [ Upstream commit 660a467a5e7366cd6642de61f1aaeaf0d253ee68 ]
    
    [Why/How]
    Separate the checks for set_gsl and set_gsl_source_select, since
    source_select may not be implemented/necessary.
    
    Reviewed-by: Nevenko Stupar <nevenko.stupar@amd.com>
    Signed-off-by: Ilya Bakoulin <Ilya.Bakoulin@amd.com>
    Signed-off-by: Ray Wu <ray.wu@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Stop storing failures into adev->dm.cached_state [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Sun Jun 1 20:44:31 2025 -0500

    drm/amd/display: Stop storing failures into adev->dm.cached_state
    
    [ Upstream commit 709a37ab9c63297da2194dc36f604537f9d2d417 ]
    
    If drm_atomic_helper_suspend() has failed for any reason, it's stored
    in adev->dm.cached_state.  This isn't expected because the resume
    (or complete()) sequence will attempt to use the stored state to
    resume.
    
    Reviewed-by: Alex Hung <alex.hung@amd.com>
    Link: https://lore.kernel.org/r/20250602014432.3538345-3-superm1@kernel.org
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/display: Update DMCUB loading sequence for DCN3.5 [+ + +]

Author: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
Date:   Fri May 9 11:18:26 2025 -0400

    drm/amd/display: Update DMCUB loading sequence for DCN3.5
    
    [ Upstream commit d42b2331e158fa6bcdc89e4c8c470dc5da20be1f ]
    
    [Why]
    New sequence from HW for reset and firmware reloading has been
    provided that aims to stabilize the reload sequence in the case the
    firmware is hung or has outstanding requests.
    
    [How]
    Update the sequence to remove the DMUIF reset and the redundant
    writes in the release.
    
    Reviewed-by: Ovidiu Bunea <ovidiu.bunea@amd.com>
    Signed-off-by: Nicholas Kazlauskas <nicholas.kazlauskas@amd.com>
    Signed-off-by: Wayne Lin <wayne.lin@amd.com>
    Tested-by: Daniel Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/pm: fix null pointer access [+ + +]

Author: Umio Yasuno <coelacanth_dream@protonmail.com>
Date:   Tue Jul 15 14:44:35 2025 +0000

    drm/amd/pm: fix null pointer access
    
    [ Upstream commit d524d40e3a6152a3ea1125af729f8cd8ca65efde ]
    
    Writing a string without delimiters (' ', '\n', '\0') to the under
    gpu_od/fan_ctrl sysfs or pp_power_profile_mode for the CUSTOM profile
    will result in a null pointer dereference.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4401
    Signed-off-by: Umio Yasuno <coelacanth_dream@protonmail.com>
    Acked-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd/pm: Use pointer type for typecheck() [+ + +]

Author: Lijo Lazar <lijo.lazar@amd.com>
Date:   Thu Jun 12 17:43:58 2025 +0530

    drm/amd/pm: Use pointer type for typecheck()
    
    [ Upstream commit b49e3d7ca71aaf1e3412d41522a11a56563799b5 ]
    
    typecheck creates local variables based on the type passed. That could
    result in stack frame size warnings like below in certain configs:
    
    drivers/gpu/drm/amd/amdgpu/../pm/swsmu/smu13/smu_v13_0_6_ppt.c:2885:1: error: the frame size of 8304 bytes is larger than 2048 bytes [-Werror=frame-larger-than=]
    
    Checking against the pointer type is sufficient for the purpose of
    getting a diagnostic message during build time.
    
    Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
    Reviewed-by: Palmer Dabbelt <palmer@dabbelt.com>
    Tested-by: Palmer Dabbelt <palmer@dabbelt.com>
    Link: https://lore.kernel.org/r/20250610212141.19445-1-palmer@dabbelt.com
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amd: Allow printing VanGogh OD SCLK levels without setting dpm to manual [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Sun Jun 8 22:12:26 2025 -0500

    drm/amd: Allow printing VanGogh OD SCLK levels without setting dpm to manual
    
    [ Upstream commit 2d1ec1e955414e8e8358178011c35afca1a1c0b1 ]
    
    Several other ASICs allow printing OD SCLK levels without setting DPM
    control to manual.  When OD is disabled it will show the range the
    hardware supports. When OD is enabled it will show what values have
    been programmed. Adjust VanGogh to work the same.
    
    Cc: Pierre-Loup A. Griffais <pgriffais@valvesoftware.com>
    Reported-by: Vicki Pfau <vi@endrift.com>
    Reviewed-by: Alex Deucher <alexander.deucher@amd.com>
    Link: https://lore.kernel.org/r/20250609031227.479079-1-superm1@kernel.org
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: Add more checks to PSP mailbox [+ + +]

Author: Lijo Lazar <lijo.lazar@amd.com>
Date:   Mon Jun 2 12:55:14 2025 +0530

    drm/amdgpu: Add more checks to PSP mailbox
    
    [ Upstream commit 8345a71fc54b28e4d13a759c45ce2664d8540d28 ]
    
    Instead of checking the response flag, use status mask also to check
    against any unexpected failures like a device drop. Also, log error if
    waiting on a psp response fails/times out.
    
    Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
    Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: clear pa and mca record counter when resetting eeprom [+ + +]

Author: ganglxie <ganglxie@amd.com>
Date:   Thu Jun 5 14:00:28 2025 +0800

    drm/amdgpu: clear pa and mca record counter when resetting eeprom
    
    [ Upstream commit d0cc8d2b7df1848f98f0fea8135ba706814b1d13 ]
    
    clear pa and mca record counter when resetting eeprom, so that
    ras_num_bad_pages can be calculated correctly
    
    Signed-off-by: ganglxie <ganglxie@amd.com>
    Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: fix incorrect vm flags to map bo [+ + +]

Author: Jack Xiao <Jack.Xiao@amd.com>
Date:   Mon Aug 11 15:20:55 2025 +0800

    drm/amdgpu: fix incorrect vm flags to map bo
    
    [ Upstream commit 040bc6d0e0e9c814c9c663f6f1544ebaff6824a8 ]
    
    It should use vm flags instead of pte flags
    to specify bo vm attributes.
    
    Fixes: 7946340fa389 ("drm/amdgpu: Move csa related code to separate file")
    Signed-off-by: Jack Xiao <Jack.Xiao@amd.com>
    Reviewed-by: Likun Gao <Likun.Gao@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit b08425fa77ad2f305fe57a33dceb456be03b653f)
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: fix vram reservation issue [+ + +]

Author: YiPeng Chai <YiPeng.Chai@amd.com>
Date:   Tue Aug 12 09:17:58 2025 +0800

    drm/amdgpu: fix vram reservation issue
    
    [ Upstream commit 10ef476aad1c848449934e7bec2ab2374333c7b6 ]
    
    The vram block allocation flag must be cleared
    before making vram reservation, otherwise reserving
    addresses within the currently freed memory range
    will always fail.
    
    Fixes: c9cad937c0c5 ("drm/amdgpu: add drm buddy support to amdgpu")
    Signed-off-by: YiPeng Chai <YiPeng.Chai@amd.com>
    Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit d38eaf27de1b8584f42d6fb3f717b7ec44b3a7a1)
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: Suspend IH during mode-2 reset [+ + +]

Author: Lijo Lazar <lijo.lazar@amd.com>
Date:   Tue Jun 3 11:43:36 2025 +0530

    drm/amdgpu: Suspend IH during mode-2 reset
    
    [ Upstream commit 3f1e81ecb61923934bd11c3f5c1e10893574e607 ]
    
    On multi-aid SOCs, there could be a continuous stream of interrupts from
    GC after poison consumption. Suspend IH to disable them before doing
    mode-2 reset. This avoids conflicts in hardware accesses during
    interrupt handlers while a reset is ongoing.
    
    Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
    Reviewed-by: Asad Kamal <asad.kamal@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu: Use correct severity for BP threshold exceed event [+ + +]

Author: Xiang Liu <xiang.liu@amd.com>
Date:   Fri Jun 27 23:14:08 2025 +0800

    drm/amdgpu: Use correct severity for BP threshold exceed event
    
    [ Upstream commit 4a33ca3f6ee9a013a423a867426704e9c9d785bd ]
    
    The severity of CPER for BP threshold exceed event should be set as
    CPER_SEV_FATAL to match the OOB implementation.
    
    Signed-off-by: Xiang Liu <xiang.liu@amd.com>
    Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/fbdev-client: Skip DRM clients if modesetting is absent [+ + +]

Author: Thierry Reding <treding@nvidia.com>
Date:   Fri Jun 13 14:28:38 2025 +0200

    drm/fbdev-client: Skip DRM clients if modesetting is absent
    
    [ Upstream commit cce91f29c088ba902dd2abfc9c3216ba9a2fb2fe ]
    
    Recent generations of Tegra have moved the display components outside of
    host1x, leading to a device that has no CRTCs attached and hence doesn't
    support any of the modesetting functionality. When this is detected, the
    driver clears the DRIVER_MODESET and DRIVER_ATOMIC flags for the device.
    
    Unfortunately, this causes the following errors during boot:
    
        [      15.418958] ERR KERN drm drm: [drm] *ERROR* Failed to register client: -95
        [      15.425311] WARNING KERN drm drm: [drm] Failed to set up DRM client; error -95
    
    These originate from the fbdev client checking for the presence of the
    DRIVER_MODESET flag and returning -EOPNOTSUPP. However, if a driver does
    not support DRIVER_MODESET this is entirely expected and the error isn't
    helpful.
    
    Prevent this misleading error message by setting up the DRM clients only
    if modesetting is enabled.
    
    Changes in v2:
    - use DRIVER_MODESET check to avoid registering any clients
    
    Reported-by: Jonathan Hunter <jonathanh@nvidia.com>
    Signed-off-by: Thierry Reding <treding@nvidia.com>
    Acked-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Reviewed-by: Thomas Zimmermann <tzimmermann@suse.de>
    Link: https://lore.kernel.org/r/20250613122838.2082334-1-thierry.reding@gmail.com
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/i915/fbc: fix the implementation of wa_18038517565 [+ + +]

Author: Vinod Govindapillai <vinod.govindapillai@intel.com>
Date:   Tue Jul 29 15:46:48 2025 +0300

    drm/i915/fbc: fix the implementation of wa_18038517565
    
    [ Upstream commit fd56b9c9507f32b16159f9a922e1af5628254567 ]
    
    As per the wa_18038517565, we need to disable FBC compressor
    clock gating before enabling FBC and enable after disabling
    FBC. Placing the enabling of clock gating in the fbc deactivate
    function can make the above wa logic go wrong in case of
    frontbuffer rendering FBC mechanism. FBC deactivate can get
    called during fb invalidate and then the corresponding FBC
    activate can get called without properly disabling the clock
    gating and can result in compression stalled. So move the
    enable clock gating at the end of one FBC session after FBC
    is completely disabled for a pipe.
    
    Bspec: 74212, 72197, 69741, 65555
    Fixes: 010363c46189 ("drm/i915/display: implement wa_18038517565")
    Signed-off-by: Vinod Govindapillai <vinod.govindapillai@intel.com>
    Reviewed-by: Jouni Högander <jouni.hogander@intel.com>
    Link: https://lore.kernel.org/r/20250729124648.288497-1-vinod.govindapillai@intel.com
    (cherry picked from commit 82dde0407ab126f8413fd6c51429e5057ced5ba2)
    Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/i915/psr: Do not trigger Frame Change events from frontbuffer flush [+ + +]

Author: Jouni Högander <jouni.hogander@intel.com>
Date:   Fri Aug 1 09:29:05 2025 +0300

    drm/i915/psr: Do not trigger Frame Change events from frontbuffer flush
    
    [ Upstream commit 184889dfe0568528fd6d14bba864dd57ed45bbf2 ]
    
    We want to get rid of triggering "Frame Change" events from
    frontbuffer flush calls. We are about to move using TRANS_PUSH
    register for this on LunarLake and onwards. Touching TRANS_PUSH
    register from fronbuffer flush would be problematic as it's written by
    DSB as well.
    
    Fix this by using intel_psr_exit when flush or invalidate is done on
    LunarLake and onwards. This is not possible on AlderLake and
    MeteorLake due to HW bug in PSR2 disable.
    
    This patch is also fixing problems with cursor plane where cursor is
    disappearing or duplicate cursor is seen on the screen.
    
    v2: Commit message updated
    
    Bspec: 68927, 68934, 66624
    Reported-by: Janna Martl <janna.martl109@gmail.com>
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/5522
    Fixes: 411ad63877bb ("drm/i915/psr: Use SFF_CTL on invalidate/flush for LunarLake onwards")
    Tested-by: Janna Martl <janna.martl109@gmail.com>
    Signed-off-by: Jouni Högander <jouni.hogander@intel.com>
    Reviewed-by: Suraj Kandpal <suraj.kandpal@intel.com>
    Link: https://lore.kernel.org/r/20250801062905.564453-1-jouni.hogander@intel.com
    (cherry picked from commit 46fb38cb20c0d185a6391ab524b23e0e0219c41f)
    Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/imagination: Clear runtime PM errors while resetting the GPU [+ + +]

Author: Alessio Belle <alessio.belle@imgtec.com>
Date:   Tue Jun 24 16:01:31 2025 +0100

    drm/imagination: Clear runtime PM errors while resetting the GPU
    
    [ Upstream commit 551507e0d0bf32ce1d7d27533c4b98307380804c ]
    
    The runtime PM might be left in error state if one of the callbacks
    returned an error, e.g. if the (auto)suspend callback failed following
    a firmware crash.
    
    When that happens, any further attempt to acquire or release a power
    reference will then also fail, making it impossible to do anything else
    with the GPU. The driver logic will eventually reach the reset code.
    
    In pvr_power_reset(), replace pvr_power_get() with a new API
    pvr_power_get_clear() which also attempts to clear any runtime PM error
    state if acquiring a power reference is not possible.
    
    Signed-off-by: Alessio Belle <alessio.belle@imgtec.com>
    Reviewed-by: Matt Coster <matt.coster@imgtec.com>
    Link: https://lore.kernel.org/r/20250624-clear-rpm-errors-gpu-reset-v1-1-b8ff2ae55aac@imgtec.com
    Signed-off-by: Matt Coster <matt.coster@imgtec.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm: Add error handling for krealloc in metadata setup [+ + +]

Author: Yuan Chen <chenyuan@kylinos.cn>
Date:   Fri Jun 27 10:16:43 2025 +0800

    drm/msm: Add error handling for krealloc in metadata setup
    
    [ Upstream commit 1c8c354098ea9d4376a58c96ae6b65288a6f15d8 ]
    
    Function msm_ioctl_gem_info_set_metadata() now checks for krealloc
    failure and returns -ENOMEM, avoiding potential NULL pointer dereference.
    Explicitly avoids __GFP_NOFAIL due to deadlock risks and allocation constraints.
    
    Signed-off-by: Yuan Chen <chenyuan@kylinos.cn>
    Patchwork: https://patchwork.freedesktop.org/patch/661235/
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm: Update register xml [+ + +]

Author: Rob Clark <robin.clark@oss.qualcomm.com>
Date:   Thu Jul 3 10:51:19 2025 -0700

    drm/msm: Update register xml
    
    [ Upstream commit 6733d8276ac02a8790e571d2af4a69a9039d0522 ]
    
    Sync register xml from mesa commit eb3e0b7164a3 ("freedreno/a6xx: Split
    descriptors out into their own file").
    
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Acked-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/662470/
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm: use trylock for debugfs [+ + +]

Author: Rob Clark <robin.clark@oss.qualcomm.com>
Date:   Sun Jun 29 13:13:22 2025 -0700

    drm/msm: use trylock for debugfs
    
    [ Upstream commit 0a1ff88ec5b60b41ba830c5bf08b6cd8f45ab411 ]
    
    This resolves a potential deadlock vs msm_gem_vm_close().  Otherwise for
    _NO_SHARE buffers msm_gem_describe() could be trying to acquire the
    shared vm resv, while already holding priv->obj_lock.  But _vm_close()
    might drop the last reference to a GEM obj while already holding the vm
    resv, and msm_gem_free_object() needs to grab priv->obj_lock, a locking
    inversion.
    
    OTOH this is only for debugfs and it isn't critical if we undercount by
    skipping a locked obj.  So just use trylock() and move along if we can't
    get the lock.
    
    Signed-off-by: Rob Clark <robdclark@chromium.org>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Tested-by: Antonino Maniscalco <antomani103@gmail.com>
    Reviewed-by: Antonino Maniscalco <antomani103@gmail.com>
    Patchwork: https://patchwork.freedesktop.org/patch/661525/
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/panel: raydium-rm67200: Move initialization from enable() to prepare stage [+ + +]

Author: Andy Yan <andy.yan@rock-chips.com>
Date:   Wed Jun 18 17:15:20 2025 +0800

    drm/panel: raydium-rm67200: Move initialization from enable() to prepare stage
    
    [ Upstream commit 691674a282bdbf8f8bce4094369a2d1e4b5645e9 ]
    
    The DSI host has different modes in prepare() and enable() functions,
    prepare() is in LP command mode and enable() is in HS video mode.
    
    >From our experience, generally the initialization sequence needs to be
    sent in the LP command mode.
    
    Move the setup init function from enable() to prepare() to fix a display
    shift on rk3568 evb.
    
    Tested on rk3568/rk3576/rk3588 EVB.
    
    Signed-off-by: Andy Yan <andy.yan@rock-chips.com>
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org>
    Link: https://lore.kernel.org/r/20250618091520.691590-1-andyshrk@163.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/sched/tests: Add unit test for cancel_job() [+ + +]

Author: Philipp Stanner <phasta@kernel.org>
Date:   Thu Jul 10 14:54:08 2025 +0200

    drm/sched/tests: Add unit test for cancel_job()
    
    [ Upstream commit c2668a0e03501a26cedc82e32fe9f3f692d330db ]
    
    The scheduler unit tests now provide a new callback, cancel_job(). This
    callback gets used by drm_sched_fini() for all still pending jobs to
    cancel them.
    
    Implement a new unit test to test this.
    
    Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Signed-off-by: Philipp Stanner <phasta@kernel.org>
    Link: https://lore.kernel.org/r/20250710125412.128476-6-phasta@kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/sched: Avoid memory leaks with cancel_job() callback [+ + +]

Author: Philipp Stanner <phasta@kernel.org>
Date:   Thu Jul 10 14:54:06 2025 +0200

    drm/sched: Avoid memory leaks with cancel_job() callback
    
    [ Upstream commit bf8bbaefaa6ae0a07971ea57b3208df60e8ad0a4 ]
    
    Since its inception, the GPU scheduler can leak memory if the driver
    calls drm_sched_fini() while there are still jobs in flight.
    
    The simplest way to solve this in a backwards compatible manner is by
    adding a new callback, drm_sched_backend_ops.cancel_job(), which
    instructs the driver to signal the hardware fence associated with the
    job. Afterwards, the scheduler can safely use the established free_job()
    callback for freeing the job.
    
    Implement the new backend_ops callback cancel_job().
    
    Suggested-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Link: https://lore.kernel.org/dri-devel/20250418113211.69956-1-tvrtko.ursulin@igalia.com/
    Reviewed-by: Maíra Canal <mcanal@igalia.com>
    Acked-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Signed-off-by: Philipp Stanner <phasta@kernel.org>
    Link: https://lore.kernel.org/r/20250710125412.128476-4-phasta@kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/ttm: Respect the shrinker core free target [+ + +]

Author: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
Date:   Tue Jun 3 12:27:49 2025 +0100

    drm/ttm: Respect the shrinker core free target
    
    [ Upstream commit eac21f8ebeb4f84d703cf41dc3f81d16fa9dc00a ]
    
    Currently the TTM shrinker aborts shrinking as soon as it frees pages from
    any of the page order pools and by doing so it can fail to respect the
    freeing target which was configured by the shrinker core.
    
    We use the wording "can fail" because the number of freed pages will
    depend on the presence of pages in the pools and the order of the pools on
    the LRU list. For example if there are no free pages in the high order
    pools the shrinker core may require multiple passes over the TTM shrinker
    before it will free the default target of 128 pages (assuming there are
    free pages in the low order pools). This inefficiency can be compounded by
    the pool LRU where multiple further calls into the TTM shrinker are
    required to end up looking at the pool with pages.
    
    Improve this by never freeing less than the shrinker core has requested.
    
    At the same time we start reporting the number of scanned pages (freed in
    this case), which prevents the core shrinker from giving up on the TTM
    shrinker too soon and moving on.
    
    v2:
     * Simplify loop logic. (Christian)
     * Improve commit message.
    
    Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Cc: Christian König <christian.koenig@amd.com>
    Cc: Thomas Hellström <thomas.hellstrom@linux.intel.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>
    Link: https://lore.kernel.org/r/20250603112750.34997-2-tvrtko.ursulin@igalia.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/ttm: Should to return the evict error [+ + +]

Author: Emily Deng <Emily.Deng@amd.com>
Date:   Tue Jun 3 17:11:54 2025 +0800

    drm/ttm: Should to return the evict error
    
    [ Upstream commit 4e16a9a00239db5d819197b9a00f70665951bf50 ]
    
    For the evict fail case, the evict error should be returned.
    
    v2: Consider ENOENT case.
    
    v3: Abort directly when the eviction failed for some reason (except for -ENOENT)
     and not wait for the move to finish
    
    Signed-off-by: Emily Deng <Emily.Deng@amd.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Christian König <christian.koenig@amd.com>
    Link: https://lore.kernel.org/r/20250603091154.3472646-1-Emily.Deng@amd.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/hwmon: Add SW clamp for power limits writes [+ + +]

Author: Karthik Poosa <karthik.poosa@intel.com>
Date:   Sat Aug 9 00:23:10 2025 +0530

    drm/xe/hwmon: Add SW clamp for power limits writes
    
    [ Upstream commit 55d49f06162e45686399df4ae6292167f0deab7c ]
    
    Clamp writes to power limits powerX_crit/currX_crit, powerX_cap,
    powerX_max, to the maximum supported by the pcode mailbox
    when sysfs-provided values exceed this limit.
    Although the pcode already performs clamping, values beyond the pcode
    mailbox's supported range get truncated, leading to incorrect
    critical power settings.
    This patch ensures proper clamping to prevent such truncation.
    
    v2:
     - Address below review comments. (Riana)
     - Split comments into multiple sentences.
     - Use local variables for readability.
     - Add a debug log.
     - Use u64 instead of unsigned long.
    
    v3:
     - Change drm_dbg logs to drm_info. (Badal)
    
    v4:
     - Rephrase the drm_info log. (Rodrigo, Riana)
     - Rename variable max_mbx_power_limit to max_supp_power_limit, as
       limit is same for platforms with and without mailbox power limit
       support.
    
    Signed-off-by: Karthik Poosa <karthik.poosa@intel.com>
    Fixes: 92d44a422d0d ("drm/xe/hwmon: Expose card reactive critical power")
    Fixes: fb1b70607f73 ("drm/xe/hwmon: Expose power attributes")
    Reviewed-by: Riana Tauro <riana.tauro@intel.com>
    Link: https://lore.kernel.org/r/20250808185310.3466529-1-karthik.poosa@intel.com
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    (cherry picked from commit d301eb950da59f962bafe874cf5eb6d61a85b2c2)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/migrate: don't overflow max copy size [+ + +]

Author: Matthew Auld <matthew.auld@intel.com>
Date:   Thu Jul 31 10:38:10 2025 +0100

    drm/xe/migrate: don't overflow max copy size
    
    [ Upstream commit 4126cb327a2e3273c81fcef1c594c5b7b645c44c ]
    
    With non-page aligned copy, we need to use 4 byte aligned pitch, however
    the size itself might still be close to our maximum of ~8M, and so the
    dimensions of the copy can easily exceed the S16_MAX limit of the copy
    command leading to the following assert:
    
    xe 0000:03:00.0: [drm] Assertion `size / pitch <= ((s16)(((u16)~0U) >> 1))` failed!
    platform: BATTLEMAGE subplatform: 1
    graphics: Xe2_HPG 20.01 step A0
    media: Xe2_HPM 13.01 step A1
    tile: 0 VRAM 10.0 GiB
    GT: 0 type 1
    
    WARNING: CPU: 23 PID: 10605 at drivers/gpu/drm/xe/xe_migrate.c:673 emit_copy+0x4b5/0x4e0 [xe]
    
    To fix this account for the pitch when calculating the number of current
    bytes to copy.
    
    Fixes: 270172f64b11 ("drm/xe: Update xe_ttm_access_memory to use GPU for non-visible access")
    Signed-off-by: Matthew Auld <matthew.auld@intel.com>
    Cc: Maciej Patelczyk <maciej.patelczyk@intel.com>
    Cc: Matthew Brost <matthew.brost@intel.com>
    Reviewed-by: Stuart Summers <stuart.summers@intel.com>
    Link: https://lore.kernel.org/r/20250731093807.207572-7-matthew.auld@intel.com
    (cherry picked from commit 8c2d61e0e916e077fda7e7b8e67f25ffe0f361fc)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/migrate: prevent infinite recursion [+ + +]

Author: Matthew Auld <matthew.auld@intel.com>
Date:   Thu Jul 31 10:38:09 2025 +0100

    drm/xe/migrate: prevent infinite recursion
    
    [ Upstream commit 9d7a1cbebbb691891671def57407ba2f8ee914e8 ]
    
    If the buf + offset is not aligned to XE_CAHELINE_BYTES we fallback to
    using a bounce buffer. However the bounce buffer here is allocated on
    the stack, and the only alignment requirement here is that it's
    naturally aligned to u8, and not XE_CACHELINE_BYTES. If the bounce
    buffer is also misaligned we then recurse back into the function again,
    however the new bounce buffer might also not be aligned, and might never
    be until we eventually blow through the stack, as we keep recursing.
    
    Instead of using the stack use kmalloc, which should respect the
    power-of-two alignment request here. Fixes a kernel panic when
    triggering this path through eudebug.
    
    v2 (Stuart):
     - Add build bug check for power-of-two restriction
     - s/EINVAL/ENOMEM/
    
    Fixes: 270172f64b11 ("drm/xe: Update xe_ttm_access_memory to use GPU for non-visible access")
    Signed-off-by: Matthew Auld <matthew.auld@intel.com>
    Cc: Maciej Patelczyk <maciej.patelczyk@intel.com>
    Cc: Stuart Summers <stuart.summers@intel.com>
    Cc: Matthew Brost <matthew.brost@intel.com>
    Reviewed-by: Stuart Summers <stuart.summers@intel.com>
    Link: https://lore.kernel.org/r/20250731093807.207572-6-matthew.auld@intel.com
    (cherry picked from commit 38b34e928a08ba594c4bbf7118aa3aadacd62fff)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/migrate: prevent potential UAF [+ + +]

Author: Matthew Auld <matthew.auld@intel.com>
Date:   Thu Jul 31 10:38:11 2025 +0100

    drm/xe/migrate: prevent potential UAF
    
    [ Upstream commit 145832fbdd17b1d77ffd6cdd1642259e101d1b7e ]
    
    If we hit the error path, the previous fence (if there is one) has
    already been put() prior to this, so doing a fence_wait could lead to
    UAF. Tweak the flow to do to the put() until after we do the wait.
    
    Fixes: 270172f64b11 ("drm/xe: Update xe_ttm_access_memory to use GPU for non-visible access")
    Signed-off-by: Matthew Auld <matthew.auld@intel.com>
    Cc: Maciej Patelczyk <maciej.patelczyk@intel.com>
    Cc: Matthew Brost <matthew.brost@intel.com>
    Reviewed-by: Stuart Summers <stuart.summers@intel.com>
    Link: https://lore.kernel.org/r/20250731093807.207572-8-matthew.auld@intel.com
    (cherry picked from commit 9b7ca35ed28fe5fad86e9d9c24ebd1271e4c9c3e)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/xe_query: Use separate iterator while filling GT list [+ + +]

Author: Matt Roper <matthew.d.roper@intel.com>
Date:   Tue Jul 1 13:13:28 2025 -0700

    drm/xe/xe_query: Use separate iterator while filling GT list
    
    [ Upstream commit d4eb4a010262ea7801e576d1033b355910f2f7d4 ]
    
    The 'id' value updated by for_each_gt() is the uapi GT ID of the GTs
    being iterated over, and may skip over values if a GT is not present on
    the device.  Use a separate iterator for GT list array assignments to
    ensure that the array will be filled properly on future platforms where
    index in the GT query list may not match the uapi ID.
    
    v2:
     - Include the missing increment of the iterator.  (Jonathan)
    
    Cc: Jonathan Cavitt <jonathan.cavitt@intel.com>
    Reviewed-by: Jonathan Cavitt <jonathan.cavitt@intel.com>
    Link: https://lore.kernel.org/r/20250701201320.2514369-16-matthew.d.roper@intel.com
    Signed-off-by: Matt Roper <matthew.d.roper@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe: Make dma-fences compliant with the safe access rules [+ + +]

Author: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
Date:   Tue Jun 10 17:42:26 2025 +0100

    drm/xe: Make dma-fences compliant with the safe access rules
    
    [ Upstream commit 6bd90e700b4285e6a7541e00f969cab0d696adde ]
    
    Xe can free some of the data pointed to by the dma-fences it exports. Most
    notably the timeline name can get freed if userspace closes the associated
    submit queue. At the same time the fence could have been exported to a
    third party (for example a sync_fence fd) which will then cause an use-
    after-free on subsequent access.
    
    To make this safe we need to make the driver compliant with the newly
    documented dma-fence rules. Driver has to ensure a RCU grace period
    between signalling a fence and freeing any data pointed to by said fence.
    
    For the timeline name we simply make the queue be freed via kfree_rcu and
    for the shared lock associated with multiple queues we add a RCU grace
    period before freeing the per GT structure holding the lock.
    
    Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com>
    Reviewed-by: Matthew Brost <matthew.brost@intel.com>
    Acked-by: Lucas De Marchi <lucas.demarchi@intel.com>
    Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>
    Link: https://lore.kernel.org/r/20250610164226.10817-5-tvrtko.ursulin@igalia.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm: renesas: rz-du: mipi_dsi: Add min check for VCLK range [+ + +]

Author: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com>
Date:   Mon Jun 9 23:56:22 2025 +0100

    drm: renesas: rz-du: mipi_dsi: Add min check for VCLK range
    
    [ Upstream commit e37a95d01d5acce211da8446fefbd8684c67f516 ]
    
    The VCLK range for Renesas RZ/G2L SoC is 5.803 MHz to 148.5 MHz. Add a
    minimum clock check in the mode_valid callback to ensure that the clock
    value does not fall below the valid range.
    
    Co-developed-by: Fabrizio Castro <fabrizio.castro.jz@renesas.com>
    Signed-off-by: Fabrizio Castro <fabrizio.castro.jz@renesas.com>
    Signed-off-by: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com>
    Reviewed-by: Biju Das <biju.das.jz@bp.renesas.com>
    Reviewed-by: Laurent Pinchart <laurent.pinchart+renesas@ideasonboard.com>
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Link: https://lore.kernel.org/r/20250609225630.502888-2-prabhakar.mahadev-lad.rj@bp.renesas.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

EDAC/ie31200: Enable support for Core i5-14600 and i7-14700 [+ + +]

Author: George Gaidarov <gdgaidarov+lkml@gmail.com>
Date:   Thu May 29 16:29:32 2025 +0000

    EDAC/ie31200: Enable support for Core i5-14600 and i7-14700
    
    [ Upstream commit 493f9c930e5ff72b3508755b45488d1ae2c9650e ]
    
    Device ID '0xa740' is shared by i7-14700, i7-14700K, and i7-14700T.
    Device ID '0xa704' is shared by i5-14600, i5-14600K, and i5-14600T.
    
    Tested locally on my i7-14700K.
    
    Signed-off-by: George Gaidarov <gdgaidarov+lkml@gmail.com>
    Reviewed-by: Qiuxu Zhuo <qiuxu.zhuo@intel.com>
    Signed-off-by: Tony Luck <tony.luck@intel.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Link: https://lore.kernel.org/r/20250529162933.1228735-1-gdgaidarov+lkml@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

EDAC/synopsys: Clear the ECC counters on init [+ + +]

Author: Shubhrajyoti Datta <shubhrajyoti.datta@amd.com>
Date:   Sun Jul 13 10:37:53 2025 +0530

    EDAC/synopsys: Clear the ECC counters on init
    
    [ Upstream commit b1dc7f097b78eb8d25b071ead2384b07a549692b ]
    
    Clear the ECC error and counter registers during initialization/probe to avoid
    reporting stale errors that may have occurred before EDAC registration.
    
    For that, unify the Zynq and ZynqMP ECC state reading paths and simplify the
    code.
    
      [ bp: Massage commit message.
        Fix an -Wsometimes-uninitialized warning as reported by
        Reported-by: kernel test robot <lkp@intel.com>
        Closes: https://lore.kernel.org/oe-kbuild-all/202507141048.obUv3ZUm-lkp@intel.com ]
    
    Signed-off-by: Shubhrajyoti Datta <shubhrajyoti.datta@amd.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Link: https://lore.kernel.org/20250713050753.7042-1-shubhrajyoti.datta@amd.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

erofs: fix block count report when 48-bit layout is on [+ + +]

Author: Gao Xiang <xiang@kernel.org>
Date:   Thu Aug 7 16:20:19 2025 +0800

    erofs: fix block count report when 48-bit layout is on
    
    [ Upstream commit 0b96d9bed324a1c1b7d02bfb9596351ef178428d ]
    
    Fix incorrect shift order when combining the 48-bit block count.
    
    Fixes: 2e1473d5195f ("erofs: implement 48-bit block addressing for unencoded inodes")
    Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com>
    Link: https://lore.kernel.org/r/20250807082019.3093539-1-hsiangkao@linux.alibaba.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

et131x: Add missing check after DMA map [+ + +]

Author: Thomas Fourier <fourier.thomas@gmail.com>
Date:   Wed Jul 16 11:47:30 2025 +0200

    et131x: Add missing check after DMA map
    
    [ Upstream commit d61f6cb6f6ef3c70d2ccc0d9c85c508cb8017da9 ]
    
    The DMA map functions can fail and should be tested for errors.
    If the mapping fails, unmap and return an error.
    
    Signed-off-by: Thomas Fourier <fourier.thomas@gmail.com>
    Acked-by: Mark Einon <mark.einon@gmail.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250716094733.28734-2-fourier.thomas@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

eth: bnxt: take page size into account for page pool recycling rings [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Thu Jun 26 09:54:41 2025 -0700

    eth: bnxt: take page size into account for page pool recycling rings
    
    [ Upstream commit f7dbedba63124256feb9d9fcf36e8a2e43858d1e ]
    
    The Rx rings are filled with Rx buffers. Which are supposed to fit
    packet headers (or MTU if HW-GRO is disabled). The aggregation buffers
    are filled with "device pages". Adjust the sizes of the page pool
    recycling ring appropriately, based on ratio of the size of the
    buffer on given ring vs system page size. Otherwise on a system
    with 64kB pages we end up with >700MB of memory sitting in every
    single page pool cache.
    
    Correct the size calculation for the head_pool. Since the buffers
    there are always small I'm pretty sure I meant to cap the size
    at 1k, rather than make it the lowest possible size. With 64k pages
    1k cache with a 1k ring is 64x larger than we need.
    
    Reviewed-by: Michael Chan <michael.chan@broadcom.com>
    Link: https://patch.msgid.link/20250626165441.4125047-1-kuba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

exfat: add cluster chain loop check for dir [+ + +]

Author: Yuezhang Mo <Yuezhang.Mo@sony.com>
Date:   Tue Mar 18 17:00:49 2025 +0800

    exfat: add cluster chain loop check for dir
    
    [ Upstream commit 99f9a97dce39ad413c39b92c90393bbd6778f3fd ]
    
    An infinite loop may occur if the following conditions occur due to
    file system corruption.
    
    (1) Condition for exfat_count_dir_entries() to loop infinitely.
        - The cluster chain includes a loop.
        - There is no UNUSED entry in the cluster chain.
    
    (2) Condition for exfat_create_upcase_table() to loop infinitely.
        - The cluster chain of the root directory includes a loop.
        - There are no UNUSED entry and up-case table entry in the cluster
          chain of the root directory.
    
    (3) Condition for exfat_load_bitmap() to loop infinitely.
        - The cluster chain of the root directory includes a loop.
        - There are no UNUSED entry and bitmap entry in the cluster chain
          of the root directory.
    
    (4) Condition for exfat_find_dir_entry() to loop infinitely.
        - The cluster chain includes a loop.
        - The unused directory entries were exhausted by some operation.
    
    (5) Condition for exfat_check_dir_empty() to loop infinitely.
        - The cluster chain includes a loop.
        - The unused directory entries were exhausted by some operation.
        - All files and sub-directories under the directory are deleted.
    
    This commit adds checks to break the above infinite loop.
    
    Signed-off-by: Yuezhang Mo <Yuezhang.Mo@sony.com>
    Signed-off-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ext2: Handle fiemap on empty files to prevent EINVAL [+ + +]

Author: Wei Gao <wegao@suse.com>
Date:   Fri Jun 13 11:18:38 2025 -0400

    ext2: Handle fiemap on empty files to prevent EINVAL
    
    [ Upstream commit a099b09a3342a0b28ea330e405501b5b4d0424b4 ]
    
    Previously, ext2_fiemap would unconditionally apply "len = min_t(u64, len,
    i_size_read(inode));", When inode->i_size was 0 (for an empty file), this
    would reduce the requested len to 0. Passing len = 0 to iomap_fiemap could
    then result in an -EINVAL error, even for valid queries on empty files.
    
    Link: https://github.com/linux-test-project/ltp/issues/1246
    Signed-off-by: Wei Gao <wegao@suse.com>
    Signed-off-by: Jan Kara <jack@suse.cz>
    Link: https://patch.msgid.link/20250613152402.3432135-1-wegao@suse.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ext4: do not BUG when INLINE_DATA_FL lacks system.data xattr [+ + +]

Author: Theodore Ts'o <tytso@mit.edu>
Date:   Thu Jul 17 10:54:34 2025 -0400

    ext4: do not BUG when INLINE_DATA_FL lacks system.data xattr
    
    [ Upstream commit 099b847ccc6c1ad2f805d13cfbcc83f5b6d4bc42 ]
    
    A syzbot fuzzed image triggered a BUG_ON in ext4_update_inline_data()
    when an inode had the INLINE_DATA_FL flag set but was missing the
    system.data extended attribute.
    
    Since this can happen due to a maiciouly fuzzed file system, we
    shouldn't BUG, but rather, report it as a corrupted file system.
    
    Add similar replacements of BUG_ON with EXT4_ERROR_INODE() ii
    ext4_create_inline_data() and ext4_inline_data_truncate().
    
    Reported-by: syzbot+544248a761451c0df72f@syzkaller.appspotmail.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ext4: fix largest free orders lists corruption on mb_optimize_scan switch [+ + +]

Author: Baokun Li <libaokun1@huawei.com>
Date:   Mon Jul 14 21:03:21 2025 +0800

    ext4: fix largest free orders lists corruption on mb_optimize_scan switch
    
    commit 7d345aa1fac4c2ec9584fbd6f389f2c2368671d5 upstream.
    
    The grp->bb_largest_free_order is updated regardless of whether
    mb_optimize_scan is enabled. This can lead to inconsistencies between
    grp->bb_largest_free_order and the actual s_mb_largest_free_orders list
    index when mb_optimize_scan is repeatedly enabled and disabled via remount.
    
    For example, if mb_optimize_scan is initially enabled, largest free
    order is 3, and the group is in s_mb_largest_free_orders[3]. Then,
    mb_optimize_scan is disabled via remount, block allocations occur,
    updating largest free order to 2. Finally, mb_optimize_scan is re-enabled
    via remount, more block allocations update largest free order to 1.
    
    At this point, the group would be removed from s_mb_largest_free_orders[3]
    under the protection of s_mb_largest_free_orders_locks[2]. This lock
    mismatch can lead to list corruption.
    
    To fix this, whenever grp->bb_largest_free_order changes, we now always
    attempt to remove the group from its old order list. However, we only
    insert the group into the new order list if `mb_optimize_scan` is enabled.
    This approach helps prevent lock inconsistencies and ensures the data in
    the order lists remains reliable.
    
    Fixes: 196e402adf2e ("ext4: improve cr 0 / cr 1 group scanning")
    CC: stable@vger.kernel.org
    Suggested-by: Jan Kara <jack@suse.cz>
    Signed-off-by: Baokun Li <libaokun1@huawei.com>
    Reviewed-by: Zhang Yi <yi.zhang@huawei.com>
    Link: https://patch.msgid.link/20250714130327.1830534-12-libaokun1@huawei.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ext4: fix zombie groups in average fragment size lists [+ + +]

Author: Baokun Li <libaokun1@huawei.com>
Date:   Mon Jul 14 21:03:20 2025 +0800

    ext4: fix zombie groups in average fragment size lists
    
    commit 1c320d8e92925bb7615f83a7b6e3f402a5c2ca63 upstream.
    
    Groups with no free blocks shouldn't be in any average fragment size list.
    However, when all blocks in a group are allocated(i.e., bb_fragments or
    bb_free is 0), we currently skip updating the average fragment size, which
    means the group isn't removed from its previous s_mb_avg_fragment_size[old]
    list.
    
    This created "zombie" groups that were always skipped during traversal as
    they couldn't satisfy any block allocation requests, negatively impacting
    traversal efficiency.
    
    Therefore, when a group becomes completely full, bb_avg_fragment_size_order
    is now set to -1. If the old order was not -1, a removal operation is
    performed; if the new order is not -1, an insertion is performed.
    
    Fixes: 196e402adf2e ("ext4: improve cr 0 / cr 1 group scanning")
    CC: stable@vger.kernel.org
    Signed-off-by: Baokun Li <libaokun1@huawei.com>
    Reviewed-by: Jan Kara <jack@suse.cz>
    Reviewed-by: Zhang Yi <yi.zhang@huawei.com>
    Link: https://patch.msgid.link/20250714130327.1830534-11-libaokun1@huawei.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ext4: initialize superblock fields in the kballoc-test.c kunit tests [+ + +]

Author: Zhang Yi <yi.zhang@huawei.com>
Date:   Fri Jul 25 10:15:50 2025 +0800

    ext4: initialize superblock fields in the kballoc-test.c kunit tests
    
    commit 82e6381e23f1ea7a14f418215068aaa2ca046c84 upstream.
    
    Various changes in the "ext4: better scalability for ext4 block
    allocation" patch series have resulted in kunit test failures, most
    notably in the test_new_blocks_simple and the test_mb_mark_used tests.
    The root cause of these failures is that various in-memory ext4 data
    structures were not getting initialized, and while previous versions
    of the functions exercised by the unit tests didn't use these
    structure members, this was arguably a test bug.
    
    Since one of the patches in the block allocation scalability patches
    is a fix which is has a cc:stable tag, this commit also has a
    cc:stable tag.
    
    CC: stable@vger.kernel.org
    Link: https://lore.kernel.org/r/20250714130327.1830534-1-libaokun1@huawei.com
    Link: https://patch.msgid.link/20250725021550.3177573-1-yi.zhang@huaweicloud.com
    Link: https://patch.msgid.link/20250725021654.3188798-1-yi.zhang@huaweicloud.com
    Reported-by: Guenter Roeck <linux@roeck-us.net>
    Closes: https://lore.kernel.org/linux-ext4/b0635ad0-7ebf-4152-a69b-58e7e87d5085@roeck-us.net/
    Tested-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Zhang Yi <yi.zhang@huawei.com>
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ext4: limit the maximum folio order [+ + +]

Author: Zhang Yi <yi.zhang@huawei.com>
Date:   Mon Jul 7 22:08:14 2025 +0800

    ext4: limit the maximum folio order
    
    [ Upstream commit b12f423d598fd874df9ecfb2436789d582fda8e6 ]
    
    In environments with a page size of 64KB, the maximum size of a folio
    can reach up to 128MB. Consequently, during the write-back of folios,
    the 'rsv_blocks' will be overestimated to 1,577, which can make
    pressure on the journal space where the journal is small. This can
    easily exceed the limit of a single transaction. Besides, an excessively
    large folio is meaningless and will instead increase the overhead of
    traversing the bhs within the folio. Therefore, limit the maximum order
    of a folio to 2048 filesystem blocks.
    
    Reported-by: Naresh Kamboju <naresh.kamboju@linaro.org>
    Reported-by: Joseph Qi <jiangqi903@gmail.com>
    Closes: https://lore.kernel.org/linux-ext4/CA+G9fYsyYQ3ZL4xaSg1-Tt5Evto7Zd+hgNWZEa9cQLbahA1+xg@mail.gmail.com/
    Signed-off-by: Zhang Yi <yi.zhang@huawei.com>
    Tested-by: Joseph Qi <joseph.qi@linux.alibaba.com>
    Reviewed-by: Jan Kara <jack@suse.cz>
    Link: https://patch.msgid.link/20250707140814.542883-12-yi.zhang@huaweicloud.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

f2fs: check the generic conditions first [+ + +]

Author: Jaegeuk Kim <jaegeuk@kernel.org>
Date:   Mon Jun 30 16:06:09 2025 +0000

    f2fs: check the generic conditions first
    
    [ Upstream commit e23ab8028de0d92df5921a570f5212c0370db3b5 ]
    
    Let's return errors caught by the generic checks. This fixes generic/494 where
    it expects to see EBUSY by setattr_prepare instead of EINVAL by f2fs for active
    swapfile.
    
    Reviewed-by: Chao Yu <chao@kernel.org>
    Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

f2fs: handle nat.blkaddr corruption in f2fs_get_node_info() [+ + +]

Author: Chao Yu <chao@kernel.org>
Date:   Mon Jun 9 15:27:12 2025 +0800

    f2fs: handle nat.blkaddr corruption in f2fs_get_node_info()
    
    [ Upstream commit 1773f63d108b1b9b9d053d8c95f8300c556f93b8 ]
    
    F2FS-fs (dm-55): access invalid blkaddr:972878540
    Call trace:
     dump_backtrace+0xec/0x128
     show_stack+0x18/0x28
     dump_stack_lvl+0x40/0x88
     dump_stack+0x18/0x24
     __f2fs_is_valid_blkaddr+0x360/0x3b4
     f2fs_is_valid_blkaddr+0x10/0x20
     f2fs_get_node_info+0x21c/0x60c
     __write_node_page+0x15c/0x734
     f2fs_sync_node_pages+0x4f8/0x700
     f2fs_write_checkpoint+0x4a8/0x99c
     __checkpoint_and_complete_reqs+0x7c/0x20c
     issue_checkpoint_thread+0x4c/0xd8
     kthread+0x11c/0x1b0
     ret_from_fork+0x10/0x20
    
    If nat.blkaddr is corrupted, during checkpoint, f2fs_sync_node_pages()
    will loop to flush node page w/ corrupted nat.blkaddr.
    
    Although, it tags SBI_NEED_FSCK, checkpoint can not persist it due
    to deadloop.
    
    Let's call f2fs_handle_error(, ERROR_INCONSISTENT_NAT) to record such
    error into superblock, it expects fsck can detect the error and repair
    inconsistent nat.blkaddr after device reboot.
    
    Note that, let's add sanity check in f2fs_get_node_info() to detect
    in-memory nat.blkaddr inconsistency, but only if CONFIG_F2FS_CHECK_FS
    is enabled.
    
    Signed-off-by: Chao Yu <chao@kernel.org>
    Signed-off-by: Jaegeuk Kim <jaegeuk@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fbdev: fix potential buffer overflow in do_register_framebuffer() [+ + +]

Author: Yongzhen Zhang <zhangyongzhen@kylinos.cn>
Date:   Tue Jul 1 17:07:04 2025 +0800

    fbdev: fix potential buffer overflow in do_register_framebuffer()
    
    [ Upstream commit 523b84dc7ccea9c4d79126d6ed1cf9033cf83b05 ]
    
    The current implementation may lead to buffer overflow when:
    1.  Unregistration creates NULL gaps in registered_fb[]
    2.  All array slots become occupied despite num_registered_fb < FB_MAX
    3.  The registration loop exceeds array bounds
    
    Add boundary check to prevent registered_fb[FB_MAX] access.
    
    Signed-off-by: Yongzhen Zhang <zhangyongzhen@kylinos.cn>
    Signed-off-by: Helge Deller <deller@gmx.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fbdev: Fix vmalloc out-of-bounds write in fast_imageblit [+ + +]

Author: Sravan Kumar Gundu <sravankumarlpu@gmail.com>
Date:   Thu Jul 31 15:36:18 2025 -0500

    fbdev: Fix vmalloc out-of-bounds write in fast_imageblit
    
    commit af0db3c1f898144846d4c172531a199bb3ca375d upstream.
    
    This issue triggers when a userspace program does an ioctl
    FBIOPUT_CON2FBMAP by passing console number and frame buffer number.
    Ideally this maps console to frame buffer and updates the screen if
    console is visible.
    
    As part of mapping it has to do resize of console according to frame
    buffer info. if this resize fails and returns from vc_do_resize() and
    continues further. At this point console and new frame buffer are mapped
    and sets display vars. Despite failure still it continue to proceed
    updating the screen at later stages where vc_data is related to previous
    frame buffer and frame buffer info and display vars are mapped to new
    frame buffer and eventully leading to out-of-bounds write in
    fast_imageblit(). This bheviour is excepted only when fg_console is
    equal to requested console which is a visible console and updates screen
    with invalid struct references in fbcon_putcs().
    
    Reported-and-tested-by: syzbot+c4b7aa0513823e2ea880@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=c4b7aa0513823e2ea880
    Signed-off-by: Sravan Kumar Gundu <sravankumarlpu@gmail.com>
    Cc: stable@vger.kernel.org
    Signed-off-by: Helge Deller <deller@gmx.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

fbdev: nvidiafb: add depends on HAS_IOPORT [+ + +]

Author: Randy Dunlap <rdunlap@infradead.org>
Date:   Sun Jun 15 11:36:51 2025 -0700

    fbdev: nvidiafb: add depends on HAS_IOPORT
    
    commit ecdd7df997fd992f0ec70b788e3b12258008a2bf upstream.
    
    The nvidiafb driver uses inb()/outb() without depending on HAS_IOPORT,
    which leads to build errors since kernel v6.13-rc1:
    commit 6f043e757445 ("asm-generic/io.h: Remove I/O port accessors
    for HAS_IOPORT=n")
    
    Add the HAS_IOPORT dependency to prevent the build errors.
    
    (Found in ARCH=um allmodconfig builds)
    
    drivers/video/fbdev/nvidia/nv_accel.c: In function ‘NVDmaWait’:
    include/asm-generic/io.h:596:15: error: call to ‘_outb’ declared with attribute error: outb() requires CONFIG_HAS_IOPORT
      596 | #define _outb _outb
    
    Signed-off-by: Randy Dunlap <rdunlap@infradead.org>
    Cc: Arnd Bergmann <arnd@kernel.org>
    Cc: Niklas Schnelle <schnelle@linux.ibm.com>
    Cc: Antonino Daplas <adaplas@gmail.com>
    Cc: Helge Deller <deller@gmx.de>
    Cc: linux-fbdev@vger.kernel.org
    Cc: dri-devel@lists.freedesktop.org
    Cc: stable@vger.kernel.org # v6.13+
    Signed-off-by: Helge Deller <deller@gmx.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

fhandle: raise FILEID_IS_DIR in handle_type [+ + +]

Author: Christian Brauner <brauner@kernel.org>
Date:   Tue Jun 24 10:29:04 2025 +0200

    fhandle: raise FILEID_IS_DIR in handle_type
    
    commit cc678bf7aa9e2e6c2356fd7f955513c1bd7d4c97 upstream.
    
    Currently FILEID_IS_DIR is raised in fh_flags which is wrong.
    Raise it in handle->handle_type were it's supposed to be.
    
    Link: https://lore.kernel.org/20250624-work-pidfs-fhandle-v2-1-d02a04858fe3@kernel.org
    Fixes: c374196b2b9f ("fs: name_to_handle_at() support for "explicit connectable" file handles")
    Reviewed-by: Jan Kara <jack@suse.cz>
    Reviewed-by: Amir Goldstein <amir73il@gmail.com>
    Cc: stable@kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

firmware: arm_ffa: Change initcall level of ffa_init() to rootfs_initcall [+ + +]

Author: Yeoreum Yun <yeoreum.yun@arm.com>
Date:   Sat Jul 19 13:27:02 2025 +0300

    firmware: arm_ffa: Change initcall level of ffa_init() to rootfs_initcall
    
    [ Upstream commit 0e0546eabcd6c19765a8dbf5b5db3723e7b0ea75 ]
    
    The Linux IMA (Integrity Measurement Architecture) subsystem used for
    secure boot, file integrity, or remote attestation cannot be a loadable
    module for few reasons listed below:
    
     o Boot-Time Integrity: IMA’s main role is to measure and appraise files
     before they are used. This includes measuring critical system files during
     early boot (e.g., init, init scripts, login binaries). If IMA were a
     module, it would be loaded too late to cover those.
    
     o TPM Dependency: IMA integrates tightly with the TPM to record
     measurements into PCRs. The TPM must be initialized early (ideally before
     init_ima()), which aligns with IMA being built-in.
    
     o Security Model: IMA is part of a Trusted Computing Base (TCB). Making it
     a module would weaken the security model, as a potentially compromised
     system could delay or tamper with its initialization.
    
    IMA must be built-in to ensure it starts measuring from the earliest
    possible point in boot which inturn implies TPM must be initialised and
    ready to use before IMA.
    
    To enable integration of tpm_event_log with the IMA subsystem, the TPM
    drivers (tpm_crb and tpm_crb_ffa) also needs to be built-in. However with
    FF-A driver also being initialised at device initcall level, it can lead to
    an initialization order issue where:
     - crb_acpi_driver_init() may run before tpm_crb_ffa_driver()_init and
       ffa_init()
     - As a result, probing the TPM device via CRB over FFA is deferred
     - ima_init() (called as a late initcall) runs before deferred probe
       completes, IMA fails to find the TPM and logs the below error:
    
       |  ima: No TPM chip found, activating TPM-bypass!
    
    Eventually it fails to generate boot_aggregate with PCR values.
    
    Because of the above stated dependency, the ffa driver needs to initialised
    before tpm_crb_ffa module to ensure IMA finds the TPM successfully when
    present.
    
    [ jarkko: reformatted some of the paragraphs because they were going past
      the 75 character boundary. ]
    
    Signed-off-by: Yeoreum Yun <yeoreum.yun@arm.com>
    Reviewed-by: Mimi Zohar <zohar@linux.ibm.com>
    Reviewed-by: Sudeep Holla <sudeep.holla@arm.com>
    Reviewed-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_scmi: Convert to SYSTEM_SLEEP_PM_OPS [+ + +]

Author: Arnd Bergmann <arnd@arndb.de>
Date:   Wed Jul 9 09:01:01 2025 +0200

    firmware: arm_scmi: Convert to SYSTEM_SLEEP_PM_OPS
    
    commit 62d6b81e8bd207ad44eff39d1a0fe17f0df510a5 upstream.
    
    The old SET_SYSTEM_SLEEP_PM_OPS() macro leads to a warning about an
    unused function:
    
      |  drivers/firmware/arm_scmi/scmi_power_control.c:363:12: error:
      |     'scmi_system_power_resume' defined but not used [-Werror=unused-function]
      |         static int scmi_system_power_resume(struct device *dev)
    
    The proper way to do this these days is to use SYSTEM_SLEEP_PM_OPS()
    and pm_sleep_ptr().
    
    Fixes: 9a0658d3991e ("firmware: arm_scmi: power_control: Ensure SCMI_SYSPOWER_IDLE is set early during resume")
    Signed-off-by: Arnd Bergmann <arnd@arndb.de>
    Acked-by: Peng Fan <peng.fan@nxp.com>
    Message-Id: <20250709070107.1388512-1-arnd@kernel.org>
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

firmware: arm_scmi: power_control: Ensure SCMI_SYSPOWER_IDLE is set early during resume [+ + +]

Author: Peng Fan <peng.fan@nxp.com>
Date:   Fri Jul 4 11:09:36 2025 +0800

    firmware: arm_scmi: power_control: Ensure SCMI_SYSPOWER_IDLE is set early during resume
    
    [ Upstream commit 9a0658d3991e6c82df87584b253454842f22f965 ]
    
    Fix a race condition where a second suspend notification from another
    SCMI agent wakes the system before SCMI_SYSPOWER_IDLE is set, leading
    to ignored suspend requests. This is due to interrupts triggering early
    execution of `scmi_userspace_notifier()` before the SCMI state is updated.
    
    To resolve this, set SCMI_SYSPOWER_IDLE earlier in the device resume
    path, prior to `thaw_processes()`. This ensures the SCMI state is
    correct when the notifier runs, allowing the system to suspend again
    as expected.
    
    On some platforms using SCMI, SCP cannot distinguish between CPU idle
    and suspend since both result in cluster power-off. By explicitly setting
    the idle state early, the Linux SCMI agent can correctly re-suspend in
    response to external notifications.
    
    Signed-off-by: Peng Fan <peng.fan@nxp.com>
    Message-Id: <20250704-scmi-pm-v2-2-9316cec2f9cc@nxp.com>
    Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: tegra: Fix IVC dependency problems [+ + +]

Author: Thierry Reding <treding@nvidia.com>
Date:   Tue May 6 15:31:16 2025 +0200

    firmware: tegra: Fix IVC dependency problems
    
    [ Upstream commit 78eb18020a88a4eed15f5af7700ed570642ff8f1 ]
    
    The IVC code is library code that other drivers need to select if they
    need that library. However, if the symbol is user-selectable this can
    lead to conflicts.
    
    Fix this by making the symbol only selectable for COMPILE_TEST and add
    a select TEGRA_IVC to TEGRA_BPMP, which is currently the only user.
    
    Link: https://lore.kernel.org/r/20250506133118.1011777-10-thierry.reding@gmail.com
    Signed-off-by: Thierry Reding <treding@nvidia.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Linux: fix locking in efi_secret_unlink() [+ + +]

Author: Al Viro <viro@zeniv.linux.org.uk>
Date:   Tue May 14 08:48:58 2024 -0600

    fix locking in efi_secret_unlink()
    
    [ Upstream commit 2c58d42de71f9c73e40afacc9d062892d2cc8862 ]
    
    We used to need securityfs_remove() to undo simple_pin_fs() done when
    the file had been created and to drop the second extra reference
    taken at the same time.  Now that neither is needed (or done by
    securityfs_remove()), we can simply call simple_unlink() and be done
    with that - the broken games with locking had been there only for the
    sake of securityfs_remove().
    
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs/ntfs3: Add sanity check for file name [+ + +]

Author: Lizhi Xu <lizhi.xu@windriver.com>
Date:   Fri Jun 6 13:16:16 2025 +0800

    fs/ntfs3: Add sanity check for file name
    
    [ Upstream commit e841ecb139339602bc1853f5f09daa5d1ea920a2 ]
    
    The length of the file name should be smaller than the directory entry size.
    
    Reported-by: syzbot+598057afa0f49e62bd23@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=598057afa0f49e62bd23
    Signed-off-by: Lizhi Xu <lizhi.xu@windriver.com>
    Signed-off-by: Konstantin Komarov <almaz.alexandrovich@paragon-software.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs/ntfs3: correctly create symlink for relative path [+ + +]

Author: Rong Zhang <ulin0208@gmail.com>
Date:   Wed May 7 15:35:34 2025 +0800

    fs/ntfs3: correctly create symlink for relative path
    
    [ Upstream commit b1e9d89408f402858c00103f9831b25ffa0994d3 ]
    
    After applying this patch, could correctly create symlink:
    
    ln -s "relative/path/to/file" symlink
    
    Signed-off-by: Rong Zhang <ulin0208@gmail.com>
    [almaz.alexandrovich@paragon-software.com: added cpu_to_le32 macro to
    rs->Flags assignment]
    Signed-off-by: Konstantin Komarov <almaz.alexandrovich@paragon-software.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs/orangefs: use snprintf() instead of sprintf() [+ + +]

Author: Amir Mohammad Jahangirzad <a.jahangirzad@gmail.com>
Date:   Sun Jun 8 20:05:59 2025 +0330

    fs/orangefs: use snprintf() instead of sprintf()
    
    [ Upstream commit cdfa1304657d6f23be8fd2bb0516380a3c89034e ]
    
    sprintf() is discouraged for use with bounded destination buffers
    as it does not prevent buffer overflows when the formatted output
    exceeds the destination buffer size. snprintf() is a safer
    alternative as it limits the number of bytes written and ensures
    NUL-termination.
    
    Replace sprintf() with snprintf() for copying the debug string
    into a temporary buffer, using ORANGEFS_MAX_DEBUG_STRING_LEN as
    the maximum size to ensure safe formatting and prevent memory
    corruption in edge cases.
    
    EDIT: After this patch sat on linux-next for a few days, Dan
    Carpenter saw it and suggested that I use scnprintf instead of
    snprintf. I made the change and retested.
    
    Signed-off-by: Amir Mohammad Jahangirzad <a.jahangirzad@gmail.com>
    Signed-off-by: Mike Marshall <hubcap@omnibond.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs: Prevent file descriptor table allocations exceeding INT_MAX [+ + +]

Author: Sasha Levin <sashal@kernel.org>
Date:   Sun Jun 29 03:40:21 2025 -0400

    fs: Prevent file descriptor table allocations exceeding INT_MAX
    
    commit 04a2c4b4511d186b0fce685da21085a5d4acd370 upstream.
    
    When sysctl_nr_open is set to a very high value (for example, 1073741816
    as set by systemd), processes attempting to use file descriptors near
    the limit can trigger massive memory allocation attempts that exceed
    INT_MAX, resulting in a WARNING in mm/slub.c:
    
      WARNING: CPU: 0 PID: 44 at mm/slub.c:5027 __kvmalloc_node_noprof+0x21a/0x288
    
    This happens because kvmalloc_array() and kvmalloc() check if the
    requested size exceeds INT_MAX and emit a warning when the allocation is
    not flagged with __GFP_NOWARN.
    
    Specifically, when nr_open is set to 1073741816 (0x3ffffff8) and a
    process calls dup2(oldfd, 1073741880), the kernel attempts to allocate:
    - File descriptor array: 1073741880 * 8 bytes = 8,589,935,040 bytes
    - Multiple bitmaps: ~400MB
    - Total allocation size: > 8GB (exceeding INT_MAX = 2,147,483,647)
    
    Reproducer:
    1. Set /proc/sys/fs/nr_open to 1073741816:
       # echo 1073741816 > /proc/sys/fs/nr_open
    
    2. Run a program that uses a high file descriptor:
       #include <unistd.h>
       #include <sys/resource.h>
    
       int main() {
           struct rlimit rlim = {1073741824, 1073741824};
           setrlimit(RLIMIT_NOFILE, &rlim);
           dup2(2, 1073741880);  // Triggers the warning
           return 0;
       }
    
    3. Observe WARNING in dmesg at mm/slub.c:5027
    
    systemd commit a8b627a introduced automatic bumping of fs.nr_open to the
    maximum possible value. The rationale was that systems with memory
    control groups (memcg) no longer need separate file descriptor limits
    since memory is properly accounted. However, this change overlooked
    that:
    
    1. The kernel's allocation functions still enforce INT_MAX as a maximum
       size regardless of memcg accounting
    2. Programs and tests that legitimately test file descriptor limits can
       inadvertently trigger massive allocations
    3. The resulting allocations (>8GB) are impractical and will always fail
    
    systemd's algorithm starts with INT_MAX and keeps halving the value
    until the kernel accepts it. On most systems, this results in nr_open
    being set to 1073741816 (0x3ffffff8), which is just under 1GB of file
    descriptors.
    
    While processes rarely use file descriptors near this limit in normal
    operation, certain selftests (like
    tools/testing/selftests/core/unshare_test.c) and programs that test file
    descriptor limits can trigger this issue.
    
    Fix this by adding a check in alloc_fdtable() to ensure the requested
    allocation size does not exceed INT_MAX. This causes the operation to
    fail with -EMFILE instead of triggering a kernel warning and avoids the
    impractical >8GB memory allocation request.
    
    Fixes: 9cfe015aa424 ("get rid of NR_OPEN and introduce a sysctl_nr_open")
    Cc: stable@vger.kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Link: https://lore.kernel.org/20250629074021.1038845-1-sashal@kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

fscrypt: Don't use problematic non-inline crypto engines [+ + +]

Author: Eric Biggers <ebiggers@kernel.org>
Date:   Fri Jul 4 00:03:22 2025 -0700

    fscrypt: Don't use problematic non-inline crypto engines
    
    commit b41c1d8d07906786c60893980d52688f31d114a6 upstream.
    
    Make fscrypt no longer use Crypto API drivers for non-inline crypto
    engines, even when the Crypto API prioritizes them over CPU-based code
    (which unfortunately it often does).  These drivers tend to be really
    problematic, especially for fscrypt's workload.  This commit has no
    effect on inline crypto engines, which are different and do work well.
    
    Specifically, exclude drivers that have CRYPTO_ALG_KERN_DRIVER_ONLY or
    CRYPTO_ALG_ALLOCATES_MEMORY set.  (Later, CRYPTO_ALG_ASYNC should be
    excluded too.  That's omitted for now to keep this commit backportable,
    since until recently some CPU-based code had CRYPTO_ALG_ASYNC set.)
    
    There are two major issues with these drivers: bugs and performance.
    
    First, these drivers tend to be buggy.  They're fundamentally much more
    error-prone and harder to test than the CPU-based code.  They often
    don't get tested before kernel releases, and even if they do, the crypto
    self-tests don't properly test these drivers.  Released drivers have
    en/decrypted or hashed data incorrectly.  These bugs cause issues for
    fscrypt users who often didn't even want to use these drivers, e.g.:
    
    - https://github.com/google/fscryptctl/issues/32
    - https://github.com/google/fscryptctl/issues/9
    - https://lore.kernel.org/r/PH0PR02MB731916ECDB6C613665863B6CFFAA2@PH0PR02MB7319.namprd02.prod.outlook.com
    
    These drivers have also similarly caused issues for dm-crypt users,
    including data corruption and deadlocks.  Since Linux v5.10, dm-crypt
    has disabled most of them by excluding CRYPTO_ALG_ALLOCATES_MEMORY.
    
    Second, these drivers tend to be *much* slower than the CPU-based code.
    This may seem counterintuitive, but benchmarks clearly show it.  There's
    a *lot* of overhead associated with going to a hardware driver, off the
    CPU, and back again.  To prove this, I gathered as many systems with
    this type of crypto engine as I could, and I measured synchronous
    encryption of 4096-byte messages (which matches fscrypt's workload):
    
    Intel Emerald Rapids server:
       AES-256-XTS:
          xts-aes-vaes-avx512   16171 MB/s  [CPU-based, Vector AES]
          qat_aes_xts             289 MB/s  [Offload, Intel QuickAssist]
    
    Qualcomm SM8650 HDK:
       AES-256-XTS:
          xts-aes-ce             4301 MB/s  [CPU-based, ARMv8 Crypto Extensions]
          xts-aes-qce              73 MB/s  [Offload, Qualcomm Crypto Engine]
    
    i.MX 8M Nano LPDDR4 EVK:
       AES-256-XTS:
          xts-aes-ce              647 MB/s   [CPU-based, ARMv8 Crypto Extensions]
          xts(ecb-aes-caam)        20 MB/s   [Offload, CAAM]
       AES-128-CBC-ESSIV:
          essiv(cbc-aes-caam,sha256-lib) 23 MB/s   [Offload, CAAM]
    
    STM32MP157F-DK2:
       AES-256-XTS:
          xts-aes-neonbs         13.2 MB/s   [CPU-based, ARM NEON]
          xts(stm32-ecb-aes)     3.1 MB/s    [Offload, STM32 crypto engine]
       AES-128-CBC-ESSIV:
          essiv(cbc-aes-neonbs,sha256-lib)
                                 14.7 MB/s   [CPU-based, ARM NEON]
          essiv(stm32-cbc-aes,sha256-lib)
                                 3.2 MB/s    [Offload, STM32 crypto engine]
       Adiantum:
          adiantum(xchacha12-arm,aes-arm,nhpoly1305-neon)
                                 52.8 MB/s   [CPU-based, ARM scalar + NEON]
    
    So, there was no case in which the crypto engine was even *close* to
    being faster.  On the first three, which have AES instructions in the
    CPU, the CPU was 30 to 55 times faster (!).  Even on STM32MP157F-DK2
    which has a Cortex-A7 CPU that doesn't have AES instructions, AES was
    over 4 times faster on the CPU.  And Adiantum encryption, which is what
    actually should be used on CPUs like that, was over 17 times faster.
    
    Other justifications that have been given for these non-inline crypto
    engines (almost always coming from the hardware vendors, not actual
    users) don't seem very plausible either:
    
      - The crypto engine throughput could be improved by processing
        multiple requests concurrently.  Currently irrelevant to fscrypt,
        since it doesn't do that.  This would also be complex, and unhelpful
        in many cases.  2 of the 4 engines I tested even had only one queue.
    
      - Some of the engines, e.g. STM32, support hardware keys.  Also
        currently irrelevant to fscrypt, since it doesn't support these.
        Interestingly, the STM32 driver itself doesn't support this either.
    
      - Free up CPU for other tasks and/or reduce energy usage.  Not very
        plausible considering the "short" message length, driver overhead,
        and scheduling overhead.  There's just very little time for the CPU
        to do something else like run another task or enter low-power state,
        before the message finishes and it's time to process the next one.
    
      - Some of these engines resist power analysis and electromagnetic
        attacks, while the CPU-based crypto generally does not.  In theory,
        this sounds great.  In practice, if this benefit requires the use of
        an off-CPU offload that massively regresses performance and has a
        low-quality, buggy driver, the price for this hardening (which is
        not relevant to most fscrypt users, and tends to be incomplete) is
        just too high.  Inline crypto engines are much more promising here,
        as are on-CPU solutions like RISC-V High Assurance Cryptography.
    
    Fixes: b30ab0e03407 ("ext4 crypto: add ext4 encryption facilities")
    Cc: stable@vger.kernel.org
    Acked-by: Ard Biesheuvel <ardb@kernel.org>
    Link: https://lore.kernel.org/r/20250704070322.20692-1-ebiggers@kernel.org
    Signed-off-by: Eric Biggers <ebiggers@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

futex: Use user_write_access_begin/_end() in futex_put_value() [+ + +]

Author: Waiman Long <longman@redhat.com>
Date:   Mon Aug 11 10:11:47 2025 -0400

    futex: Use user_write_access_begin/_end() in futex_put_value()
    
    commit dfb36e4a8db0cd56f92d4cb445f54e85a9b40897 upstream.
    
    Commit cec199c5e39b ("futex: Implement FUTEX2_NUMA") introduced the
    futex_put_value() helper to write a value to the given user
    address.
    
    However, it uses user_read_access_begin() before the write. For
    architectures that differentiate between read and write accesses, like
    PowerPC, futex_put_value() fails with -EFAULT.
    
    Fix that by using the user_write_access_begin/user_write_access_end() pair
    instead.
    
    Fixes: cec199c5e39b ("futex: Implement FUTEX2_NUMA")
    Signed-off-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Cc: stable@vger.kernel.org
    Link: https://lore.kernel.org/all/20250811141147.322261-1-longman@redhat.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

gfs2: Set .migrate_folio in gfs2_{rgrp,meta}_aops [+ + +]

Author: Andrew Price <anprice@redhat.com>
Date:   Mon Jul 14 16:21:15 2025 +0100

    gfs2: Set .migrate_folio in gfs2_{rgrp,meta}_aops
    
    [ Upstream commit 5c8f12cf1e64e0e8e6cb80b0c935389973e8be8d ]
    
    Clears up the warning added in 7ee3647243e5 ("migrate: Remove call to
    ->writepage") that occurs in various xfstests, causing "something found
    in dmesg" failures.
    
    [  341.136573] gfs2_meta_aops does not implement migrate_folio
    [  341.136953] WARNING: CPU: 1 PID: 36 at mm/migrate.c:944 move_to_new_folio+0x2f8/0x300
    
    Signed-off-by: Andrew Price <anprice@redhat.com>
    Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gfs2: Validate i_depth for exhash directories [+ + +]

Author: Andrew Price <anprice@redhat.com>
Date:   Wed Jul 16 14:12:07 2025 +0100

    gfs2: Validate i_depth for exhash directories
    
    [ Upstream commit 557c024ca7250bb65ae60f16c02074106c2f197b ]
    
    A fuzzer test introduced corruption that ends up with a depth of 0 in
    dir_e_read(), causing an undefined shift by 32 at:
    
      index = hash >> (32 - dip->i_depth);
    
    As calculated in an open-coded way in dir_make_exhash(), the minimum
    depth for an exhash directory is ilog2(sdp->sd_hash_ptrs) and 0 is
    invalid as sdp->sd_hash_ptrs is fixed as sdp->bsize / 16 at mount time.
    
    So we can avoid the undefined behaviour by checking for depth values
    lower than the minimum in gfs2_dinode_in(). Values greater than the
    maximum are already being checked for there.
    
    Also switch the calculation in dir_make_exhash() to use ilog2() to
    clarify how the depth is calculated.
    
    Tested with the syzkaller repro.c and xfstests '-g quick'.
    
    Reported-by: syzbot+4708579bb230a0582a57@syzkaller.appspotmail.com
    Signed-off-by: Andrew Price <anprice@redhat.com>
    Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: loongson-64bit: Extend GPIO irq support [+ + +]

Author: Binbin Zhou <zhoubinbin@loongson.cn>
Date:   Mon Jul 14 14:45:42 2025 +0800

    gpio: loongson-64bit: Extend GPIO irq support
    
    [ Upstream commit 27cb8f702eb789f97f7a8bd5a91d76c65a937b2f ]
    
    Add the interrupt enable register offset (inten_offset) so that GPIO
    interrupts can be enabled normally on more models.
    
    According to the latest interface specifications, the definition of GPIO
    interrupts in ACPI is similar to that in FDT. The GPIO interrupts are
    listed one by one according to the GPIO number, and the corresponding
    interrupt number can be obtained directly through the GPIO number
    specified by the consumer.
    
    Signed-off-by: Xi Ruoyao <xry111@xry111.site>
    Signed-off-by: Binbin Zhou <zhoubinbin@loongson.cn>
    Reviewed-by: Huacai Chen <chenhuacai@loongson.cn>
    Link: https://lore.kernel.org/r/20250714064542.2276247-1-zhoubinbin@loongson.cn
    [Bartosz: tweaked the commit message]
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: mlxbf2: use platform_get_irq_optional() [+ + +]

Author: David Thompson <davthompson@nvidia.com>
Date:   Mon Jul 28 10:46:19 2025 -0400

    gpio: mlxbf2: use platform_get_irq_optional()
    
    commit 63c7bc53a35e785accdc2ceab8f72d94501931ab upstream.
    
    The gpio-mlxbf2 driver interfaces with four GPIO controllers,
    device instances 0-3. There are two IRQ resources shared between
    the four controllers, and they are found in the ACPI table for
    instances 0 and 3. The driver should not use platform_get_irq(),
    otherwise this error is logged when probing instances 1 and 2:
      mlxbf2_gpio MLNXBF22:01: error -ENXIO: IRQ index 0 not found
    
    Fixes: 2b725265cb08 ("gpio: mlxbf2: Introduce IRQ support")
    Cc: stable@vger.kernel.org
    Signed-off-by: David Thompson <davthompson@nvidia.com>
    Reviewed-by: Shravan Kumar Ramani <shravankr@nvidia.com>
    Reviewed-by: Mika Westerberg <mika.westerberg@linux.intel.com>
    Link: https://lore.kernel.org/r/20250728144619.29894-1-davthompson@nvidia.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

gpio: mlxbf3: use platform_get_irq_optional() [+ + +]

Author: David Thompson <davthompson@nvidia.com>
Date:   Mon Aug 11 13:50:45 2025 -0400

    gpio: mlxbf3: use platform_get_irq_optional()
    
    commit 810bd9066fb1871b8a9528f31f2fdbf2a8b73bf2 upstream.
    
    The gpio-mlxbf3 driver interfaces with two GPIO controllers,
    device instance 0 and 1. There is a single IRQ resource shared
    between the two controllers, and it is found in the ACPI table for
    device instance 0. The driver should not use platform_get_irq(),
    otherwise this error is logged when probing instance 1:
        mlxbf3_gpio MLNXBF33:01: error -ENXIO: IRQ index 0 not found
    
    Cc: stable@vger.kernel.org
    Fixes: cd33f216d241 ("gpio: mlxbf3: Add gpio driver support")
    Signed-off-by: David Thompson <davthompson@nvidia.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://lore.kernel.org/r/ce70b98a201ce82b9df9aa80ac7a5eeaa2268e52.1754928650.git.davthompson@nvidia.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

gpio: tps65912: check the return value of regmap_update_bits() [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
Date:   Mon Jul 7 09:50:15 2025 +0200

    gpio: tps65912: check the return value of regmap_update_bits()
    
    [ Upstream commit a0b2a6bbff8c26aafdecd320f38f52c341d5cafa ]
    
    regmap_update_bits() can fail, check its return value like we do
    elsewhere in the driver.
    
    Link: https://lore.kernel.org/r/20250707-gpiochip-set-rv-gpio-round4-v1-2-35668aaaf6d2@linaro.org
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: virtio: Fix config space reading. [+ + +]

Author: Harald Mommer <harald.mommer@oss.qualcomm.com>
Date:   Thu Jul 24 16:36:53 2025 +0200

    gpio: virtio: Fix config space reading.
    
    commit 4740e1e2f320061c2f0dbadc0dd3dfb58df986d5 upstream.
    
    Quote from the virtio specification chapter 4.2.2.2:
    
    "For the device-specific configuration space, the driver MUST use 8 bit
    wide accesses for 8 bit wide fields, 16 bit wide and aligned accesses
    for 16 bit wide fields and 32 bit wide and aligned accesses for 32 and
    64 bit wide fields."
    
    Signed-off-by: Harald Mommer <harald.mommer@oss.qualcomm.com>
    Cc: stable@vger.kernel.org
    Fixes: 3a29355a22c0 ("gpio: Add virtio-gpio driver")
    Acked-by: Viresh Kumar <viresh.kumar@linaro.org>
    Link: https://lore.kernel.org/r/20250724143718.5442-2-harald.mommer@oss.qualcomm.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

gpio: wcd934x: check the return value of regmap_update_bits() [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
Date:   Wed Jul 9 08:41:39 2025 +0200

    gpio: wcd934x: check the return value of regmap_update_bits()
    
    [ Upstream commit ff0f0d7c6587e38c308be9905e36f86e98fb9c1f ]
    
    regmap_update_bits() can fail so check its return value in
    wcd_gpio_direction_output() for consistency with the rest of the code
    and propagate any errors.
    
    Link: https://lore.kernel.org/r/20250709-gpiochip-set-rv-gpio-remaining-v1-2-b8950f69618d@linaro.org
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gve: Return error for unknown admin queue command [+ + +]

Author: Alok Tiwari <alok.a.tiwari@oracle.com>
Date:   Sun Jun 15 22:45:01 2025 -0700

    gve: Return error for unknown admin queue command
    
    [ Upstream commit b11344f63fdd9e8c5121148a6965b41079071dd2 ]
    
    In gve_adminq_issue_cmd(), return -EINVAL instead of 0 when an unknown
    admin queue command opcode is encountered.
    
    This prevents the function from silently succeeding on invalid input
    and prevents undefined behavior by ensuring the function fails gracefully
    when an unrecognized opcode is provided.
    
    These changes improve error handling.
    
    Signed-off-by: Alok Tiwari <alok.a.tiwari@oracle.com>
    Link: https://patch.msgid.link/20250616054504.1644770-2-alok.a.tiwari@oracle.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

habanalabs: fix UAF in export_dmabuf() [+ + +]

Author: Al Viro <viro@zeniv.linux.org.uk>
Date:   Sat Jul 12 06:02:31 2025 +0100

    habanalabs: fix UAF in export_dmabuf()
    
    [ Upstream commit 33927f3d0ecdcff06326d6e4edb6166aed42811c ]
    
    As soon as we'd inserted a file reference into descriptor table, another
    thread could close it.  That's fine for the case when all we are doing is
    returning that descriptor to userland (it's a race, but it's a userland
    race and there's nothing the kernel can do about it).  However, if we
    follow fd_install() with any kind of access to objects that would be
    destroyed on close (be it the struct file itself or anything destroyed
    by its ->release()), we have a UAF.
    
    dma_buf_fd() is a combination of reserving a descriptor and fd_install().
    habanalabs export_dmabuf() calls it and then proceeds to access the
    objects destroyed on close.  In particular, it grabs an extra reference to
    another struct file that will be dropped as part of ->release() for ours;
    that "will be" is actually "might have already been".
    
    Fix that by reserving descriptor before anything else and do fd_install()
    only when everything had been set up.  As a side benefit, we no longer
    have the failure exit with file already created, but reference to
    underlying file (as well as ->dmabuf_export_cnt, etc.) not grabbed yet;
    unlike dma_buf_fd(), fd_install() can't fail.
    
    Fixes: db1a8dd916aa ("habanalabs: add support for dma-buf exporter")
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hamradio: ignore ops-locked netdevs [+ + +]

Author: Stanislav Fomichev <sdf@fomichev.me>
Date:   Wed Aug 6 14:37:26 2025 -0700

    hamradio: ignore ops-locked netdevs
    
    [ Upstream commit c64237960819aee1766d03f446ae6de94b1e3f73 ]
    
    Syzkaller managed to trigger lock dependency in xsk_notify via
    register_netdevice. As discussed in [0], using register_netdevice
    in the notifiers is problematic so skip adding hamradio for ops-locked
    devices.
    
           xsk_notifier+0x89/0x230 net/xdp/xsk.c:1664
           notifier_call_chain+0x1b6/0x3e0 kernel/notifier.c:85
           call_netdevice_notifiers_extack net/core/dev.c:2267 [inline]
           call_netdevice_notifiers net/core/dev.c:2281 [inline]
           unregister_netdevice_many_notify+0x14d7/0x1ff0 net/core/dev.c:12156
           unregister_netdevice_many net/core/dev.c:12219 [inline]
           unregister_netdevice_queue+0x33c/0x380 net/core/dev.c:12063
           register_netdevice+0x1689/0x1ae0 net/core/dev.c:11241
           bpq_new_device drivers/net/hamradio/bpqether.c:481 [inline]
           bpq_device_event+0x491/0x600 drivers/net/hamradio/bpqether.c:523
           notifier_call_chain+0x1b6/0x3e0 kernel/notifier.c:85
           call_netdevice_notifiers_extack net/core/dev.c:2267 [inline]
           call_netdevice_notifiers net/core/dev.c:2281 [inline]
           __dev_notify_flags+0x18d/0x2e0 net/core/dev.c:-1
           netif_change_flags+0xe8/0x1a0 net/core/dev.c:9608
           dev_change_flags+0x130/0x260 net/core/dev_api.c:68
           devinet_ioctl+0xbb4/0x1b50 net/ipv4/devinet.c:1200
           inet_ioctl+0x3c0/0x4c0 net/ipv4/af_inet.c:1001
    
    0: https://lore.kernel.org/netdev/20250625140357.6203d0af@kernel.org/
    Fixes: 4c975fd70002 ("net: hold instance lock during NETDEV_REGISTER/UP")
    Suggested-by: Jakub Kicinski <kuba@kernel.org>
    Reported-by: syzbot+e6300f66a999a6612477@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=e6300f66a999a6612477
    Signed-off-by: Stanislav Fomichev <sdf@fomichev.me>
    Link: https://patch.msgid.link/20250806213726.1383379-2-sdf@fomichev.me
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hfs: fix general protection fault in hfs_find_init() [+ + +]

Author: Viacheslav Dubeyko <slava@dubeyko.com>
Date:   Thu Jul 10 14:36:57 2025 -0700

    hfs: fix general protection fault in hfs_find_init()
    
    [ Upstream commit 736a0516a16268995f4898eded49bfef077af709 ]
    
    The hfs_find_init() method can trigger the crash
    if tree pointer is NULL:
    
    [   45.746290][ T9787] Oops: general protection fault, probably for non-canonical address 0xdffffc0000000008: 0000 [#1] SMP KAI
    [   45.747287][ T9787] KASAN: null-ptr-deref in range [0x0000000000000040-0x0000000000000047]
    [   45.748716][ T9787] CPU: 2 UID: 0 PID: 9787 Comm: repro Not tainted 6.16.0-rc3 #10 PREEMPT(full)
    [   45.750250][ T9787] Hardware name: QEMU Ubuntu 24.04 PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    [   45.751983][ T9787] RIP: 0010:hfs_find_init+0x86/0x230
    [   45.752834][ T9787] Code: c1 ea 03 80 3c 02 00 0f 85 9a 01 00 00 4c 8d 6b 40 48 c7 45 18 00 00 00 00 48 b8 00 00 00 00 00 fc
    [   45.755574][ T9787] RSP: 0018:ffffc90015157668 EFLAGS: 00010202
    [   45.756432][ T9787] RAX: dffffc0000000000 RBX: 0000000000000000 RCX: ffffffff819a4d09
    [   45.757457][ T9787] RDX: 0000000000000008 RSI: ffffffff819acd3a RDI: ffffc900151576e8
    [   45.758282][ T9787] RBP: ffffc900151576d0 R08: 0000000000000005 R09: 0000000000000000
    [   45.758943][ T9787] R10: 0000000080000000 R11: 0000000000000001 R12: 0000000000000004
    [   45.759619][ T9787] R13: 0000000000000040 R14: ffff88802c50814a R15: 0000000000000000
    [   45.760293][ T9787] FS:  00007ffb72734540(0000) GS:ffff8880cec64000(0000) knlGS:0000000000000000
    [   45.761050][ T9787] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [   45.761606][ T9787] CR2: 00007f9bd8225000 CR3: 000000010979a000 CR4: 00000000000006f0
    [   45.762286][ T9787] Call Trace:
    [   45.762570][ T9787]  <TASK>
    [   45.762824][ T9787]  hfs_ext_read_extent+0x190/0x9d0
    [   45.763269][ T9787]  ? submit_bio_noacct_nocheck+0x2dd/0xce0
    [   45.763766][ T9787]  ? __pfx_hfs_ext_read_extent+0x10/0x10
    [   45.764250][ T9787]  hfs_get_block+0x55f/0x830
    [   45.764646][ T9787]  block_read_full_folio+0x36d/0x850
    [   45.765105][ T9787]  ? __pfx_hfs_get_block+0x10/0x10
    [   45.765541][ T9787]  ? const_folio_flags+0x5b/0x100
    [   45.765972][ T9787]  ? __pfx_hfs_read_folio+0x10/0x10
    [   45.766415][ T9787]  filemap_read_folio+0xbe/0x290
    [   45.766840][ T9787]  ? __pfx_filemap_read_folio+0x10/0x10
    [   45.767325][ T9787]  ? __filemap_get_folio+0x32b/0xbf0
    [   45.767780][ T9787]  do_read_cache_folio+0x263/0x5c0
    [   45.768223][ T9787]  ? __pfx_hfs_read_folio+0x10/0x10
    [   45.768666][ T9787]  read_cache_page+0x5b/0x160
    [   45.769070][ T9787]  hfs_btree_open+0x491/0x1740
    [   45.769481][ T9787]  hfs_mdb_get+0x15e2/0x1fb0
    [   45.769877][ T9787]  ? __pfx_hfs_mdb_get+0x10/0x10
    [   45.770316][ T9787]  ? find_held_lock+0x2b/0x80
    [   45.770731][ T9787]  ? lockdep_init_map_type+0x5c/0x280
    [   45.771200][ T9787]  ? lockdep_init_map_type+0x5c/0x280
    [   45.771674][ T9787]  hfs_fill_super+0x38e/0x720
    [   45.772092][ T9787]  ? __pfx_hfs_fill_super+0x10/0x10
    [   45.772549][ T9787]  ? snprintf+0xbe/0x100
    [   45.772931][ T9787]  ? __pfx_snprintf+0x10/0x10
    [   45.773350][ T9787]  ? do_raw_spin_lock+0x129/0x2b0
    [   45.773796][ T9787]  ? find_held_lock+0x2b/0x80
    [   45.774215][ T9787]  ? set_blocksize+0x40a/0x510
    [   45.774636][ T9787]  ? sb_set_blocksize+0x176/0x1d0
    [   45.775087][ T9787]  ? setup_bdev_super+0x369/0x730
    [   45.775533][ T9787]  get_tree_bdev_flags+0x384/0x620
    [   45.775985][ T9787]  ? __pfx_hfs_fill_super+0x10/0x10
    [   45.776453][ T9787]  ? __pfx_get_tree_bdev_flags+0x10/0x10
    [   45.776950][ T9787]  ? bpf_lsm_capable+0x9/0x10
    [   45.777365][ T9787]  ? security_capable+0x80/0x260
    [   45.777803][ T9787]  vfs_get_tree+0x8e/0x340
    [   45.778203][ T9787]  path_mount+0x13de/0x2010
    [   45.778604][ T9787]  ? kmem_cache_free+0x2b0/0x4c0
    [   45.779052][ T9787]  ? __pfx_path_mount+0x10/0x10
    [   45.779480][ T9787]  ? getname_flags.part.0+0x1c5/0x550
    [   45.779954][ T9787]  ? putname+0x154/0x1a0
    [   45.780335][ T9787]  __x64_sys_mount+0x27b/0x300
    [   45.780758][ T9787]  ? __pfx___x64_sys_mount+0x10/0x10
    [   45.781232][ T9787]  do_syscall_64+0xc9/0x480
    [   45.781631][ T9787]  entry_SYSCALL_64_after_hwframe+0x77/0x7f
    [   45.782149][ T9787] RIP: 0033:0x7ffb7265b6ca
    [   45.782539][ T9787] Code: 48 8b 0d c9 17 0d 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48
    [   45.784212][ T9787] RSP: 002b:00007ffc0c10cfb8 EFLAGS: 00000206 ORIG_RAX: 00000000000000a5
    [   45.784935][ T9787] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007ffb7265b6ca
    [   45.785626][ T9787] RDX: 0000200000000240 RSI: 0000200000000280 RDI: 00007ffc0c10d100
    [   45.786316][ T9787] RBP: 00007ffc0c10d190 R08: 00007ffc0c10d000 R09: 0000000000000000
    [   45.787011][ T9787] R10: 0000000000000048 R11: 0000000000000206 R12: 0000560246733250
    [   45.787697][ T9787] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
    [   45.788393][ T9787]  </TASK>
    [   45.788665][ T9787] Modules linked in:
    [   45.789058][ T9787] ---[ end trace 0000000000000000 ]---
    [   45.789554][ T9787] RIP: 0010:hfs_find_init+0x86/0x230
    [   45.790028][ T9787] Code: c1 ea 03 80 3c 02 00 0f 85 9a 01 00 00 4c 8d 6b 40 48 c7 45 18 00 00 00 00 48 b8 00 00 00 00 00 fc
    [   45.792364][ T9787] RSP: 0018:ffffc90015157668 EFLAGS: 00010202
    [   45.793155][ T9787] RAX: dffffc0000000000 RBX: 0000000000000000 RCX: ffffffff819a4d09
    [   45.794123][ T9787] RDX: 0000000000000008 RSI: ffffffff819acd3a RDI: ffffc900151576e8
    [   45.795105][ T9787] RBP: ffffc900151576d0 R08: 0000000000000005 R09: 0000000000000000
    [   45.796135][ T9787] R10: 0000000080000000 R11: 0000000000000001 R12: 0000000000000004
    [   45.797114][ T9787] R13: 0000000000000040 R14: ffff88802c50814a R15: 0000000000000000
    [   45.798024][ T9787] FS:  00007ffb72734540(0000) GS:ffff8880cec64000(0000) knlGS:0000000000000000
    [   45.799019][ T9787] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [   45.799822][ T9787] CR2: 00007f9bd8225000 CR3: 000000010979a000 CR4: 00000000000006f0
    [   45.800747][ T9787] Kernel panic - not syncing: Fatal exception
    
    The hfs_fill_super() calls hfs_mdb_get() method that tries
    to construct Extents Tree and Catalog Tree:
    
    HFS_SB(sb)->ext_tree = hfs_btree_open(sb, HFS_EXT_CNID, hfs_ext_keycmp);
    if (!HFS_SB(sb)->ext_tree) {
            pr_err("unable to open extent tree\n");
            goto out;
    }
    HFS_SB(sb)->cat_tree = hfs_btree_open(sb, HFS_CAT_CNID, hfs_cat_keycmp);
    if (!HFS_SB(sb)->cat_tree) {
            pr_err("unable to open catalog tree\n");
            goto out;
    }
    
    However, hfs_btree_open() calls read_mapping_page() that
    calls hfs_get_block(). And this method calls hfs_ext_read_extent():
    
    static int hfs_ext_read_extent(struct inode *inode, u16 block)
    {
            struct hfs_find_data fd;
            int res;
    
            if (block >= HFS_I(inode)->cached_start &&
                block < HFS_I(inode)->cached_start + HFS_I(inode)->cached_blocks)
                    return 0;
    
            res = hfs_find_init(HFS_SB(inode->i_sb)->ext_tree, &fd);
            if (!res) {
                    res = __hfs_ext_cache_extent(&fd, inode, block);
                    hfs_find_exit(&fd);
            }
            return res;
    }
    
    The problem here that hfs_find_init() is trying to use
    HFS_SB(inode->i_sb)->ext_tree that is not initialized yet.
    It will be initailized when hfs_btree_open() finishes
    the execution.
    
    The patch adds checking of tree pointer in hfs_find_init()
    and it reworks the logic of hfs_btree_open() by reading
    the b-tree's header directly from the volume. The read_mapping_page()
    is exchanged on filemap_grab_folio() that grab the folio from
    mapping. Then, sb_bread() extracts the b-tree's header
    content and copy it into the folio.
    
    Reported-by: Wenzhi Wang <wenzhi.wang@uwaterloo.ca>
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    cc: John Paul Adrian Glaubitz <glaubitz@physik.fu-berlin.de>
    cc: Yangtao Li <frank.li@vivo.com>
    cc: linux-fsdevel@vger.kernel.org
    Link: https://lore.kernel.org/r/20250710213657.108285-1-slava@dubeyko.com
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hfs: fix not erasing deleted b-tree node issue [+ + +]

Author: Viacheslav Dubeyko <slava@dubeyko.com>
Date:   Tue Apr 29 17:12:11 2025 -0700

    hfs: fix not erasing deleted b-tree node issue
    
    [ Upstream commit d3ed6d6981f4756f145766753c872482bc3b28d3 ]
    
    The generic/001 test of xfstests suite fails and corrupts
    the HFS volume:
    
    sudo ./check generic/001
    FSTYP         -- hfs
    PLATFORM      -- Linux/x86_64 hfsplus-testing-0001 6.15.0-rc2+ #3 SMP PREEMPT_DYNAMIC Fri Apr 25 17:13:00 PDT 2>
    MKFS_OPTIONS  -- /dev/loop51
    MOUNT_OPTIONS -- /dev/loop51 /mnt/scratch
    
    generic/001 32s ... _check_generic_filesystem: filesystem on /dev/loop50 is inconsistent
    (see /home/slavad/XFSTESTS-2/xfstests-dev/results//generic/001.full for details)
    
    Ran: generic/001
    Failures: generic/001
    Failed 1 of 1 tests
    
    fsck.hfs -d -n ./test-image.bin
    ** ./test-image.bin (NO WRITE)
            Using cacheBlockSize=32K cacheTotalBlock=1024 cacheSize=32768K.
       Executing fsck_hfs (version 540.1-Linux).
    ** Checking HFS volume.
       The volume name is untitled
    ** Checking extents overflow file.
    ** Checking catalog file.
       Unused node is not erased (node = 2)
       Unused node is not erased (node = 4)
    <skipped>
       Unused node is not erased (node = 253)
       Unused node is not erased (node = 254)
       Unused node is not erased (node = 255)
       Unused node is not erased (node = 256)
    ** Checking catalog hierarchy.
    ** Checking volume bitmap.
    ** Checking volume information.
       Verify Status: VIStat = 0x0000, ABTStat = 0x0000 EBTStat = 0x0000
                      CBTStat = 0x0004 CatStat = 0x00000000
    ** The volume untitled was found corrupt and needs to be repaired.
            volume type is HFS
            primary MDB is at block 2 0x02
            alternate MDB is at block 20971518 0x13ffffe
            primary VHB is at block 0 0x00
            alternate VHB is at block 0 0x00
            sector size = 512 0x200
            VolumeObject flags = 0x19
            total sectors for volume = 20971520 0x1400000
            total sectors for embedded volume = 0 0x00
    
    This patch adds logic of clearing the deleted b-tree node.
    
    sudo ./check generic/001
    FSTYP         -- hfs
    PLATFORM      -- Linux/x86_64 hfsplus-testing-0001 6.15.0-rc2+ #3 SMP PREEMPT_DYNAMIC Fri Apr 25 17:13:00 PDT 2025
    MKFS_OPTIONS  -- /dev/loop51
    MOUNT_OPTIONS -- /dev/loop51 /mnt/scratch
    
    generic/001 9s ...  32s
    Ran: generic/001
    Passed all 1 tests
    
    fsck.hfs -d -n ./test-image.bin
    ** ./test-image.bin (NO WRITE)
            Using cacheBlockSize=32K cacheTotalBlock=1024 cacheSize=32768K.
       Executing fsck_hfs (version 540.1-Linux).
    ** Checking HFS volume.
       The volume name is untitled
    ** Checking extents overflow file.
    ** Checking catalog file.
    ** Checking catalog hierarchy.
    ** Checking volume bitmap.
    ** Checking volume information.
    ** The volume untitled appears to be OK.
    
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Link: https://lore.kernel.org/r/20250430001211.1912533-1-slava@dubeyko.com
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hfs: fix slab-out-of-bounds in hfs_bnode_read() [+ + +]

Author: Viacheslav Dubeyko <slava@dubeyko.com>
Date:   Thu Jul 3 14:49:12 2025 -0700

    hfs: fix slab-out-of-bounds in hfs_bnode_read()
    
    [ Upstream commit a431930c9bac518bf99d6b1da526a7f37ddee8d8 ]
    
    This patch introduces is_bnode_offset_valid() method that checks
    the requested offset value. Also, it introduces
    check_and_correct_requested_length() method that checks and
    correct the requested length (if it is necessary). These methods
    are used in hfs_bnode_read(), hfs_bnode_write(), hfs_bnode_clear(),
    hfs_bnode_copy(), and hfs_bnode_move() with the goal to prevent
    the access out of allocated memory and triggering the crash.
    
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Link: https://lore.kernel.org/r/20250703214912.244138-1-slava@dubeyko.com
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hfsplus: don't use BUG_ON() in hfsplus_create_attributes_file() [+ + +]

Author: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Date:   Tue Jul 15 14:17:56 2025 +0900

    hfsplus: don't use BUG_ON() in hfsplus_create_attributes_file()
    
    [ Upstream commit c7c6363ca186747ebc2df10c8a1a51e66e0e32d9 ]
    
    When the volume header contains erroneous values that do not reflect
    the actual state of the filesystem, hfsplus_fill_super() assumes that
    the attributes file is not yet created, which later results in hitting
    BUG_ON() when hfsplus_create_attributes_file() is called. Replace this
    BUG_ON() with -EIO error with a message to suggest running fsck tool.
    
    Reported-by: syzbot <syzbot+1107451c16b9eb9d29e6@syzkaller.appspotmail.com>
    Closes: https://syzkaller.appspot.com/bug?extid=1107451c16b9eb9d29e6
    Signed-off-by: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
    Reviewed-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Link: https://lore.kernel.org/r/7b587d24-c8a1-4413-9b9a-00a33fbd849f@I-love.SAKURA.ne.jp
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hfsplus: fix slab-out-of-bounds in hfsplus_bnode_read() [+ + +]

Author: Viacheslav Dubeyko <slava@dubeyko.com>
Date:   Thu Jul 3 14:48:04 2025 -0700

    hfsplus: fix slab-out-of-bounds in hfsplus_bnode_read()
    
    [ Upstream commit c80aa2aaaa5e69d5219c6af8ef7e754114bd08d2 ]
    
    The hfsplus_bnode_read() method can trigger the issue:
    
    [  174.852007][ T9784] ==================================================================
    [  174.852709][ T9784] BUG: KASAN: slab-out-of-bounds in hfsplus_bnode_read+0x2f4/0x360
    [  174.853412][ T9784] Read of size 8 at addr ffff88810b5fc6c0 by task repro/9784
    [  174.854059][ T9784]
    [  174.854272][ T9784] CPU: 1 UID: 0 PID: 9784 Comm: repro Not tainted 6.16.0-rc3 #7 PREEMPT(full)
    [  174.854281][ T9784] Hardware name: QEMU Ubuntu 24.04 PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    [  174.854286][ T9784] Call Trace:
    [  174.854289][ T9784]  <TASK>
    [  174.854292][ T9784]  dump_stack_lvl+0x10e/0x1f0
    [  174.854305][ T9784]  print_report+0xd0/0x660
    [  174.854315][ T9784]  ? __virt_addr_valid+0x81/0x610
    [  174.854323][ T9784]  ? __phys_addr+0xe8/0x180
    [  174.854330][ T9784]  ? hfsplus_bnode_read+0x2f4/0x360
    [  174.854337][ T9784]  kasan_report+0xc6/0x100
    [  174.854346][ T9784]  ? hfsplus_bnode_read+0x2f4/0x360
    [  174.854354][ T9784]  hfsplus_bnode_read+0x2f4/0x360
    [  174.854362][ T9784]  hfsplus_bnode_dump+0x2ec/0x380
    [  174.854370][ T9784]  ? __pfx_hfsplus_bnode_dump+0x10/0x10
    [  174.854377][ T9784]  ? hfsplus_bnode_write_u16+0x83/0xb0
    [  174.854385][ T9784]  ? srcu_gp_start+0xd0/0x310
    [  174.854393][ T9784]  ? __mark_inode_dirty+0x29e/0xe40
    [  174.854402][ T9784]  hfsplus_brec_remove+0x3d2/0x4e0
    [  174.854411][ T9784]  __hfsplus_delete_attr+0x290/0x3a0
    [  174.854419][ T9784]  ? __pfx_hfs_find_1st_rec_by_cnid+0x10/0x10
    [  174.854427][ T9784]  ? __pfx___hfsplus_delete_attr+0x10/0x10
    [  174.854436][ T9784]  ? __asan_memset+0x23/0x50
    [  174.854450][ T9784]  hfsplus_delete_all_attrs+0x262/0x320
    [  174.854459][ T9784]  ? __pfx_hfsplus_delete_all_attrs+0x10/0x10
    [  174.854469][ T9784]  ? rcu_is_watching+0x12/0xc0
    [  174.854476][ T9784]  ? __mark_inode_dirty+0x29e/0xe40
    [  174.854483][ T9784]  hfsplus_delete_cat+0x845/0xde0
    [  174.854493][ T9784]  ? __pfx_hfsplus_delete_cat+0x10/0x10
    [  174.854507][ T9784]  hfsplus_unlink+0x1ca/0x7c0
    [  174.854516][ T9784]  ? __pfx_hfsplus_unlink+0x10/0x10
    [  174.854525][ T9784]  ? down_write+0x148/0x200
    [  174.854532][ T9784]  ? __pfx_down_write+0x10/0x10
    [  174.854540][ T9784]  vfs_unlink+0x2fe/0x9b0
    [  174.854549][ T9784]  do_unlinkat+0x490/0x670
    [  174.854557][ T9784]  ? __pfx_do_unlinkat+0x10/0x10
    [  174.854565][ T9784]  ? __might_fault+0xbc/0x130
    [  174.854576][ T9784]  ? getname_flags.part.0+0x1c5/0x550
    [  174.854584][ T9784]  __x64_sys_unlink+0xc5/0x110
    [  174.854592][ T9784]  do_syscall_64+0xc9/0x480
    [  174.854600][ T9784]  entry_SYSCALL_64_after_hwframe+0x77/0x7f
    [  174.854608][ T9784] RIP: 0033:0x7f6fdf4c3167
    [  174.854614][ T9784] Code: f0 ff ff 73 01 c3 48 8b 0d 26 0d 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 08
    [  174.854622][ T9784] RSP: 002b:00007ffcb948bca8 EFLAGS: 00000206 ORIG_RAX: 0000000000000057
    [  174.854630][ T9784] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f6fdf4c3167
    [  174.854636][ T9784] RDX: 00007ffcb948bcc0 RSI: 00007ffcb948bcc0 RDI: 00007ffcb948bd50
    [  174.854641][ T9784] RBP: 00007ffcb948cd90 R08: 0000000000000001 R09: 00007ffcb948bb40
    [  174.854645][ T9784] R10: 00007f6fdf564fc0 R11: 0000000000000206 R12: 0000561e1bc9c2d0
    [  174.854650][ T9784] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
    [  174.854658][ T9784]  </TASK>
    [  174.854661][ T9784]
    [  174.879281][ T9784] Allocated by task 9784:
    [  174.879664][ T9784]  kasan_save_stack+0x20/0x40
    [  174.880082][ T9784]  kasan_save_track+0x14/0x30
    [  174.880500][ T9784]  __kasan_kmalloc+0xaa/0xb0
    [  174.880908][ T9784]  __kmalloc_noprof+0x205/0x550
    [  174.881337][ T9784]  __hfs_bnode_create+0x107/0x890
    [  174.881779][ T9784]  hfsplus_bnode_find+0x2d0/0xd10
    [  174.882222][ T9784]  hfsplus_brec_find+0x2b0/0x520
    [  174.882659][ T9784]  hfsplus_delete_all_attrs+0x23b/0x320
    [  174.883144][ T9784]  hfsplus_delete_cat+0x845/0xde0
    [  174.883595][ T9784]  hfsplus_rmdir+0x106/0x1b0
    [  174.884004][ T9784]  vfs_rmdir+0x206/0x690
    [  174.884379][ T9784]  do_rmdir+0x2b7/0x390
    [  174.884751][ T9784]  __x64_sys_rmdir+0xc5/0x110
    [  174.885167][ T9784]  do_syscall_64+0xc9/0x480
    [  174.885568][ T9784]  entry_SYSCALL_64_after_hwframe+0x77/0x7f
    [  174.886083][ T9784]
    [  174.886293][ T9784] The buggy address belongs to the object at ffff88810b5fc600
    [  174.886293][ T9784]  which belongs to the cache kmalloc-192 of size 192
    [  174.887507][ T9784] The buggy address is located 40 bytes to the right of
    [  174.887507][ T9784]  allocated 152-byte region [ffff88810b5fc600, ffff88810b5fc698)
    [  174.888766][ T9784]
    [  174.888976][ T9784] The buggy address belongs to the physical page:
    [  174.889533][ T9784] page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x10b5fc
    [  174.890295][ T9784] flags: 0x57ff00000000000(node=1|zone=2|lastcpupid=0x7ff)
    [  174.890927][ T9784] page_type: f5(slab)
    [  174.891284][ T9784] raw: 057ff00000000000 ffff88801b4423c0 ffffea000426dc80 dead000000000002
    [  174.892032][ T9784] raw: 0000000000000000 0000000080100010 00000000f5000000 0000000000000000
    [  174.892774][ T9784] page dumped because: kasan: bad access detected
    [  174.893327][ T9784] page_owner tracks the page as allocated
    [  174.893825][ T9784] page last allocated via order 0, migratetype Unmovable, gfp_mask 0x52c00(GFP_NOIO|__GFP_NOWARN|__GFP_NO1
    [  174.895373][ T9784]  post_alloc_hook+0x1c0/0x230
    [  174.895801][ T9784]  get_page_from_freelist+0xdeb/0x3b30
    [  174.896284][ T9784]  __alloc_frozen_pages_noprof+0x25c/0x2460
    [  174.896810][ T9784]  alloc_pages_mpol+0x1fb/0x550
    [  174.897242][ T9784]  new_slab+0x23b/0x340
    [  174.897614][ T9784]  ___slab_alloc+0xd81/0x1960
    [  174.898028][ T9784]  __slab_alloc.isra.0+0x56/0xb0
    [  174.898468][ T9784]  __kmalloc_noprof+0x2b0/0x550
    [  174.898896][ T9784]  usb_alloc_urb+0x73/0xa0
    [  174.899289][ T9784]  usb_control_msg+0x1cb/0x4a0
    [  174.899718][ T9784]  usb_get_string+0xab/0x1a0
    [  174.900133][ T9784]  usb_string_sub+0x107/0x3c0
    [  174.900549][ T9784]  usb_string+0x307/0x670
    [  174.900933][ T9784]  usb_cache_string+0x80/0x150
    [  174.901355][ T9784]  usb_new_device+0x1d0/0x19d0
    [  174.901786][ T9784]  register_root_hub+0x299/0x730
    [  174.902231][ T9784] page last free pid 10 tgid 10 stack trace:
    [  174.902757][ T9784]  __free_frozen_pages+0x80c/0x1250
    [  174.903217][ T9784]  vfree.part.0+0x12b/0xab0
    [  174.903645][ T9784]  delayed_vfree_work+0x93/0xd0
    [  174.904073][ T9784]  process_one_work+0x9b5/0x1b80
    [  174.904519][ T9784]  worker_thread+0x630/0xe60
    [  174.904927][ T9784]  kthread+0x3a8/0x770
    [  174.905291][ T9784]  ret_from_fork+0x517/0x6e0
    [  174.905709][ T9784]  ret_from_fork_asm+0x1a/0x30
    [  174.906128][ T9784]
    [  174.906338][ T9784] Memory state around the buggy address:
    [  174.906828][ T9784]  ffff88810b5fc580: fb fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
    [  174.907528][ T9784]  ffff88810b5fc600: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
    [  174.908222][ T9784] >ffff88810b5fc680: 00 00 00 fc fc fc fc fc fc fc fc fc fc fc fc fc
    [  174.908917][ T9784]                                            ^
    [  174.909481][ T9784]  ffff88810b5fc700: fa fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
    [  174.910432][ T9784]  ffff88810b5fc780: fb fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
    [  174.911401][ T9784] ==================================================================
    
    The reason of the issue that code doesn't check the correctness
    of the requested offset and length. As a result, incorrect value
    of offset or/and length could result in access out of allocated
    memory.
    
    This patch introduces is_bnode_offset_valid() method that checks
    the requested offset value. Also, it introduces
    check_and_correct_requested_length() method that checks and
    correct the requested length (if it is necessary). These methods
    are used in hfsplus_bnode_read(), hfsplus_bnode_write(),
    hfsplus_bnode_clear(), hfsplus_bnode_copy(), and hfsplus_bnode_move()
    with the goal to prevent the access out of allocated memory
    and triggering the crash.
    
    Reported-by: Kun Hu <huk23@m.fudan.edu.cn>
    Reported-by: Jiaji Qin <jjtan24@m.fudan.edu.cn>
    Reported-by: Shuoran Bai <baishuoran@hrbeu.edu.cn>
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Link: https://lore.kernel.org/r/20250703214804.244077-1-slava@dubeyko.com
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hfsplus: fix slab-out-of-bounds read in hfsplus_uni2asc() [+ + +]

Author: Viacheslav Dubeyko <slava@dubeyko.com>
Date:   Thu Jul 10 16:08:30 2025 -0700

    hfsplus: fix slab-out-of-bounds read in hfsplus_uni2asc()
    
    [ Upstream commit 94458781aee6045bd3d0ad4b80b02886b9e2219b ]
    
    The hfsplus_readdir() method is capable to crash by calling
    hfsplus_uni2asc():
    
    [  667.121659][ T9805] ==================================================================
    [  667.122651][ T9805] BUG: KASAN: slab-out-of-bounds in hfsplus_uni2asc+0x902/0xa10
    [  667.123627][ T9805] Read of size 2 at addr ffff88802592f40c by task repro/9805
    [  667.124578][ T9805]
    [  667.124876][ T9805] CPU: 3 UID: 0 PID: 9805 Comm: repro Not tainted 6.16.0-rc3 #1 PREEMPT(full)
    [  667.124886][ T9805] Hardware name: QEMU Ubuntu 24.04 PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    [  667.124890][ T9805] Call Trace:
    [  667.124893][ T9805]  <TASK>
    [  667.124896][ T9805]  dump_stack_lvl+0x10e/0x1f0
    [  667.124911][ T9805]  print_report+0xd0/0x660
    [  667.124920][ T9805]  ? __virt_addr_valid+0x81/0x610
    [  667.124928][ T9805]  ? __phys_addr+0xe8/0x180
    [  667.124934][ T9805]  ? hfsplus_uni2asc+0x902/0xa10
    [  667.124942][ T9805]  kasan_report+0xc6/0x100
    [  667.124950][ T9805]  ? hfsplus_uni2asc+0x902/0xa10
    [  667.124959][ T9805]  hfsplus_uni2asc+0x902/0xa10
    [  667.124966][ T9805]  ? hfsplus_bnode_read+0x14b/0x360
    [  667.124974][ T9805]  hfsplus_readdir+0x845/0xfc0
    [  667.124984][ T9805]  ? __pfx_hfsplus_readdir+0x10/0x10
    [  667.124994][ T9805]  ? stack_trace_save+0x8e/0xc0
    [  667.125008][ T9805]  ? iterate_dir+0x18b/0xb20
    [  667.125015][ T9805]  ? trace_lock_acquire+0x85/0xd0
    [  667.125022][ T9805]  ? lock_acquire+0x30/0x80
    [  667.125029][ T9805]  ? iterate_dir+0x18b/0xb20
    [  667.125037][ T9805]  ? down_read_killable+0x1ed/0x4c0
    [  667.125044][ T9805]  ? putname+0x154/0x1a0
    [  667.125051][ T9805]  ? __pfx_down_read_killable+0x10/0x10
    [  667.125058][ T9805]  ? apparmor_file_permission+0x239/0x3e0
    [  667.125069][ T9805]  iterate_dir+0x296/0xb20
    [  667.125076][ T9805]  __x64_sys_getdents64+0x13c/0x2c0
    [  667.125084][ T9805]  ? __pfx___x64_sys_getdents64+0x10/0x10
    [  667.125091][ T9805]  ? __x64_sys_openat+0x141/0x200
    [  667.125126][ T9805]  ? __pfx_filldir64+0x10/0x10
    [  667.125134][ T9805]  ? do_user_addr_fault+0x7fe/0x12f0
    [  667.125143][ T9805]  do_syscall_64+0xc9/0x480
    [  667.125151][ T9805]  entry_SYSCALL_64_after_hwframe+0x77/0x7f
    [  667.125158][ T9805] RIP: 0033:0x7fa8753b2fc9
    [  667.125164][ T9805] Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 48
    [  667.125172][ T9805] RSP: 002b:00007ffe96f8e0f8 EFLAGS: 00000217 ORIG_RAX: 00000000000000d9
    [  667.125181][ T9805] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fa8753b2fc9
    [  667.125185][ T9805] RDX: 0000000000000400 RSI: 00002000000063c0 RDI: 0000000000000004
    [  667.125190][ T9805] RBP: 00007ffe96f8e110 R08: 00007ffe96f8e110 R09: 00007ffe96f8e110
    [  667.125195][ T9805] R10: 0000000000000000 R11: 0000000000000217 R12: 0000556b1e3b4260
    [  667.125199][ T9805] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
    [  667.125207][ T9805]  </TASK>
    [  667.125210][ T9805]
    [  667.145632][ T9805] Allocated by task 9805:
    [  667.145991][ T9805]  kasan_save_stack+0x20/0x40
    [  667.146352][ T9805]  kasan_save_track+0x14/0x30
    [  667.146717][ T9805]  __kasan_kmalloc+0xaa/0xb0
    [  667.147065][ T9805]  __kmalloc_noprof+0x205/0x550
    [  667.147448][ T9805]  hfsplus_find_init+0x95/0x1f0
    [  667.147813][ T9805]  hfsplus_readdir+0x220/0xfc0
    [  667.148174][ T9805]  iterate_dir+0x296/0xb20
    [  667.148549][ T9805]  __x64_sys_getdents64+0x13c/0x2c0
    [  667.148937][ T9805]  do_syscall_64+0xc9/0x480
    [  667.149291][ T9805]  entry_SYSCALL_64_after_hwframe+0x77/0x7f
    [  667.149809][ T9805]
    [  667.150030][ T9805] The buggy address belongs to the object at ffff88802592f000
    [  667.150030][ T9805]  which belongs to the cache kmalloc-2k of size 2048
    [  667.151282][ T9805] The buggy address is located 0 bytes to the right of
    [  667.151282][ T9805]  allocated 1036-byte region [ffff88802592f000, ffff88802592f40c)
    [  667.152580][ T9805]
    [  667.152798][ T9805] The buggy address belongs to the physical page:
    [  667.153373][ T9805] page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x25928
    [  667.154157][ T9805] head: order:3 mapcount:0 entire_mapcount:0 nr_pages_mapped:0 pincount:0
    [  667.154916][ T9805] anon flags: 0xfff00000000040(head|node=0|zone=1|lastcpupid=0x7ff)
    [  667.155631][ T9805] page_type: f5(slab)
    [  667.155997][ T9805] raw: 00fff00000000040 ffff88801b442f00 0000000000000000 dead000000000001
    [  667.156770][ T9805] raw: 0000000000000000 0000000080080008 00000000f5000000 0000000000000000
    [  667.157536][ T9805] head: 00fff00000000040 ffff88801b442f00 0000000000000000 dead000000000001
    [  667.158317][ T9805] head: 0000000000000000 0000000080080008 00000000f5000000 0000000000000000
    [  667.159088][ T9805] head: 00fff00000000003 ffffea0000964a01 00000000ffffffff 00000000ffffffff
    [  667.159865][ T9805] head: ffffffffffffffff 0000000000000000 00000000ffffffff 0000000000000008
    [  667.160643][ T9805] page dumped because: kasan: bad access detected
    [  667.161216][ T9805] page_owner tracks the page as allocated
    [  667.161732][ T9805] page last allocated via order 3, migratetype Unmovable, gfp_mask 0xd20c0(__GFP_IO|__GFP_FS|__GFP_NOWARN9
    [  667.163566][ T9805]  post_alloc_hook+0x1c0/0x230
    [  667.164003][ T9805]  get_page_from_freelist+0xdeb/0x3b30
    [  667.164503][ T9805]  __alloc_frozen_pages_noprof+0x25c/0x2460
    [  667.165040][ T9805]  alloc_pages_mpol+0x1fb/0x550
    [  667.165489][ T9805]  new_slab+0x23b/0x340
    [  667.165872][ T9805]  ___slab_alloc+0xd81/0x1960
    [  667.166313][ T9805]  __slab_alloc.isra.0+0x56/0xb0
    [  667.166767][ T9805]  __kmalloc_cache_noprof+0x255/0x3e0
    [  667.167255][ T9805]  psi_cgroup_alloc+0x52/0x2d0
    [  667.167693][ T9805]  cgroup_mkdir+0x694/0x1210
    [  667.168118][ T9805]  kernfs_iop_mkdir+0x111/0x190
    [  667.168568][ T9805]  vfs_mkdir+0x59b/0x8d0
    [  667.168956][ T9805]  do_mkdirat+0x2ed/0x3d0
    [  667.169353][ T9805]  __x64_sys_mkdir+0xef/0x140
    [  667.169784][ T9805]  do_syscall_64+0xc9/0x480
    [  667.170195][ T9805]  entry_SYSCALL_64_after_hwframe+0x77/0x7f
    [  667.170730][ T9805] page last free pid 1257 tgid 1257 stack trace:
    [  667.171304][ T9805]  __free_frozen_pages+0x80c/0x1250
    [  667.171770][ T9805]  vfree.part.0+0x12b/0xab0
    [  667.172182][ T9805]  delayed_vfree_work+0x93/0xd0
    [  667.172612][ T9805]  process_one_work+0x9b5/0x1b80
    [  667.173067][ T9805]  worker_thread+0x630/0xe60
    [  667.173486][ T9805]  kthread+0x3a8/0x770
    [  667.173857][ T9805]  ret_from_fork+0x517/0x6e0
    [  667.174278][ T9805]  ret_from_fork_asm+0x1a/0x30
    [  667.174703][ T9805]
    [  667.174917][ T9805] Memory state around the buggy address:
    [  667.175411][ T9805]  ffff88802592f300: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
    [  667.176114][ T9805]  ffff88802592f380: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
    [  667.176830][ T9805] >ffff88802592f400: 00 04 fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [  667.177547][ T9805]                       ^
    [  667.177933][ T9805]  ffff88802592f480: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [  667.178640][ T9805]  ffff88802592f500: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    [  667.179350][ T9805] ==================================================================
    
    The hfsplus_uni2asc() method operates by struct hfsplus_unistr:
    
    struct hfsplus_unistr {
            __be16 length;
            hfsplus_unichr unicode[HFSPLUS_MAX_STRLEN];
    } __packed;
    
    where HFSPLUS_MAX_STRLEN is 255 bytes. The issue happens if length
    of the structure instance has value bigger than 255 (for example,
    65283). In such case, pointer on unicode buffer is going beyond of
    the allocated memory.
    
    The patch fixes the issue by checking the length value of
    hfsplus_unistr instance and using 255 value in the case if length
    value is bigger than HFSPLUS_MAX_STRLEN. Potential reason of such
    situation could be a corruption of Catalog File b-tree's node.
    
    Reported-by: Wenzhi Wang <wenzhi.wang@uwaterloo.ca>
    Signed-off-by: Liu Shixin <liushixin2@huawei.com>
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    cc: John Paul Adrian Glaubitz <glaubitz@physik.fu-berlin.de>
    cc: Yangtao Li <frank.li@vivo.com>
    cc: linux-fsdevel@vger.kernel.org
    Reviewed-by: Yangtao Li <frank.li@vivo.com>
    Link: https://lore.kernel.org/r/20250710230830.110500-1-slava@dubeyko.com
    Signed-off-by: Viacheslav Dubeyko <slava@dubeyko.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

HID: rate-limit hid_warn to prevent log flooding [+ + +]

Author: Li Chen <chenl311@chinatelecom.cn>
Date:   Fri Jun 20 20:02:31 2025 +0800

    HID: rate-limit hid_warn to prevent log flooding
    
    [ Upstream commit 4051ead99888f101be92c7ce90d2de09aac6fd1c ]
    
    Syzkaller can create many uhid devices that trigger
    repeated warnings like:
    
      "hid-generic xxxx: unknown main item tag 0x0"
    
    These messages can flood the system log, especially if a crash occurs
    (e.g., with a slow UART console, leading to soft lockups). To mitigate
    this, convert `hid_warn()` to use `dev_warn_ratelimited()`.
    
    This helps reduce log noise and improves system stability under fuzzing
    or faulty device scenarios.
    
    Signed-off-by: Li Chen <chenl311@chinatelecom.cn>
    Signed-off-by: Jiri Kosina <jkosina@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hv_netvsc: Fix panic during namespace deletion with VF [+ + +]

Author: Haiyang Zhang <haiyangz@microsoft.com>
Date:   Wed Aug 6 13:21:51 2025 -0700

    hv_netvsc: Fix panic during namespace deletion with VF
    
    commit 33caa208dba6fa639e8a92fd0c8320b652e5550c upstream.
    
    The existing code move the VF NIC to new namespace when NETDEV_REGISTER is
    received on netvsc NIC. During deletion of the namespace,
    default_device_exit_batch() >> default_device_exit_net() is called. When
    netvsc NIC is moved back and registered to the default namespace, it
    automatically brings VF NIC back to the default namespace. This will cause
    the default_device_exit_net() >> for_each_netdev_safe loop unable to detect
    the list end, and hit NULL ptr:
    
    [  231.449420] mana 7870:00:00.0 enP30832s1: Moved VF to namespace with: eth0
    [  231.449656] BUG: kernel NULL pointer dereference, address: 0000000000000010
    [  231.450246] #PF: supervisor read access in kernel mode
    [  231.450579] #PF: error_code(0x0000) - not-present page
    [  231.450916] PGD 17b8a8067 P4D 0
    [  231.451163] Oops: Oops: 0000 [#1] SMP NOPTI
    [  231.451450] CPU: 82 UID: 0 PID: 1394 Comm: kworker/u768:1 Not tainted 6.16.0-rc4+ #3 VOLUNTARY
    [  231.452042] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS Hyper-V UEFI Release v4.1 11/21/2024
    [  231.452692] Workqueue: netns cleanup_net
    [  231.452947] RIP: 0010:default_device_exit_batch+0x16c/0x3f0
    [  231.453326] Code: c0 0c f5 b3 e8 d5 db fe ff 48 85 c0 74 15 48 c7 c2 f8 fd ca b2 be 10 00 00 00 48 8d 7d c0 e8 7b 77 25 00 49 8b 86 28 01 00 00 <48> 8b 50 10 4c 8b 2a 4c 8d 62 f0 49 83 ed 10 4c 39 e0 0f 84 d6 00
    [  231.454294] RSP: 0018:ff75fc7c9bf9fd00 EFLAGS: 00010246
    [  231.454610] RAX: 0000000000000000 RBX: 0000000000000002 RCX: 61c8864680b583eb
    [  231.455094] RDX: ff1fa9f71462d800 RSI: ff75fc7c9bf9fd38 RDI: 0000000030766564
    [  231.455686] RBP: ff75fc7c9bf9fd78 R08: 0000000000000000 R09: 0000000000000000
    [  231.456126] R10: 0000000000000001 R11: 0000000000000004 R12: ff1fa9f70088e340
    [  231.456621] R13: ff1fa9f70088e340 R14: ffffffffb3f50c20 R15: ff1fa9f7103e6340
    [  231.457161] FS:  0000000000000000(0000) GS:ff1faa6783a08000(0000) knlGS:0000000000000000
    [  231.457707] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [  231.458031] CR2: 0000000000000010 CR3: 0000000179ab2006 CR4: 0000000000b73ef0
    [  231.458434] Call Trace:
    [  231.458600]  <TASK>
    [  231.458777]  ops_undo_list+0x100/0x220
    [  231.459015]  cleanup_net+0x1b8/0x300
    [  231.459285]  process_one_work+0x184/0x340
    
    To fix it, move the ns change to a workqueue, and take rtnl_lock to avoid
    changing the netdev list when default_device_exit_net() is using it.
    
    Cc: stable@vger.kernel.org
    Fixes: 4c262801ea60 ("hv_netvsc: Fix VF namespace also in synthetic NIC NETDEV_REGISTER event")
    Signed-off-by: Haiyang Zhang <haiyangz@microsoft.com>
    Link: https://patch.msgid.link/1754511711-11188-1-git-send-email-haiyangz@linux.microsoft.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (emc2305) Set initial PWM minimum value during probe based on thermal state [+ + +]

Author: Florin Leotescu <florin.leotescu@nxp.com>
Date:   Tue Jun 3 14:31:25 2025 +0300

    hwmon: (emc2305) Set initial PWM minimum value during probe based on thermal state
    
    [ Upstream commit 0429415a084a15466e87d504e8c2a502488184a5 ]
    
    Prevent the PWM value from being set to minimum when thermal zone
    temperature exceeds any trip point during driver probe. Otherwise, the
    PWM fan speed will remains at minimum speed and not respond to
    temperature changes.
    
    Signed-off-by: Florin Leotescu <florin.leotescu@nxp.com>
    Link: https://lore.kernel.org/r/20250603113125.3175103-5-florin.leotescu@oss.nxp.com
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

i2c: core: Fix double-free of fwnode in i2c_unregister_device() [+ + +]

Author: Hans de Goede <hansg@kernel.org>
Date:   Sat Jul 19 20:01:04 2025 +0200

    i2c: core: Fix double-free of fwnode in i2c_unregister_device()
    
    commit 1c24e5fc0c7096e00c202a6a3e0c342c1afb47c2 upstream.
    
    Before commit df6d7277e552 ("i2c: core: Do not dereference fwnode in struct
    device"), i2c_unregister_device() only called fwnode_handle_put() on
    of_node-s in the form of calling of_node_put(client->dev.of_node).
    
    But after this commit the i2c_client's fwnode now unconditionally gets
    fwnode_handle_put() on it.
    
    When the i2c_client has no primary (ACPI / OF) fwnode but it does have
    a software fwnode, the software-node will be the primary node and
    fwnode_handle_put() will put() it.
    
    But for the software fwnode device_remove_software_node() will also put()
    it leading to a double free:
    
    [   82.665598] ------------[ cut here ]------------
    [   82.665609] refcount_t: underflow; use-after-free.
    [   82.665808] WARNING: CPU: 3 PID: 1502 at lib/refcount.c:28 refcount_warn_saturate+0xba/0x11
    ...
    [   82.666830] RIP: 0010:refcount_warn_saturate+0xba/0x110
    ...
    [   82.666962]  <TASK>
    [   82.666971]  i2c_unregister_device+0x60/0x90
    
    Fix this by not calling fwnode_handle_put() when the primary fwnode is
    a software-node.
    
    Fixes: df6d7277e552 ("i2c: core: Do not dereference fwnode in struct device")
    Cc: stable@vger.kernel.org
    Signed-off-by: Hans de Goede <hansg@kernel.org>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Signed-off-by: Wolfram Sang <wsa+renesas@sang-engineering.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

i2c: Force DLL0945 touchpad i2c freq to 100khz [+ + +]

Author: fangzhong.zhou <myth5@myth5.com>
Date:   Sun Aug 3 07:15:54 2025 +0800

    i2c: Force DLL0945 touchpad i2c freq to 100khz
    
    [ Upstream commit 0b7c9528facdb5a73ad78fea86d2e95a6c48dbc4 ]
    
    This patch fixes an issue where the touchpad cursor movement becomes
    slow on the Dell Precision 5560. Force the touchpad freq to 100khz
    as a workaround.
    
    Tested on Dell Precision 5560 with 6.14 to 6.14.6. Cursor movement
    is now smooth and responsive.
    
    Signed-off-by: fangzhong.zhou <myth5@myth5.com>
    [wsa: kept sorting and removed unnecessary parts from commit msg]
    Signed-off-by: Wolfram Sang <wsa+renesas@sang-engineering.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

i3c: add missing include to internal header [+ + +]

Author: Wolfram Sang <wsa+renesas@sang-engineering.com>
Date:   Thu Jul 17 14:00:47 2025 +0200

    i3c: add missing include to internal header
    
    [ Upstream commit 3b661ca549b9e5bb11d0bc97ada6110aac3282d2 ]
    
    LKP found a random config which failed to build because IO accessors
    were not defined:
    
       In file included from drivers/i3c/master.c:21:
       drivers/i3c/internals.h: In function 'i3c_writel_fifo':
    >> drivers/i3c/internals.h:35:9: error: implicit declaration of function 'writesl' [-Werror=implicit-function-declaration]
    
    Add the proper header to where the IO accessors are used.
    
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202507150208.BZDzzJ5E-lkp@intel.com/
    Signed-off-by: Wolfram Sang <wsa+renesas@sang-engineering.com>
    Reviewed-by: Frank Li <Frank.Li@nxp.com>
    Link: https://lore.kernel.org/r/20250717120046.9022-2-wsa+renesas@sang-engineering.com
    Signed-off-by: Alexandre Belloni <alexandre.belloni@bootlin.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

i3c: don't fail if GETHDRCAP is unsupported [+ + +]

Author: Wolfram Sang <wsa+renesas@sang-engineering.com>
Date:   Fri Jul 4 22:44:32 2025 +0200

    i3c: don't fail if GETHDRCAP is unsupported
    
    [ Upstream commit 447270cdb41b1c8c3621bb14b93a6749f942556e ]
    
    'I3C_BCR_HDR_CAP' is still spec v1.0 and has been renamed to 'advanced
    capabilities' in v1.1 onwards. The ST pressure sensor LPS22DF does not
    have HDR, but has the 'advanced cap' bit set. The core still wants to
    get additional information using the CCC 'GETHDRCAP' (or GETCAPS in v1.1
    onwards). Not all controllers support this CCC and will notify the upper
    layers about it. For instantiating the device, we can ignore this
    unsupported CCC as standard communication will work. Without this patch,
    the device will not be instantiated at all.
    
    Signed-off-by: Wolfram Sang <wsa+renesas@sang-engineering.com>
    Reviewed-by: Frank Li <Frank.Li@nxp.com>
    Link: https://lore.kernel.org/r/20250704204524.6124-1-wsa+renesas@sang-engineering.com
    Signed-off-by: Alexandre Belloni <alexandre.belloni@bootlin.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

i3c: master: Initialize ret in i3c_i2c_notifier_call() [+ + +]

Author: Jorge Marques <jorge.marques@analog.com>
Date:   Sun Jun 22 12:11:07 2025 +0200

    i3c: master: Initialize ret in i3c_i2c_notifier_call()
    
    [ Upstream commit 290ce8b2d0745e45a3155268184523a8c75996f1 ]
    
    Set ret to -EINVAL if i3c_i2c_notifier_call() receives an invalid
    action, resolving uninitialized warning.
    
    Signed-off-by: Jorge Marques <jorge.marques@analog.com>
    Reviewed-by: Frank Li <Frank.Li@nxp.com>
    Link: https://lore.kernel.org/r/20250622-i3c-master-ret-uninitialized-v1-1-aabb5625c932@analog.com
    Signed-off-by: Alexandre Belloni <alexandre.belloni@bootlin.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

idpf: preserve coalescing settings across resets [+ + +]

Author: Ahmed Zaki <ahmed.zaki@intel.com>
Date:   Fri Jun 20 11:15:48 2025 -0600

    idpf: preserve coalescing settings across resets
    
    [ Upstream commit e1e3fec3e34b4934a9d2c98e4ee00a4d87b19179 ]
    
    The IRQ coalescing config currently reside only inside struct
    idpf_q_vector. However, all idpf_q_vector structs are de-allocated and
    re-allocated during resets. This leads to user-set coalesce configuration
    to be lost.
    
    Add new fields to struct idpf_vport_user_config_data to save the user
    settings and re-apply them after reset.
    
    Reviewed-by: Madhu Chittim <madhu.chittim@intel.com>
    Signed-off-by: Ahmed Zaki <ahmed.zaki@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Willem de Bruijn <willemb@google.com>
    Tested-by: Samuel Salin <Samuel.salin@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: adc: ad7768-1: Ensure SYNC_IN pulse minimum timing requirement [+ + +]

Author: Jonathan Santos <Jonathan.Santos@analog.com>
Date:   Wed Jun 4 16:35:21 2025 -0300

    iio: adc: ad7768-1: Ensure SYNC_IN pulse minimum timing requirement
    
    [ Upstream commit 7e54d932873d91a55d1b89b7389876d78aeeab32 ]
    
    The SYNC_IN pulse width must be at least 1.5 x Tmclk, corresponding to
    ~2.5 µs at the lowest supported MCLK frequency. Add a 3 µs delay to
    ensure reliable synchronization timing even for the worst-case scenario.
    
    Signed-off-by: Jonathan Santos <Jonathan.Santos@analog.com>
    Reviewed-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/d3ee92a533cd1207cf5c5cc4d7bdbb5c6c267f68.1749063024.git.Jonathan.Santos@analog.com
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iio: adc: ad_sigma_delta: don't overallocate scan buffer [+ + +]

Author: David Lechner <dlechner@baylibre.com>
Date:   Tue Jul 1 16:37:49 2025 -0500

    iio: adc: ad_sigma_delta: don't overallocate scan buffer
    
    [ Upstream commit 5a2f15c5a8e017d0951e6dc62aa7b5b634f56881 ]
    
    Fix overallocating the size of the scan buffer by converting bits to
    bytes. The size is meant to be in bytes, so scanbits needs to be
    divided by 8.
    
    Signed-off-by: David Lechner <dlechner@baylibre.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Reviewed-by: Nuno Sá <nuno.sa@analog.com>
    Link: https://patch.msgid.link/20250701-iio-adc-ad7173-add-spi-offload-support-v3-1-42abb83e3dac@baylibre.com
    Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

imx8m-blk-ctrl: set ISI panic write hurry level [+ + +]

Author: Krzysztof Hałasa <khalasa@piap.pl>
Date:   Fri May 9 11:26:55 2025 +0200

    imx8m-blk-ctrl: set ISI panic write hurry level
    
    [ Upstream commit c01fba0b4869cada5403fffff416cd1675dba078 ]
    
    Apparently, ISI needs cache settings similar to LCDIF.
    Otherwise we get artefacts in the image.
    Tested on i.MX8MP.
    
    Signed-off-by: Krzysztof Hałasa <khalasa@piap.pl>
    Link: https://lore.kernel.org/r/m3ldr69lsw.fsf@t19.piap.pl
    Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

intel_idle: Allow loading ACPI tables for any family [+ + +]

Author: Len Brown <len.brown@intel.com>
Date:   Fri Aug 8 15:37:14 2025 -0400

    intel_idle: Allow loading ACPI tables for any family
    
    [ Upstream commit e91a158b694d7f4bd937763dde79ed0afa472d8a ]
    
    There is no reason to limit intel_idle's loading of ACPI tables to
    family 6.  Upcoming Intel processors are not in family 6.
    
    Below "Fixes" really means "applies cleanly until".
    That syntax commit didn't change the previous logic,
    but shows this patch applies back 5-years.
    
    Fixes: 4a9f45a0533f ("intel_idle: Convert to new X86 CPU match macros")
    Signed-off-by: Len Brown <len.brown@intel.com>
    Link: https://patch.msgid.link/06101aa4fe784e5b0be1cb2c0bdd9afcf16bd9d4.1754681697.git.len.brown@intel.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

io_uring/memmap: cast nr_pages to size_t before shifting [+ + +]

Author: Jens Axboe <axboe@kernel.dk>
Date:   Fri Aug 8 06:35:14 2025 -0600

    io_uring/memmap: cast nr_pages to size_t before shifting
    
    commit 33503c083fda048c77903460ac0429e1e2c0e341 upstream.
    
    If the allocated size exceeds UINT_MAX, then it's necessary to cast
    the mr->nr_pages value to size_t to prevent it from overflowing. In
    practice this isn't much of a concern as the required memory size will
    have been validated upfront, and accounted to the user. And > 4GB sizes
    will be necessary to make the lack of a cast a problem, which greatly
    exceeds normal user locked_vm settings that are generally in the kb to
    mb range. However, if root is used, then accounting isn't done, and
    then it's possible to hit this issue.
    
    Link: https://lore.kernel.org/all/6895b298.050a0220.7f033.0059.GAE@google.com/
    Cc: stable@vger.kernel.org
    Reported-by: syzbot+23727438116feb13df15@syzkaller.appspotmail.com
    Fixes: 087f997870a9 ("io_uring/memmap: implement mmap for regions")
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring/net: commit partial buffers on retry [+ + +]

Author: Jens Axboe <axboe@kernel.dk>
Date:   Tue Aug 12 08:30:11 2025 -0600

    io_uring/net: commit partial buffers on retry
    
    commit 41b70df5b38bc80967d2e0ed55cc3c3896bba781 upstream.
    
    Ring provided buffers are potentially only valid within the single
    execution context in which they were acquired. io_uring deals with this
    and invalidates them on retry. But on the networking side, if
    MSG_WAITALL is set, or if the socket is of the streaming type and too
    little was processed, then it will hang on to the buffer rather than
    recycle or commit it. This is problematic for two reasons:
    
    1) If someone unregisters the provided buffer ring before a later retry,
       then the req->buf_list will no longer be valid.
    
    2) If multiple sockers are using the same buffer group, then multiple
       receives can consume the same memory. This can cause data corruption
       in the application, as either receive could land in the same
       userspace buffer.
    
    Fix this by disallowing partial retries from pinning a provided buffer
    across multiple executions, if ring provided buffers are used.
    
    Cc: stable@vger.kernel.org
    Reported-by: pt x <superman.xpt@gmail.com>
    Fixes: c56e022c0a27 ("io_uring: add support for user mapped provided buffer ring")
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring/rw: cast rw->flags assignment to rwf_t [+ + +]

Author: Jens Axboe <axboe@kernel.dk>
Date:   Mon Jul 7 16:46:30 2025 -0600

    io_uring/rw: cast rw->flags assignment to rwf_t
    
    commit 825aea662b492571877b32aeeae13689fd9fbee4 upstream.
    
    kernel test robot reports that a recent change of the sqe->rw_flags
    field throws a sparse warning on 32-bit archs:
    
    >> io_uring/rw.c:291:19: sparse: sparse: incorrect type in assignment (different base types) @@     expected restricted __kernel_rwf_t [usertype] flags @@     got unsigned int @@
       io_uring/rw.c:291:19: sparse:     expected restricted __kernel_rwf_t [usertype] flags
       io_uring/rw.c:291:19: sparse:     got unsigned int
    
    Force cast it to rwf_t to silence that new sparse warning.
    
    Fixes: cf73d9970ea4 ("io_uring: don't use int for ABI")
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202507032211.PwSNPNSP-lkp@intel.com/
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring/zcrx: account area memory [+ + +]

Author: Pavel Begunkov <asml.silence@gmail.com>
Date:   Wed Jul 16 22:04:09 2025 +0100

    io_uring/zcrx: account area memory
    
    commit 262ab205180d2ba3ab6110899a4dbe439c51dfaa upstream.
    
    zcrx areas can be quite large and need to be accounted and checked
    against RLIMIT_MEMLOCK. In practise it shouldn't be a big issue as
    the inteface already requires cap_net_admin.
    
    Cc: stable@vger.kernel.org
    Fixes: cf96310c5f9a0 ("io_uring/zcrx: add io_zcrx_area")
    Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
    Link: https://lore.kernel.org/r/4b53f0c575bd062f63d12bec6cac98037fc66aeb.1752699568.git.asml.silence@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring/zcrx: don't leak pages on account failure [+ + +]

Author: Pavel Begunkov <asml.silence@gmail.com>
Date:   Mon Jul 21 10:56:21 2025 +0100

    io_uring/zcrx: don't leak pages on account failure
    
    commit 6bbd3411ff87df1ca38ff32d36eb5dc673ca8021 upstream.
    
    Someone needs to release pinned pages in io_import_umem() if accounting
    fails. Assign them to the area but return an error, the following
    io_zcrx_free_area() will clean them up.
    
    Fixes: 262ab205180d2 ("io_uring/zcrx: account area memory")
    Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
    Link: https://lore.kernel.org/r/e19f283a912f200c0d427e376cb789fc3f3d69bc.1753091564.git.asml.silence@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring/zcrx: fix null ifq on area destruction [+ + +]

Author: Pavel Begunkov <asml.silence@gmail.com>
Date:   Mon Jul 21 10:56:20 2025 +0100

    io_uring/zcrx: fix null ifq on area destruction
    
    commit 720df2310b89cf76c1dc1a05902536282506f8bf upstream.
    
    Dan reports that ifq can be null when infering arguments for
    io_unaccount_mem() from io_zcrx_free_area(). Fix it by always setting a
    correct ifq.
    
    Reported-by: kernel test robot <lkp@intel.com>
    Reported-by: Dan Carpenter <dan.carpenter@linaro.org>
    Closes: https://lore.kernel.org/r/202507180628.gBxrOgqr-lkp@intel.com/
    Fixes: 262ab205180d2 ("io_uring/zcrx: account area memory")
    Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
    Link: https://lore.kernel.org/r/20670d163bb90dba2a81a4150f1125603cefb101.1753091564.git.asml.silence@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring: don't use int for ABI [+ + +]

Author: Pavel Begunkov <asml.silence@gmail.com>
Date:   Wed Jul 2 21:31:54 2025 +0100

    io_uring: don't use int for ABI
    
    commit cf73d9970ea4f8cace5d8f02d2565a2723003112 upstream.
    
    __kernel_rwf_t is defined as int, the actual size of which is
    implementation defined. It won't go well if some compiler / archs
    ever defines it as i64, so replace it with __u32, hoping that
    there is no one using i16 for it.
    
    Cc: stable@vger.kernel.org
    Fixes: 2b188cc1bb857 ("Add io_uring IO interface")
    Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
    Link: https://lore.kernel.org/r/47c666c4ee1df2018863af3a2028af18feef11ed.1751412511.git.asml.silence@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring: export io_[un]account_mem [+ + +]

Author: Pavel Begunkov <asml.silence@gmail.com>
Date:   Wed Jul 16 22:04:08 2025 +0100

    io_uring: export io_[un]account_mem
    
    commit 11fbada7184f9e19bcdfa2f6b15828a78b8897a6 upstream.
    
    Export pinned memory accounting helpers, they'll be used by zcrx
    shortly.
    
    Cc: stable@vger.kernel.org
    Fixes: cf96310c5f9a0 ("io_uring/zcrx: add io_zcrx_area")
    Signed-off-by: Pavel Begunkov <asml.silence@gmail.com>
    Link: https://lore.kernel.org/r/9a61e54bd89289b39570ae02fe620e12487439e4.1752699568.git.asml.silence@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iommu/arm-smmu-qcom: Add SM6115 MDSS compatible [+ + +]

Author: Alexey Klimov <alexey.klimov@linaro.org>
Date:   Fri Jun 13 18:32:38 2025 +0100

    iommu/arm-smmu-qcom: Add SM6115 MDSS compatible
    
    commit f7fa8520f30373ce99c436c4d57c76befdacbef3 upstream.
    
    Add the SM6115 MDSS compatible to clients compatible list, as it also
    needs that workaround.
    Without this workaround, for example, QRB4210 RB2 which is based on
    SM4250/SM6115 generates a lot of smmu unhandled context faults during
    boot:
    
    arm_smmu_context_fault: 116854 callbacks suppressed
    arm-smmu c600000.iommu: Unhandled context fault: fsr=0x402,
    iova=0x5c0ec600, fsynr=0x320021, cbfrsynra=0x420, cb=5
    arm-smmu c600000.iommu: FSR    = 00000402 [Format=2 TF], SID=0x420
    arm-smmu c600000.iommu: FSYNR0 = 00320021 [S1CBNDX=50 PNU PLVL=1]
    arm-smmu c600000.iommu: Unhandled context fault: fsr=0x402,
    iova=0x5c0d7800, fsynr=0x320021, cbfrsynra=0x420, cb=5
    arm-smmu c600000.iommu: FSR    = 00000402 [Format=2 TF], SID=0x420
    
    and also failed initialisation of lontium lt9611uxc, gpu and dpu is
    observed:
    (binding MDSS components triggered by lt9611uxc have failed)
    
     ------------[ cut here ]------------
     !aspace
     WARNING: CPU: 6 PID: 324 at drivers/gpu/drm/msm/msm_gem_vma.c:130 msm_gem_vma_init+0x150/0x18c [msm]
     Modules linked in: ... (long list of modules)
     CPU: 6 UID: 0 PID: 324 Comm: (udev-worker) Not tainted 6.15.0-03037-gaacc73ceeb8b #4 PREEMPT
     Hardware name: Qualcomm Technologies, Inc. QRB4210 RB2 (DT)
     pstate: 80000005 (Nzcv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
     pc : msm_gem_vma_init+0x150/0x18c [msm]
     lr : msm_gem_vma_init+0x150/0x18c [msm]
     sp : ffff80008144b280
                    ...
     Call trace:
      msm_gem_vma_init+0x150/0x18c [msm] (P)
      get_vma_locked+0xc0/0x194 [msm]
      msm_gem_get_and_pin_iova_range+0x4c/0xdc [msm]
      msm_gem_kernel_new+0x48/0x160 [msm]
      msm_gpu_init+0x34c/0x53c [msm]
      adreno_gpu_init+0x1b0/0x2d8 [msm]
      a6xx_gpu_init+0x1e8/0x9e0 [msm]
      adreno_bind+0x2b8/0x348 [msm]
      component_bind_all+0x100/0x230
      msm_drm_bind+0x13c/0x3d0 [msm]
      try_to_bring_up_aggregate_device+0x164/0x1d0
      __component_add+0xa4/0x174
      component_add+0x14/0x20
      dsi_dev_attach+0x20/0x34 [msm]
      dsi_host_attach+0x58/0x98 [msm]
      devm_mipi_dsi_attach+0x34/0x90
      lt9611uxc_attach_dsi.isra.0+0x94/0x124 [lontium_lt9611uxc]
      lt9611uxc_probe+0x540/0x5fc [lontium_lt9611uxc]
      i2c_device_probe+0x148/0x2a8
      really_probe+0xbc/0x2c0
      __driver_probe_device+0x78/0x120
      driver_probe_device+0x3c/0x154
      __driver_attach+0x90/0x1a0
      bus_for_each_dev+0x68/0xb8
      driver_attach+0x24/0x30
      bus_add_driver+0xe4/0x208
      driver_register+0x68/0x124
      i2c_register_driver+0x48/0xcc
      lt9611uxc_driver_init+0x20/0x1000 [lontium_lt9611uxc]
      do_one_initcall+0x60/0x1d4
      do_init_module+0x54/0x1fc
      load_module+0x1748/0x1c8c
      init_module_from_file+0x74/0xa0
      __arm64_sys_finit_module+0x130/0x2f8
      invoke_syscall+0x48/0x104
      el0_svc_common.constprop.0+0xc0/0xe0
      do_el0_svc+0x1c/0x28
      el0_svc+0x2c/0x80
      el0t_64_sync_handler+0x10c/0x138
      el0t_64_sync+0x198/0x19c
     ---[ end trace 0000000000000000 ]---
     msm_dpu 5e01000.display-controller: [drm:msm_gpu_init [msm]] *ERROR* could not allocate memptrs: -22
     msm_dpu 5e01000.display-controller: failed to load adreno gpu
     platform a400000.remoteproc:glink-edge:apr:service@7:dais: Adding to iommu group 19
     msm_dpu 5e01000.display-controller: failed to bind 5900000.gpu (ops a3xx_ops [msm]): -22
     msm_dpu 5e01000.display-controller: adev bind failed: -22
     lt9611uxc 0-002b: failed to attach dsi to host
     lt9611uxc 0-002b: probe with driver lt9611uxc failed with error -22
    
    Suggested-by: Bjorn Andersson <andersson@kernel.org>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Fixes: 3581b7062cec ("drm/msm/disp/dpu1: add support for display on SM6115")
    Cc: stable@vger.kernel.org
    Signed-off-by: Alexey Klimov <alexey.klimov@linaro.org>
    Link: https://lore.kernel.org/r/20250613173238.15061-1-alexey.klimov@linaro.org
    Signed-off-by: Will Deacon <will@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iommu/arm-smmu-v3: Revert vmaster in the error path [+ + +]

Author: Nicolin Chen <nicolinc@nvidia.com>
Date:   Fri Jul 11 13:40:20 2025 -0700

    iommu/arm-smmu-v3: Revert vmaster in the error path
    
    commit 49f42634e8054e57d09c7f9ef5e4527e116059cb upstream.
    
    The error path for err_free_master_domain leaks the vmaster. Move all
    the kfrees for vmaster into the goto error section.
    
    Fixes: cfea71aea921 ("iommu/arm-smmu-v3: Put iopf enablement in the domain attach path")
    Cc: stable@vger.kernel.org
    Reviewed-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Pranjal Shrivastava <praan@google.com>
    Signed-off-by: Nicolin Chen <nicolinc@nvidia.com>
    Link: https://lore.kernel.org/r/20250711204020.1677884-1-nicolinc@nvidia.com
    Signed-off-by: Will Deacon <will@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iommu/vt-d: Optimize iotlb_sync_map for non-caching/non-RWBF modes [+ + +]

Author: Lu Baolu <baolu.lu@linux.intel.com>
Date:   Mon Jul 14 12:50:19 2025 +0800

    iommu/vt-d: Optimize iotlb_sync_map for non-caching/non-RWBF modes
    
    commit 12724ce3fe1a3d8f30d56e48b4f272d8860d1970 upstream.
    
    The iotlb_sync_map iommu ops allows drivers to perform necessary cache
    flushes when new mappings are established. For the Intel iommu driver,
    this callback specifically serves two purposes:
    
    - To flush caches when a second-stage page table is attached to a device
      whose iommu is operating in caching mode (CAP_REG.CM==1).
    - To explicitly flush internal write buffers to ensure updates to memory-
      resident remapping structures are visible to hardware (CAP_REG.RWBF==1).
    
    However, in scenarios where neither caching mode nor the RWBF flag is
    active, the cache_tag_flush_range_np() helper, which is called in the
    iotlb_sync_map path, effectively becomes a no-op.
    
    Despite being a no-op, cache_tag_flush_range_np() involves iterating
    through all cache tags of the iommu's attached to the domain, protected
    by a spinlock. This unnecessary execution path introduces overhead,
    leading to a measurable I/O performance regression. On systems with NVMes
    under the same bridge, performance was observed to drop from approximately
    ~6150 MiB/s down to ~4985 MiB/s.
    
    Introduce a flag in the dmar_domain structure. This flag will only be set
    when iotlb_sync_map is required (i.e., when CM or RWBF is set). The
    cache_tag_flush_range_np() is called only for domains where this flag is
    set. This flag, once set, is immutable, given that there won't be mixed
    configurations in real-world scenarios where some IOMMUs in a system
    operate in caching mode while others do not. Theoretically, the
    immutability of this flag does not impact functionality.
    
    Reported-by: Ioanna Alifieraki <ioanna-maria.alifieraki@canonical.com>
    Closes: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2115738
    Link: https://lore.kernel.org/r/20250701171154.52435-1-ioanna-maria.alifieraki@canonical.com
    Fixes: 129dab6e1286 ("iommu/vt-d: Use cache_tag_flush_range_np() in iotlb_sync_map")
    Cc: stable@vger.kernel.org
    Signed-off-by: Lu Baolu <baolu.lu@linux.intel.com>
    Reviewed-by: Kevin Tian <kevin.tian@intel.com>
    Link: https://lore.kernel.org/r/20250703031545.3378602-1-baolu.lu@linux.intel.com
    Link: https://lore.kernel.org/r/20250714045028.958850-3-baolu.lu@linux.intel.com
    Signed-off-by: Will Deacon <will@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iommufd: Prevent ALIGN() overflow [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Thu Jul 17 11:46:55 2025 -0300

    iommufd: Prevent ALIGN() overflow
    
    commit b42497e3c0e74db061eafad41c0cd7243c46436b upstream.
    
    When allocating IOVA the candidate range gets aligned to the target
    alignment. If the range is close to ULONG_MAX then the ALIGN() can
    wrap resulting in a corrupted iova.
    
    Open code the ALIGN() using get_add_overflow() to prevent this.
    This simplifies the checks as we don't need to check for length earlier
    either.
    
    Consolidate the two copies of this code under a single helper.
    
    This bug would allow userspace to create a mapping that overlaps with some
    other mapping or a reserved range.
    
    Cc: stable@vger.kernel.org
    Fixes: 51fe6141f0f6 ("iommufd: Data structure to provide IOVA to PFN mapping")
    Reported-by: syzbot+c2f65e2801743ca64e08@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/r/685af644.a00a0220.2e5631.0094.GAE@google.com
    Reviewed-by: Yi Liu <yi.l.liu@intel.com>
    Reviewed-by: Nicolin Chen <nicolinc@nvidia.com>
    Link: https://patch.msgid.link/all/1-v1-7b4a16fc390b+10f4-iommufd_alloc_overflow_jgg@nvidia.com/
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

iommufd: Report unmapped bytes in the error path of iopt_unmap_iova_range [+ + +]

Author: Nicolin Chen <nicolinc@nvidia.com>
Date:   Wed Jul 9 22:58:53 2025 -0700

    iommufd: Report unmapped bytes in the error path of iopt_unmap_iova_range
    
    commit b23e09f9997771b4b739c1c694fa832b5fa2de02 upstream.
    
    There are callers that read the unmapped bytes even when rc != 0. Thus, do
    not forget to report it in the error path too.
    
    Fixes: 8d40205f6093 ("iommufd: Add kAPI toward external drivers for kernel access")
    Link: https://patch.msgid.link/r/e2b61303bbc008ba1a4e2d7c2a2894749b59fdac.1752126748.git.nicolinc@nvidia.com
    Cc: stable@vger.kernel.org
    Reviewed-by: Kevin Tian <kevin.tian@intel.com>
    Reviewed-by: Jason Gunthorpe <jgg@nvidia.com>
    Signed-off-by: Nicolin Chen <nicolinc@nvidia.com>
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ionic: clean dbpage in de-init [+ + +]

Author: Shannon Nelson <sln@onemain.com>
Date:   Mon Jun 9 14:46:43 2025 -0700

    ionic: clean dbpage in de-init
    
    [ Upstream commit c9080abea1e69b8b1408ec7dec0acdfdc577a3e2 ]
    
    Since the kern_dbpage gets set up in ionic_lif_init() and that
    function's error path will clean it if needed, the kern_dbpage
    on teardown should be cleaned in ionic_lif_deinit(), not in
    ionic_lif_free().  As it is currently we get a double call
    to iounmap() on kern_dbpage if the PCI ionic fails setting up
    the lif.  One example of this is when firmware isn't responding
    to AdminQ requests and ionic's first AdminQ call fails to
    setup the NotifyQ.
    
    Signed-off-by: Shannon Nelson <shannon.nelson@amd.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Joe Damato <joe@dama.to>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ipmi: Fix strcpy source and destination the same [+ + +]

Author: Corey Minyard <corey@minyard.net>
Date:   Fri Jun 13 19:06:26 2025 -0500

    ipmi: Fix strcpy source and destination the same
    
    [ Upstream commit 8ffcb7560b4a15faf821df95e3ab532b2b020f8c ]
    
    The source and destination of some strcpy operations was the same.
    Split out the part of the operations that needed to be done for those
    particular calls so the unnecessary copy wasn't done.
    
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202506140756.EFXXvIP4-lkp@intel.com/
    Signed-off-by: Corey Minyard <corey@minyard.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ipmi: Use dev_warn_ratelimited() for incorrect message warnings [+ + +]

Author: Breno Leitao <leitao@debian.org>
Date:   Thu Jul 10 05:57:26 2025 -0700

    ipmi: Use dev_warn_ratelimited() for incorrect message warnings
    
    [ Upstream commit ec50ec378e3fd83bde9b3d622ceac3509a60b6b5 ]
    
    During BMC firmware upgrades on live systems, the ipmi_msghandler
    generates excessive "BMC returned incorrect response" warnings
    while the BMC is temporarily offline. This can flood system logs
    in large deployments.
    
    Replace dev_warn() with dev_warn_ratelimited() to throttle these
    warnings and prevent log spam during BMC maintenance operations.
    
    Signed-off-by: Breno Leitao <leitao@debian.org>
    Message-ID: <20250710-ipmi_ratelimit-v1-1-6d417015ebe9@debian.org>
    Signed-off-by: Corey Minyard <corey@minyard.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ipv6: mcast: Check inet6_dev->dead under idev->mc_lock in __ipv6_dev_mc_inc(). [+ + +]

Author: Kuniyuki Iwashima <kuniyu@google.com>
Date:   Wed Jul 2 16:01:20 2025 -0700

    ipv6: mcast: Check inet6_dev->dead under idev->mc_lock in __ipv6_dev_mc_inc().
    
    [ Upstream commit dbd40f318cf2f59759bd170c401adc20ba360a3e ]
    
    Since commit 63ed8de4be81 ("mld: add mc_lock for protecting
    per-interface mld data"), every multicast resource is protected
    by inet6_dev->mc_lock.
    
    RTNL is unnecessary in terms of protection but still needed for
    synchronisation between addrconf_ifdown() and __ipv6_dev_mc_inc().
    
    Once we removed RTNL, there would be a race below, where we could
    add a multicast address to a dead inet6_dev.
    
      CPU1                            CPU2
      ====                            ====
      addrconf_ifdown()               __ipv6_dev_mc_inc()
                                        if (idev->dead) <-- false
        dead = true                       return -ENODEV;
        ipv6_mc_destroy_dev() / ipv6_mc_down()
          mutex_lock(&idev->mc_lock)
          ...
          mutex_unlock(&idev->mc_lock)
                                        mutex_lock(&idev->mc_lock)
                                        ...
                                        mutex_unlock(&idev->mc_lock)
    
    The race window can be easily closed by checking inet6_dev->dead
    under inet6_dev->mc_lock in __ipv6_dev_mc_inc() as addrconf_ifdown()
    will acquire it after marking inet6_dev dead.
    
    Let's check inet6_dev->dead under mc_lock in __ipv6_dev_mc_inc().
    
    Note that now __ipv6_dev_mc_inc() no longer depends on RTNL and
    we can remove ASSERT_RTNL() there and the RTNL comment above
    addrconf_join_solict().
    
    Signed-off-by: Kuniyuki Iwashima <kuniyu@google.com>
    Reviewed-by: Eric Dumazet <edumazet@google.com>
    Link: https://patch.msgid.link/20250702230210.3115355-4-kuni1840@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ipvs: Fix estimator kthreads preferred affinity [+ + +]

Author: Frederic Weisbecker <frederic@kernel.org>
Date:   Tue Jul 29 14:26:11 2025 +0200

    ipvs: Fix estimator kthreads preferred affinity
    
    [ Upstream commit c0a23bbc98e93704a1f4fb5e7e7bb2d7c0fb6eb3 ]
    
    The estimator kthreads' affinity are defined by sysctl overwritten
    preferences and applied through a plain call to the scheduler's affinity
    API.
    
    However since the introduction of managed kthreads preferred affinity,
    such a practice shortcuts the kthreads core code which eventually
    overwrites the target to the default unbound affinity.
    
    Fix this with using the appropriate kthread's API.
    
    Fixes: d1a89197589c ("kthread: Default affine kthread to its preferred NUMA node")
    Signed-off-by: Frederic Weisbecker <frederic@kernel.org>
    Acked-by: Julian Anastasov <ja@ssi.bg>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

irqchip/mips-gic: Allow forced affinity [+ + +]

Author: Markus Stockhausen <markus.stockhausen@gmx.de>
Date:   Sat Jun 21 01:49:51 2025 -0400

    irqchip/mips-gic: Allow forced affinity
    
    [ Upstream commit 2250db8628a0d8293ad2e0671138b848a185fba1 ]
    
    Devices of the Realtek MIPS Otto platform use the official rtl-otto-timer
    as clock event generator and CPU clocksource. It is registered for each CPU
    startup via cpuhp_setup_state() and forces the affinity of the clockevent
    interrupts to the appropriate CPU via irq_force_affinity().
    
    On the "smaller" devices with a vendor specific interrupt controller
    (supported by irq-realtek-rtl) the registration works fine. The "larger"
    RTL931x series is based on a MIPS interAptiv dual core with a MIPS GIC
    controller. Interrupt routing setup is cancelled because gic_set_affinity()
    does not accept the current (not yet online) CPU as a target.
    
    Relax the checks by evaluating the force parameter that is provided for
    exactly this purpose like in other drivers. With this the affinity can be
    set as follows:
    
     - force = false: allow to set affinity to any online cpu
     - force = true:  allow to set affinity to any cpu
    
    Co-developed-by: Sebastian Gottschall <s.gottschall@dd-wrt.com>
    Signed-off-by: Sebastian Gottschall <s.gottschall@dd-wrt.com>
    Signed-off-by: Markus Stockhausen <markus.stockhausen@gmx.de>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Link: https://lore.kernel.org/all/20250621054952.380374-1-markus.stockhausen@gmx.de
    Signed-off-by: Sasha Levin <sashal@kernel.org>

irqchip/mvebu-gicp: Clear pending interrupts on init [+ + +]

Author: Elad Nachman <enachman@marvell.com>
Date:   Sun Aug 3 13:25:48 2025 +0300

    irqchip/mvebu-gicp: Clear pending interrupts on init
    
    [ Upstream commit 3c3d7dbab2c70a4bca47634d564bf659351c05ca ]
    
    When a kexec'ed kernel boots up, there might be stale unhandled interrupts
    pending in the interrupt controller. These are delivered as spurious
    interrupts once the boot CPU enables interrupts.
    
    Clear all pending interrupts when the driver is initialized to prevent
    these spurious interrupts from locking the CPU in an endless loop.
    
    Signed-off-by: Elad Nachman <enachman@marvell.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Link: https://lore.kernel.org/all/20250803102548.669682-2-enachman@marvell.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

irqchip/mvebu-gicp: Use resource_size() for ioremap() [+ + +]

Author: Thomas Gleixner <tglx@linutronix.de>
Date:   Wed Aug 6 16:53:18 2025 +0200

    irqchip/mvebu-gicp: Use resource_size() for ioremap()
    
    commit 9f7488f24c7571d349d938061e0ede7a39b65d6b upstream.
    
    0-day reported an off by one in the ioremap() sizing:
    
      drivers/irqchip/irq-mvebu-gicp.c:240:45-48: WARNING:
      Suspicious code. resource_size is maybe missing with gicp -> res
    
    Convert it to resource_size(), which does the right thing.
    
    Fixes: 3c3d7dbab2c7 ("irqchip/mvebu-gicp: Clear pending interrupts on init")
    Reported-by: kernel test robot <lkp@intel.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Closes: https://lore.kernel.org/oe-kbuild-all/202508062150.mtFQMTXc-lkp@intel.com/
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

irqchip/renesas-rzv2h: Enable SKIP_SET_WAKE and MASK_ON_SUSPEND [+ + +]

Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Tue Jul 1 11:59:21 2025 +0100

    irqchip/renesas-rzv2h: Enable SKIP_SET_WAKE and MASK_ON_SUSPEND
    
    [ Upstream commit de2942828e7670526289f098df7e50b112e8ff1e ]
    
    The interrupt controller found on RZ/G3E doesn't provide any facility to
    configure the wakeup sources. That's the reason why the driver lacks the
    irq_set_wake() callback for the interrupt chip.
    
    But this prevent to properly enter power management states like "suspend to
    idle".
    
    Enable the flags IRQCHIP_SKIP_SET_WAKE and IRQCHIP_MASK_ON_SUSPEND so the
    interrupt suspend logic can handle the chip correctly.
    
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/all/20250701105923.52151-1-biju.das.jz@bp.renesas.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

jfs: Regular file corruption check [+ + +]

Author: Edward Adam Davis <eadavis@qq.com>
Date:   Wed Jun 4 14:48:43 2025 +0800

    jfs: Regular file corruption check
    
    [ Upstream commit 2d04df8116426b6c7b9f8b9b371250f666a2a2fb ]
    
    The reproducer builds a corrupted file on disk with a negative i_size value.
    Add a check when opening this file to avoid subsequent operation failures.
    
    Reported-by: syzbot+630f6d40b3ccabc8e96e@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=630f6d40b3ccabc8e96e
    Tested-by: syzbot+630f6d40b3ccabc8e96e@syzkaller.appspotmail.com
    Signed-off-by: Edward Adam Davis <eadavis@qq.com>
    Signed-off-by: Dave Kleikamp <dave.kleikamp@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

jfs: truncate good inode pages when hard link is 0 [+ + +]

Author: Lizhi Xu <lizhi.xu@windriver.com>
Date:   Fri Jun 13 11:05:34 2025 +0800

    jfs: truncate good inode pages when hard link is 0
    
    [ Upstream commit 2d91b3765cd05016335cd5df5e5c6a29708ec058 ]
    
    The fileset value of the inode copy from the disk by the reproducer is
    AGGR_RESERVED_I. When executing evict, its hard link number is 0, so its
    inode pages are not truncated. This causes the bugon to be triggered when
    executing clear_inode() because nrpages is greater than 0.
    
    Reported-by: syzbot+6e516bb515d93230bc7b@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=6e516bb515d93230bc7b
    Signed-off-by: Lizhi Xu <lizhi.xu@windriver.com>
    Signed-off-by: Dave Kleikamp <dave.kleikamp@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

jfs: upper bound check of tree index in dbAllocAG [+ + +]

Author: Arnaud Lecomte <contact@arnaud-lcm.com>
Date:   Thu Apr 24 00:13:51 2025 +0200

    jfs: upper bound check of tree index in dbAllocAG
    
    [ Upstream commit c214006856ff52a8ff17ed8da52d50601d54f9ce ]
    
    When computing the tree index in dbAllocAG, we never check if we are
    out of bounds realative to the size of the stree.
    This could happen in a scenario where the filesystem metadata are
    corrupted.
    
    Reported-by: syzbot+cffd18309153948f3c3e@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=cffd18309153948f3c3e
    Tested-by: syzbot+cffd18309153948f3c3e@syzkaller.appspotmail.com
    Signed-off-by: Arnaud Lecomte <contact@arnaud-lcm.com>
    Signed-off-by: Dave Kleikamp <dave.kleikamp@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kconfig: gconf: avoid hardcoding model2 in on_treeview2_cursor_changed() [+ + +]

Author: Masahiro Yamada <masahiroy@kernel.org>
Date:   Wed Jun 25 00:05:20 2025 +0900

    kconfig: gconf: avoid hardcoding model2 in on_treeview2_cursor_changed()
    
    [ Upstream commit cae9cdbcd9af044810bcceeb43a87accca47c71d ]
    
    The on_treeview2_cursor_changed() handler is connected to both the left
    and right tree views, but it hardcodes model2 (the GtkTreeModel of the
    right tree view). This is incorrect. Get the associated model from the
    view.
    
    Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kconfig: gconf: fix potential memory leak in renderer_edited() [+ + +]

Author: Masahiro Yamada <masahiroy@kernel.org>
Date:   Wed Jun 25 00:04:55 2025 +0900

    kconfig: gconf: fix potential memory leak in renderer_edited()
    
    [ Upstream commit f72ed4c6a375e52a3f4b75615e4a89d29d8acea7 ]
    
    If gtk_tree_model_get_iter() fails, gtk_tree_path_free() is not called.
    
    Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>
    Acked-by: Randy Dunlap <rdunlap@infradead.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kconfig: lxdialog: fix 'space' to (de)select options [+ + +]

Author: Yann E. MORIN <yann.morin.1998@free.fr>
Date:   Thu Nov 14 00:53:32 2013 +0100

    kconfig: lxdialog: fix 'space' to (de)select options
    
    [ Upstream commit 694174f94ebeeb5ec5cc0e9de9b40c82057e1d95 ]
    
    In case a menu has comment without letters/numbers (eg. characters
    matching the regexp '^[^[:alpha:][:digit:]]+$', for example - or *),
    hitting space will cycle through those comments, rather than
    selecting/deselecting the currently-highlighted option.
    
    This is the behaviour of hitting any letter/digit: jump to the next
    option which prompt starts with that letter. The only letters that
    do not behave as such are 'y' 'm' and 'n'. Prompts that start with
    one of those three letters are instead matched on the first letter
    that is not 'y', 'm' or 'n'.
    
    Fix that by treating 'space' as we treat y/m/n, ie. as an action key,
    not as shortcut to jump to  prompt.
    
    Signed-off-by: Yann E. MORIN <yann.morin.1998@free.fr>
    Signed-off-by: Peter Korsgaard <peter@korsgaard.com>
    Signed-off-by: Cherniaev Andrei <dungeonlords789@naver.com>
    [masahiro: took from Buildroot, adjusted the commit subject]
    Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kconfig: lxdialog: replace strcpy() with strncpy() in inputbox.c [+ + +]

Author: Suchit Karunakaran <suchitkarunakaran@gmail.com>
Date:   Sun Jul 27 22:14:33 2025 +0530

    kconfig: lxdialog: replace strcpy() with strncpy() in inputbox.c
    
    [ Upstream commit 5ac726653a1029a2eccba93bbe59e01fc9725828 ]
    
    strcpy() performs no bounds checking and can lead to buffer overflows if
    the input string exceeds the destination buffer size. This patch replaces
    it with strncpy(), and null terminates the input string.
    
    Signed-off-by: Suchit Karunakaran <suchitkarunakaran@gmail.com>
    Reviewed-by: Nicolas Schier <nicolas.schier@linux.dev>
    Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kconfig: nconf: Ensure null termination where strncpy is used [+ + +]

Author: Shankari Anand <shankari.ak0208@gmail.com>
Date:   Thu Jun 26 00:36:54 2025 +0530

    kconfig: nconf: Ensure null termination where strncpy is used
    
    [ Upstream commit f468992936894c9ce3b1659cf38c230d33b77a16 ]
    
    strncpy() does not guarantee null-termination if the source string is
    longer than the destination buffer.
    
    Ensure the buffer is explicitly null-terminated to prevent potential
    string overflows or undefined behavior.
    
    Signed-off-by: Shankari Anand <shankari.ak0208@gmail.com>
    Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>
    Acked-by: Randy Dunlap <rdunlap@infradead.org>
    Tested-by: Randy Dunlap <rdunlap@infradead.org>
    Tested-by: Nicolas Schier <n.schier@avm.de>
    Acked-by: Nicolas Schier <n.schier@avm.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kheaders: rebuild kheaders_data.tar.xz when a file is modified within a minute [+ + +]

Author: Masahiro Yamada <masahiroy@kernel.org>
Date:   Mon Jun 9 01:59:55 2025 +0900

    kheaders: rebuild kheaders_data.tar.xz when a file is modified within a minute
    
    [ Upstream commit 626c54af35764b0b8a4ed5c446458ba6ddfe9cc8 ]
    
    When a header file is changed, kernel/gen_kheaders.sh may fail to update
    kernel/kheaders_data.tar.xz.
    
    [steps to reproduce]
    
    [1] Build kernel/kheaders_data.tar.xz
    
      $ make -j$(nproc) kernel/kheaders.o
        DESCEND objtool
        INSTALL libsubcmd_headers
        CALL    scripts/checksyscalls.sh
        CHK     kernel/kheaders_data.tar.xz
        GEN     kernel/kheaders_data.tar.xz
        CC      kernel/kheaders.o
    
    [2] Modify a header without changing the file size
    
      $ sed -i s/0xdeadbeef/0xfeedbeef/ include/linux/elfnote.h
    
    [3] Rebuild kernel/kheaders_data.tar.xz
    
      $ make -j$(nproc) kernel/kheaders.o
        DESCEND objtool
        INSTALL libsubcmd_headers
        CALL    scripts/checksyscalls.sh
        CHK     kernel/kheaders_data.tar.xz
    
    kernel/kheaders_data.tar.xz is not updated if steps [1] - [3] are run
    within the same minute.
    
    The headers_md5 variable stores the MD5 hash of the 'ls -l' output
    for all header files. This hash value is used to determine whether
    kheaders_data.tar.xz needs to be rebuilt. However, 'ls -l' prints the
    modification times with minute-level granularity. If a file is modified
    within the same minute and its size remains the same, the MD5 hash does
    not change.
    
    To reliably detect file modifications, this commit rewrites
    kernel/gen_kheaders.sh to output header dependencies to
    kernel/.kheaders_data.tar.xz.cmd. Then, Make compares the timestamps
    and reruns kernel/gen_kheaders.sh when necessary. This is the standard
    mechanism used by Make and Kbuild.
    
    Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kselftest/arm64: Specify SVE data when testing VL set in sve-ptrace [+ + +]

Author: Mark Brown <broonie@kernel.org>
Date:   Mon Jun 9 16:25:33 2025 +0100

    kselftest/arm64: Specify SVE data when testing VL set in sve-ptrace
    
    [ Upstream commit 9e8ebfe677f9101bbfe1f75d548a5aec581e8213 ]
    
    Since f916dd32a943 ("arm64/fpsimd: ptrace: Mandate SVE payload for
    streaming-mode state") we reject attempts to write to the streaming mode
    regset even if there is no register data supplied, causing the tests for
    setting vector lengths and setting SVE_VL_INHERIT in sve-ptrace to
    spuriously fail. Set the flag to avoid the issue, we still support not
    supplying register data.
    
    Acked-by: Mark Rutland <mark.rutland@arm.com>
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Link: https://lore.kernel.org/r/20250609-kselftest-arm64-ssve-fixups-v2-3-998fcfa6f240@kernel.org
    Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ktest.pl: Prevent recursion of default variable options [+ + +]

Author: Steven Rostedt <rostedt@goodmis.org>
Date:   Fri Jul 18 16:18:44 2025 -0400

    ktest.pl: Prevent recursion of default variable options
    
    [ Upstream commit 61f7e318e99d3b398670518dd3f4f8510d1800fc ]
    
    If a default variable contains itself, do not recurse on it.
    
    For example:
    
      ADD_CONFIG := ${CONFIG_DIR}/temp_config
      DEFAULTS
      ADD_CONFIG = ${CONFIG_DIR}/default_config ${ADD_CONFIG}
    
    The above works because the temp variable ADD_CONFIG (is a temp because it
    is created with ":=") is already defined, it will be substituted in the
    variable option. But if it gets commented out:
    
      # ADD_CONFIG := ${CONFIG_DIR}/temp_config
      DEFAULTS
      ADD_CONFIG = ${CONFIG_DIR}/default_config ${ADD_CONFIG}
    
    Then the above will go into a recursive loop where ${ADD_CONFIG} will
    get replaced with the current definition of ADD_CONFIG which contains the
    ${ADD_CONFIG} and that will also try to get converted. ktest.pl will error
    after 100 attempts of recursion and fail.
    
    When replacing a variable with the default variable, if the default
    variable contains itself, do not replace it.
    
    Cc: "John Warthog9 Hawley" <warthog9@kernel.org>
    Cc: Dhaval Giani <dhaval.giani@gmail.com>
    Cc: Greg KH <gregkh@linuxfoundation.org>
    Link: https://lore.kernel.org/20250718202053.732189428@kernel.org
    Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

KVM: nVMX: Check vmcs12->guest_ia32_debugctl on nested VM-Enter [+ + +]

Author: Maxim Levitsky <mlevitsk@redhat.com>
Date:   Thu Aug 14 12:12:10 2025 -0400

    KVM: nVMX: Check vmcs12->guest_ia32_debugctl on nested VM-Enter
    
    [ Upstream commit 095686e6fcb4150f0a55b1a25987fad3d8af58d6 ]
    
    Add a consistency check for L2's guest_ia32_debugctl, as KVM only supports
    a subset of hardware functionality, i.e. KVM can't rely on hardware to
    detect illegal/unsupported values.  Failure to check the vmcs12 value
    would allow the guest to load any harware-supported value while running L2.
    
    Take care to exempt BTF and LBR from the validity check in order to match
    KVM's behavior for writes via WRMSR, but without clobbering vmcs12.  Even
    if VM_EXIT_SAVE_DEBUG_CONTROLS is set in vmcs12, L1 can reasonably expect
    that vmcs12->guest_ia32_debugctl will not be modified if writes to the MSR
    are being intercepted.
    
    Arguably, KVM _should_ update vmcs12 if VM_EXIT_SAVE_DEBUG_CONTROLS is set
    *and* writes to MSR_IA32_DEBUGCTLMSR are not being intercepted by L1, but
    that would incur non-trivial complexity and wouldn't change the fact that
    KVM's handling of DEBUGCTL is blatantly broken.  I.e. the extra complexity
    is not worth carrying.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Maxim Levitsky <mlevitsk@redhat.com>
    Co-developed-by: Sean Christopherson <seanjc@google.com>
    Link: https://lore.kernel.org/r/20250610232010.162191-7-seanjc@google.com
    Signed-off-by: Sean Christopherson <seanjc@google.com>
    Stable-dep-of: 6b1dd26544d0 ("KVM: VMX: Preserve host's DEBUGCTLMSR_FREEZE_IN_SMM while running the guest")
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

KVM: VMX: Extract checking of guest's DEBUGCTL into helper [+ + +]

Author: Sean Christopherson <seanjc@google.com>
Date:   Thu Aug 14 12:12:09 2025 -0400

    KVM: VMX: Extract checking of guest's DEBUGCTL into helper
    
    [ Upstream commit 8a4351ac302cd8c19729ba2636acfd0467c22ae8 ]
    
    Move VMX's logic to check DEBUGCTL values into a standalone helper so that
    the code can be used by nested VM-Enter to apply the same logic to the
    value being loaded from vmcs12.
    
    KVM needs to explicitly check vmcs12->guest_ia32_debugctl on nested
    VM-Enter, as hardware may support features that KVM does not, i.e. relying
    on hardware to detect invalid guest state will result in false negatives.
    Unfortunately, that means applying KVM's funky suppression of BTF and LBR
    to vmcs12 so as not to break existing guests.
    
    No functional change intended.
    
    Reviewed-by: Dapeng Mi <dapeng1.mi@linux.intel.com>
    Link: https://lore.kernel.org/r/20250610232010.162191-6-seanjc@google.com
    Signed-off-by: Sean Christopherson <seanjc@google.com>
    Stable-dep-of: 6b1dd26544d0 ("KVM: VMX: Preserve host's DEBUGCTLMSR_FREEZE_IN_SMM while running the guest")
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

KVM: VMX: Preserve host's DEBUGCTLMSR_FREEZE_IN_SMM while running the guest [+ + +]

Author: Maxim Levitsky <mlevitsk@redhat.com>
Date:   Thu Aug 14 12:12:12 2025 -0400

    KVM: VMX: Preserve host's DEBUGCTLMSR_FREEZE_IN_SMM while running the guest
    
    [ Upstream commit 6b1dd26544d045f6a79e8c73572c0c0db3ef3c1a ]
    
    Set/clear DEBUGCTLMSR_FREEZE_IN_SMM in GUEST_IA32_DEBUGCTL based on the
    host's pre-VM-Enter value, i.e. preserve the host's FREEZE_IN_SMM setting
    while running the guest.  When running with the "default treatment of SMIs"
    in effect (the only mode KVM supports), SMIs do not generate a VM-Exit that
    is visible to host (non-SMM) software, and instead transitions directly
    from VMX non-root to SMM.  And critically, DEBUGCTL isn't context switched
    by hardware on SMI or RSM, i.e. SMM will run with whatever value was
    resident in hardware at the time of the SMI.
    
    Failure to preserve FREEZE_IN_SMM results in the PMU unexpectedly counting
    events while the CPU is executing in SMM, which can pollute profiling and
    potentially leak information into the guest.
    
    Check for changes in FREEZE_IN_SMM prior to every entry into KVM's inner
    run loop, as the bit can be toggled in IRQ context via IPI callback (SMP
    function call), by way of /sys/devices/cpu/freeze_on_smi.
    
    Add a field in kvm_x86_ops to communicate which DEBUGCTL bits need to be
    preserved, as FREEZE_IN_SMM is only supported and defined for Intel CPUs,
    i.e. explicitly checking FREEZE_IN_SMM in common x86 is at best weird, and
    at worst could lead to undesirable behavior in the future if AMD CPUs ever
    happened to pick up a collision with the bit.
    
    Exempt TDX vCPUs, i.e. protected guests, from the check, as the TDX Module
    owns and controls GUEST_IA32_DEBUGCTL.
    
    WARN in SVM if KVM_RUN_LOAD_DEBUGCTL is set, mostly to document that the
    lack of handling isn't a KVM bug (TDX already WARNs on any run_flag).
    
    Lastly, explicitly reload GUEST_IA32_DEBUGCTL on a VM-Fail that is missed
    by KVM but detected by hardware, i.e. in nested_vmx_restore_host_state().
    Doing so avoids the need to track host_debugctl on a per-VMCS basis, as
    GUEST_IA32_DEBUGCTL is unconditionally written by prepare_vmcs02() and
    load_vmcs12_host_state().  For the VM-Fail case, even though KVM won't
    have actually entered the guest, vcpu_enter_guest() will have run with
    vmcs02 active and thus could result in vmcs01 being run with a stale value.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Maxim Levitsky <mlevitsk@redhat.com>
    Co-developed-by: Sean Christopherson <seanjc@google.com>
    Link: https://lore.kernel.org/r/20250610232010.162191-9-seanjc@google.com
    Signed-off-by: Sean Christopherson <seanjc@google.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

KVM: VMX: Wrap all accesses to IA32_DEBUGCTL with getter/setter APIs [+ + +]

Author: Maxim Levitsky <mlevitsk@redhat.com>
Date:   Thu Aug 14 12:12:11 2025 -0400

    KVM: VMX: Wrap all accesses to IA32_DEBUGCTL with getter/setter APIs
    
    [ Upstream commit 7d0cce6cbe71af6e9c1831bff101a2b9c249c4a2 ]
    
    Introduce vmx_guest_debugctl_{read,write}() to handle all accesses to
    vmcs.GUEST_IA32_DEBUGCTL. This will allow stuffing FREEZE_IN_SMM into
    GUEST_IA32_DEBUGCTL based on the host setting without bleeding the state
    into the guest, and without needing to copy+paste the FREEZE_IN_SMM
    logic into every patch that accesses GUEST_IA32_DEBUGCTL.
    
    No functional change intended.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Maxim Levitsky <mlevitsk@redhat.com>
    [sean: massage changelog, make inline, use in all prepare_vmcs02() cases]
    Reviewed-by: Dapeng Mi <dapeng1.mi@linux.intel.com>
    Link: https://lore.kernel.org/r/20250610232010.162191-8-seanjc@google.com
    Signed-off-by: Sean Christopherson <seanjc@google.com>
    Stable-dep-of: 6b1dd26544d0 ("KVM: VMX: Preserve host's DEBUGCTLMSR_FREEZE_IN_SMM while running the guest")
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

landlock: opened file never has a negative dentry [+ + +]

Author: Al Viro <viro@zeniv.linux.org.uk>
Date:   Tue May 27 22:39:51 2025 -0400

    landlock: opened file never has a negative dentry
    
    [ Upstream commit d1832e648d2be564e4b5e357f94d0f33156590dc ]
    
    Reviewed-by: Christian Brauner <brauner@kernel.org>
    Acked-by: Mickaël Salaün <mic@digikod.net>
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

leds: flash: leds-qcom-flash: Fix registry access after re-bind [+ + +]

Author: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
Date:   Thu May 29 08:33:36 2025 +0200

    leds: flash: leds-qcom-flash: Fix registry access after re-bind
    
    commit fab15f57360b1e6620a1d0d6b0fbee896e6c1f07 upstream.
    
    Driver in probe() updates each of 'reg_field' with 'reg_base':
    
            for (i = 0; i < REG_MAX_COUNT; i++)
                    regs[i].reg += reg_base;
    
    'reg_field' array (under variable 'regs' above) is statically allocated,
    thus each re-bind would add another 'reg_base' leading to bogus
    register addresses.  Constify the local 'reg_field' array and duplicate
    it in probe to solve this.
    
    Fixes: 96a2e242a5dc ("leds: flash: Add driver to support flash LED module in QCOM PMICs")
    Cc: stable@vger.kernel.org
    Signed-off-by: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org>
    Reviewed-by: Fenglin Wu <fenglin.wu@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250529063335.8785-2-krzysztof.kozlowski@linaro.org
    Signed-off-by: Lee Jones <lee@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

leds: leds-lp50xx: Handle reg to get correct multi_index [+ + +]

Author: Johan Adolfsson <johan.adolfsson@axis.com>
Date:   Tue Jun 17 12:23:54 2025 +0200

    leds: leds-lp50xx: Handle reg to get correct multi_index
    
    [ Upstream commit 2e84a5e5374232e6f356ce5c079a5658d7e4af2c ]
    
    mc_subled used for multi_index needs well defined array indexes,
    to guarantee the desired result, use reg for that.
    
    If devicetree child nodes is processed in random or reverse order
    you may end up with multi_index "blue green red" instead of the expected
    "red green blue".
    If user space apps uses multi_index to deduce how to control the leds
    they would most likely be broken without this patch if devicetree
    processing is reversed (which it appears to be).
    
    arch/arm/boot/dts/aspeed/aspeed-bmc-facebook-fuji.dts has reg set
    but I don't see how it can have worked without this change.
    
    If reg is not set, an error is returned,
    If reg is out of range, an error is returned.
    reg within led child nodes starts with 0, to map to the iout in each bank.
    
    Signed-off-by: Johan Adolfsson <johan.adolfsson@axis.com>
    Reviewed-by: Jacek Anaszewski <jacek.anaszewski@gmail.com>
    Link: https://lore.kernel.org/r/20250617-led-fix-v7-1-cdbe8efc88fa@axis.com
    Signed-off-by: Lee Jones <lee@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

lib/crypto: x86/poly1305: Fix performance regression on short messages [+ + +]

Author: Eric Biggers <ebiggers@kernel.org>
Date:   Sun Jul 6 16:11:00 2025 -0700

    lib/crypto: x86/poly1305: Fix performance regression on short messages
    
    commit 9f65592b7e1f24569bb6ced064df5b4319f725ce upstream.
    
    Restore the len >= 288 condition on using the AVX implementation, which
    was incidentally removed by commit 318c53ae02f2 ("crypto: x86/poly1305 -
    Add block-only interface").  This check took into account the overhead
    in key power computation, kernel-mode "FPU", and tail handling
    associated with the AVX code.  Indeed, restoring this check slightly
    improves performance for len < 256 as measured using poly1305_kunit on
    an "AMD Ryzen AI 9 365" (Zen 5) CPU:
    
        Length      Before       After
        ======  ==========  ==========
             1     30 MB/s     36 MB/s
            16    516 MB/s    598 MB/s
            64   1700 MB/s   1882 MB/s
           127   2265 MB/s   2651 MB/s
           128   2457 MB/s   2827 MB/s
           200   2702 MB/s   3238 MB/s
           256   3841 MB/s   3768 MB/s
           511   4580 MB/s   4585 MB/s
           512   5430 MB/s   5398 MB/s
          1024   7268 MB/s   7305 MB/s
          3173   8999 MB/s   8948 MB/s
          4096   9942 MB/s   9921 MB/s
         16384  10557 MB/s  10545 MB/s
    
    While the optimal threshold for this CPU might be slightly lower than
    288 (see the len == 256 case), other CPUs would need to be tested too,
    and these sorts of benchmarks can underestimate the true cost of
    kernel-mode "FPU".  Therefore, for now just restore the 288 threshold.
    
    Fixes: 318c53ae02f2 ("crypto: x86/poly1305 - Add block-only interface")
    Cc: stable@vger.kernel.org
    Reviewed-by: Ard Biesheuvel <ardb@kernel.org>
    Link: https://lore.kernel.org/r/20250706231100.176113-6-ebiggers@kernel.org
    Signed-off-by: Eric Biggers <ebiggers@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

lib/crypto: x86/poly1305: Fix register corruption in no-SIMD contexts [+ + +]

Author: Eric Biggers <ebiggers@kernel.org>
Date:   Sun Jul 6 16:10:59 2025 -0700

    lib/crypto: x86/poly1305: Fix register corruption in no-SIMD contexts
    
    commit 16f2c30e290e04135b70ad374fb7e1d1ed9ff5e7 upstream.
    
    Restore the SIMD usability check and base conversion that were removed
    by commit 318c53ae02f2 ("crypto: x86/poly1305 - Add block-only
    interface").
    
    This safety check is cheap and is well worth eliminating a footgun.
    While the Poly1305 functions should not be called when SIMD registers
    are unusable, if they are anyway, they should just do the right thing
    instead of corrupting random tasks' registers and/or computing incorrect
    MACs.  Fixing this is also needed for poly1305_kunit to pass.
    
    Just use irq_fpu_usable() instead of the original crypto_simd_usable(),
    since poly1305_kunit won't rely on crypto_simd_disabled_for_test.
    
    Fixes: 318c53ae02f2 ("crypto: x86/poly1305 - Add block-only interface")
    Cc: stable@vger.kernel.org
    Reviewed-by: Ard Biesheuvel <ardb@kernel.org>
    Link: https://lore.kernel.org/r/20250706231100.176113-5-ebiggers@kernel.org
    Signed-off-by: Eric Biggers <ebiggers@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

lib/sbitmap: convert shallow_depth from one word to the whole sbitmap [+ + +]

Author: Yu Kuai <yukuai3@huawei.com>
Date:   Thu Aug 7 11:24:12 2025 +0800

    lib/sbitmap: convert shallow_depth from one word to the whole sbitmap
    
    [ Upstream commit 42e6c6ce03fd3e41e39a0f93f9b1a1d9fa664338 ]
    
    Currently elevators will record internal 'async_depth' to throttle
    asynchronous requests, and they both calculate shallow_dpeth based on
    sb->shift, with the respect that sb->shift is the available tags in one
    word.
    
    However, sb->shift is not the availbale tags in the last word, see
    __map_depth:
    
    if (index == sb->map_nr - 1)
      return sb->depth - (index << sb->shift);
    
    For consequence, if the last word is used, more tags can be get than
    expected, for example, assume nr_requests=256 and there are four words,
    in the worst case if user set nr_requests=32, then the first word is
    the last word, and still use bits per word, which is 64, to calculate
    async_depth is wrong.
    
    One the ohter hand, due to cgroup qos, bfq can allow only one request
    to be allocated, and set shallow_dpeth=1 will still allow the number
    of words request to be allocated.
    
    Fix this problems by using shallow_depth to the whole sbitmap instead
    of per word, also change kyber, mq-deadline and bfq to follow this,
    a new helper __map_depth_with_shallow() is introduced to calculate
    available bits in each word.
    
    Signed-off-by: Yu Kuai <yukuai3@huawei.com>
    Link: https://lore.kernel.org/r/20250807032413.1469456-2-yukuai1@huaweicloud.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

lib: packing: Include necessary headers [+ + +]

Author: Nathan Lynch <nathan.lynch@amd.com>
Date:   Tue Jun 24 08:50:44 2025 -0500

    lib: packing: Include necessary headers
    
    [ Upstream commit 8bd0af3154b2206ce19f8b1410339f7a2a56d0c3 ]
    
    packing.h uses ARRAY_SIZE(), BUILD_BUG_ON_MSG(), min(), max(), and
    sizeof_field() without including the headers where they are defined,
    potentially causing build failures.
    
    Fix this in packing.h and sort the result.
    
    Signed-off-by: Nathan Lynch <nathan.lynch@amd.com>
    Reviewed-by: Vladimir Oltean <olteanv@gmail.com>
    Link: https://patch.msgid.link/20250624-packing-includes-v1-1-c23c81fab508@amd.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

libbpf: Fix warning in calloc() usage [+ + +]

Author: Matteo Croce <teknoraver@meta.com>
Date:   Thu Jul 17 22:03:37 2025 +0200

    libbpf: Fix warning in calloc() usage
    
    [ Upstream commit 0ee30d937c147fc14c4b49535181d437cd2fde7a ]
    
    When compiling libbpf with some compilers, this warning is triggered:
    
    libbpf.c: In function ‘bpf_object__gen_loader’:
    libbpf.c:9209:28: error: ‘calloc’ sizes specified with ‘sizeof’ in the earlier argument and not in the later argument [-Werror=calloc-transposed-args]
     9209 |         gen = calloc(sizeof(*gen), 1);
          |                            ^
    libbpf.c:9209:28: note: earlier argument should specify number of elements, later size of each element
    
    Fix this by inverting the calloc() arguments.
    
    Signed-off-by: Matteo Croce <teknoraver@meta.com>
    Signed-off-by: Andrii Nakryiko <andrii@kernel.org>
    Acked-by: Yonghong Song <yonghong.song@linux.dev>
    Link: https://lore.kernel.org/bpf/20250717200337.49168-1-technoboy85@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

libbpf: Verify that arena map exists when adding arena relocations [+ + +]

Author: Eduard Zingerman <eddyz87@gmail.com>
Date:   Fri Jul 18 15:20:59 2025 -0700

    libbpf: Verify that arena map exists when adding arena relocations
    
    [ Upstream commit 42be23e8f2dcb100cb9944b2b54b6bf41aff943d ]
    
    Fuzzer reported a memory access error in bpf_program__record_reloc()
    that happens when:
    - ".addr_space.1" section exists
    - there is a relocation referencing this section
    - there are no arena maps defined in BTF.
    
    Sanity checks for maps existence are already present in
    bpf_program__record_reloc(), hence this commit adds another one.
    
    [1] https://github.com/libbpf/libbpf/actions/runs/16375110681/job/46272998064
    
    Signed-off-by: Eduard Zingerman <eddyz87@gmail.com>
    Signed-off-by: Andrii Nakryiko <andrii@kernel.org>
    Link: https://lore.kernel.org/bpf/20250718222059.281526-1-eddyz87@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Linux: Linux 6.16.2 [+ + +]

Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Wed Aug 20 18:41:44 2025 +0200

    Linux 6.16.2
    
    Link: https://lore.kernel.org/r/20250818124505.781598737@linuxfoundation.org
    Tested-by: Ronald Warsow <rwarsow@gmx.de>
    Tested-by: Salvatore Bonaccorso <carnil@debian.org>
    Tested-by: Brett A C Sheffield <bacs@librecast.net>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Tested-by: Shuah Khan <skhan@linuxfoundation.org>
    Tested-by: Hardik Garg <hargar@linux.microsoft.com>
    Tested-by: Takeshi Ogasawara <takeshi.ogasawara@futuring-girl.com>
    Tested-by: Ron Economos <re@w6rz.net>
    Tested-by: Markus Reichelt <lkt+2023@mareichelt.com>
    Link: https://lore.kernel.org/r/20250819122844.483737955@linuxfoundation.org
    Tested-by: Ronald Warsow <rwarsow@gmx.de>
    Tested-by: Markus Reichelt <lkt+2023@mareichelt.com>
    Tested-by: Peter Schneider <pschneider1968@googlemail.com>
    Tested-by: Florian Fainelli <floria.fainelli@broadcom.com>
    Tested-by: Justin M. Forbes <jforbes@fedoraproject.org>
    Tested-by: Takeshi Ogasawara <takeshi.ogasawara@futuring-girl.com>
    Tested-by: Hardik Garg <hargar@linux.microsoft.com>
    Tested-by: Brett A C Sheffield <bacs@librecast.net>
    Tested-by: Linux Kernel Functional Testing <lkft@linaro.org>
    Tested-by: Mark Brown <broonie@kernel.org>
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Avoid in-place string operation on FDT content [+ + +]

Author: Yao Zi <ziyao@disroot.org>
Date:   Sun Aug 3 22:49:47 2025 +0800

    LoongArch: Avoid in-place string operation on FDT content
    
    commit 70a2365e18affc5ebdaab1ca6a0b3c4f3aac2ee8 upstream.
    
    In init_cpu_fullname(), a constant pointer to "model" property is
    retrieved. It's later modified by the strsep() function, which is
    illegal and corrupts kernel's FDT copy. This is shown by dmesg,
    
            OF: fdt: not creating '/sys/firmware/fdt': CRC check failed
    
    Create a mutable copy of the model property and do in-place operations
    on the mutable copy instead. loongson_sysconf.cpuname lives across the
    kernel lifetime, thus manually releasing isn't necessary.
    
    Also move the of_node_put() call for the root node after the usage of
    its property, since of_node_put() decreases the reference counter thus
    usage after the call is unsafe.
    
    Cc: stable@vger.kernel.org
    Fixes: 44a01f1f726a ("LoongArch: Parsing CPU-related information from DTS")
    Reviewed-by: Jiaxun Yang <jiaxun.yang@flygoat.com>
    Signed-off-by: Yao Zi <ziyao@disroot.org>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: BPF: Fix jump offset calculation in tailcall [+ + +]

Author: Haoran Jiang <jianghaoran@kylinos.cn>
Date:   Tue Aug 5 19:00:22 2025 +0800

    LoongArch: BPF: Fix jump offset calculation in tailcall
    
    commit cd39d9e6b7e4c58fa77783e7aedf7ada51d02ea3 upstream.
    
    The extra pass of bpf_int_jit_compile() skips JIT context initialization
    which essentially skips offset calculation leaving out_offset = -1, so
    the jmp_offset in emit_bpf_tail_call is calculated by
    
    "#define jmp_offset (out_offset - (cur_offset))"
    
    is a negative number, which is wrong. The final generated assembly are
    as follow.
    
    54:     bgeu            $a2, $t1, -8        # 0x0000004c
    58:     addi.d          $a6, $s5, -1
    5c:     bltz            $a6, -16            # 0x0000004c
    60:     alsl.d          $t2, $a2, $a1, 0x3
    64:     ld.d            $t2, $t2, 264
    68:     beq             $t2, $zero, -28     # 0x0000004c
    
    Before apply this patch, the follow test case will reveal soft lock issues.
    
    cd tools/testing/selftests/bpf/
    ./test_progs --allow=tailcalls/tailcall_bpf2bpf_1
    
    dmesg:
    watchdog: BUG: soft lockup - CPU#2 stuck for 26s! [test_progs:25056]
    
    Cc: stable@vger.kernel.org
    Fixes: 5dc615520c4d ("LoongArch: Add BPF JIT support")
    Reviewed-by: Hengqi Chen <hengqi.chen@gmail.com>
    Signed-off-by: Haoran Jiang <jianghaoran@kylinos.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Don't use %pK through printk() in unwinder [+ + +]

Author: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
Date:   Sun Aug 3 22:49:47 2025 +0800

    LoongArch: Don't use %pK through printk() in unwinder
    
    commit 2362e8124ed21445c6886806e5deaee717629ddd upstream.
    
    In the past %pK was preferable to %p as it would not leak raw pointer
    values into the kernel log.
    
    Since commit ad67b74d2469 ("printk: hash addresses printed with %p")
    the regular %p has been improved to avoid this issue.
    
    Furthermore, restricted pointers ("%pK") were never meant to be used
    through printk(). They can still unintentionally leak raw pointers or
    acquire sleeping locks in atomic contexts.
    
    Switch to the regular pointer formatting which is safer and easier to
    reason about.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Make relocate_new_kernel_size be a .quad value [+ + +]

Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Sun Aug 3 22:49:47 2025 +0800

    LoongArch: Make relocate_new_kernel_size be a .quad value
    
    commit a1a81b5477196ca1290b367404a461e046e647d5 upstream.
    
    Now relocate_new_kernel_size is a .long value, which means 32bit, so its
    high 32bit is undefined. This causes memcpy((void *)reboot_code_buffer,
    relocate_new_kernel, relocate_new_kernel_size) in machine_kexec_prepare()
    access out of range memories in some cases, and then end up with an ADE
    exception.
    
    So make relocate_new_kernel_size be a .quad value, which means 64bit, to
    avoid such errors.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: vDSO: Remove -nostdlib complier flag [+ + +]

Author: Wentao Guan <guanwentao@uniontech.com>
Date:   Tue Aug 5 19:00:22 2025 +0800

    LoongArch: vDSO: Remove -nostdlib complier flag
    
    commit d35ec48fa6c8fe0cfa4a03155109fec7677911d4 upstream.
    
    Since $(LD) is directly used, hence -nostdlib is unneeded, MIPS has
    removed this, we should remove it too.
    
    bdbf2038fbf4 ("MIPS: VDSO: remove -nostdlib compiler flag").
    
    In fact, other architectures also use $(LD) now.
    
    fe00e50b2db8 ("ARM: 8858/1: vdso: use $(LD) instead of $(CC) to link VDSO")
    691efbedc60d ("arm64: vdso: use $(LD) instead of $(CC) to link VDSO")
    2ff906994b6c ("MIPS: VDSO: Use $(LD) instead of $(CC) to link VDSO")
    2b2a25845d53 ("s390/vdso: Use $(LD) instead of $(CC) to link vDSO")
    
    Cc: stable@vger.kernel.org
    Reviewed-by: Yanteng Si <siyanteng@cqsoftware.com.cn>
    Signed-off-by: Wentao Guan <guanwentao@uniontech.com>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

loop: Avoid updating block size under exclusive owner [+ + +]

Author: Jan Kara <jack@suse.cz>
Date:   Fri Jul 11 18:32:03 2025 +0200

    loop: Avoid updating block size under exclusive owner
    
    [ Upstream commit 7e49538288e523427beedd26993d446afef1a6fb ]
    
    Syzbot came up with a reproducer where a loop device block size is
    changed underneath a mounted filesystem. This causes a mismatch between
    the block device block size and the block size stored in the superblock
    causing confusion in various places such as fs/buffer.c. The particular
    issue triggered by syzbot was a warning in __getblk_slow() due to
    requested buffer size not matching block device block size.
    
    Fix the problem by getting exclusive hold of the loop device to change
    its block size. This fails if somebody (such as filesystem) has already
    an exclusive ownership of the block device and thus prevents modifying
    the loop device under some exclusive owner which doesn't expect it.
    
    Reported-by: syzbot+01ef7a8da81a975e1ccd@syzkaller.appspotmail.com
    Signed-off-by: Jan Kara <jack@suse.cz>
    Tested-by: syzbot+01ef7a8da81a975e1ccd@syzkaller.appspotmail.com
    Link: https://lore.kernel.org/r/20250711163202.19623-2-jack@suse.cz
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

md/raid10: set chunk_sectors limit [+ + +]

Author: John Garry <john.g.garry@oracle.com>
Date:   Fri Jul 11 10:52:56 2025 +0000

    md/raid10: set chunk_sectors limit
    
    [ Upstream commit 7ef50c4c6a9c36fa3ea6f1681a80c0bf9a797345 ]
    
    Same as done for raid0, set chunk_sectors limit to appropriately set the
    atomic write size limit.
    
    Reviewed-by: Nilay Shroff <nilay@linux.ibm.com>
    Reviewed-by: Yu Kuai <yukuai3@huawei.com>
    Signed-off-by: John Garry <john.g.garry@oracle.com>
    Reviewed-by: Martin K. Petersen <martin.petersen@oracle.com>
    Link: https://lore.kernel.org/r/20250711105258.3135198-5-john.g.garry@oracle.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

md: dm-zoned-target: Initialize return variable r to avoid uninitialized use [+ + +]

Author: Purva Yeshi <purvayeshi550@gmail.com>
Date:   Thu Jul 10 13:11:57 2025 +0530

    md: dm-zoned-target: Initialize return variable r to avoid uninitialized use
    
    [ Upstream commit 487767bff572d46f7c37ad846c4078f6d6c9cc55 ]
    
    Fix Smatch-detected error:
    drivers/md/dm-zoned-target.c:1073 dmz_iterate_devices()
    error: uninitialized symbol 'r'.
    
    Smatch detects a possible use of the uninitialized variable 'r' in
    dmz_iterate_devices() because if dmz->nr_ddevs is zero, the loop is
    skipped and 'r' is returned without being set, leading to undefined
    behavior.
    
    Initialize 'r' to 0 before the loop. This ensures that if there are no
    devices to iterate over, the function still returns a defined value.
    
    Signed-off-by: Purva Yeshi <purvayeshi550@gmail.com>
    Signed-off-by: Mikulas Patocka <mpatocka@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: dvb-frontends: dib7090p: fix null-ptr-deref in dib7090p_rw_on_apb() [+ + +]

Author: Alex Guo <alexguo1023@gmail.com>
Date:   Sun Jun 15 21:32:31 2025 -0400

    media: dvb-frontends: dib7090p: fix null-ptr-deref in dib7090p_rw_on_apb()
    
    [ Upstream commit ce5cac69b2edac3e3246fee03e8f4c2a1075238b ]
    
    In dib7090p_rw_on_apb, msg is controlled by user. When msg[0].buf is null and
    msg[0].len is zero, former checks on msg[0].buf would be passed. If accessing
    msg[0].buf[2] without sanity check, null pointer deref would happen. We add
    check on msg[0].len to prevent crash. Similar issue occurs when access
    msg[1].buf[0] and msg[1].buf[1].
    
    Similar commit: commit 0ed554fd769a ("media: dvb-usb: az6027: fix null-ptr-deref in az6027_i2c_xfer()")
    
    Signed-off-by: Alex Guo <alexguo1023@gmail.com>
    Link: https://lore.kernel.org/r/20250616013231.730221-1-alexguo1023@gmail.com
    Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: dvb-frontends: w7090p: fix null-ptr-deref in w7090p_tuner_write_serpar and w7090p_tuner_read_serpar [+ + +]

Author: Alex Guo <alexguo1023@gmail.com>
Date:   Sun Jun 15 21:33:53 2025 -0400

    media: dvb-frontends: w7090p: fix null-ptr-deref in w7090p_tuner_write_serpar and w7090p_tuner_read_serpar
    
    [ Upstream commit ed0234c8458b3149f15e496b48a1c9874dd24a1b ]
    
    In w7090p_tuner_write_serpar, msg is controlled by user. When msg[0].buf is null and msg[0].len is zero, former checks on msg[0].buf would be passed. If accessing msg[0].buf[2] without sanity check, null pointer deref would happen. We add
    check on msg[0].len to prevent crash.
    
    Similar commit: commit 0ed554fd769a ("media: dvb-usb: az6027: fix null-ptr-deref in az6027_i2c_xfer()")
    
    Signed-off-by: Alex Guo <alexguo1023@gmail.com>
    Link: https://lore.kernel.org/r/20250616013353.738790-1-alexguo1023@gmail.com
    Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: hi556: Fix reset GPIO timings [+ + +]

Author: Hans de Goede <hansg@kernel.org>
Date:   Sat May 31 21:05:33 2025 +0200

    media: hi556: Fix reset GPIO timings
    
    [ Upstream commit 99f2211a9d89fe34b3fa847fd7a4475171406cd0 ]
    
    probe() requests the reset GPIO to be set to high when getting it.
    Immeditately after this hi556_resume() is called and sets the GPIO low.
    
    If the GPIO was low before requesting it this will result in the GPIO
    only very briefly spiking high and the sensor not being properly reset.
    The same problem also happens on back to back runtime suspend + resume.
    
    Fix this by adding a sleep of 2 ms in hi556_resume() before setting
    the GPIO low (if there is a reset GPIO).
    
    The final sleep is kept unconditional, because if there is e.g. no reset
    GPIO but a controllable clock then the sensor also needs some time after
    enabling the clock.
    
    Signed-off-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: i2c: set lt6911uxe's reset_gpio to GPIOD_OUT_LOW [+ + +]

Author: Dongcheng Yan <dongcheng.yan@intel.com>
Date:   Wed May 21 15:15:19 2025 +0800

    media: i2c: set lt6911uxe's reset_gpio to GPIOD_OUT_LOW
    
    commit 3c607baf68639d6bfe1a336523c4c9597f4b512a upstream.
    
    reset_gpio needs to be an output and set to GPIOD_OUT_LOW, to ensure
    lt6911uxe is in reset state during probe.
    
    This issue was found on the onboard lt6911uxe, where the reset_pin was
    not reset, causing the lt6911uxe to fail to probe.
    
    Fixes: e49563c3be09d4 ("media: i2c: add lt6911uxe hdmi bridge driver")
    Cc: stable@vger.kernel.org
    Signed-off-by: Dongcheng Yan <dongcheng.yan@intel.com>
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

media: i2c: vd55g1: Fix RATE macros not being expressed in bps [+ + +]

Author: Benjamin Mugnier <benjamin.mugnier@foss.st.com>
Date:   Wed Jun 11 10:48:30 2025 +0200

    media: i2c: vd55g1: Fix RATE macros not being expressed in bps
    
    [ Upstream commit 5b69a84005b686365e9cacb2ca942271ed74867a ]
    
    As a bit rate is expressed in bps, use MEGA instead of HZ_PER_MHZ.
    
    Signed-off-by: Benjamin Mugnier <benjamin.mugnier@foss.st.com>
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: i2c: vd55g1: Fix return code in vd55g1_enable_streams error path [+ + +]

Author: Benjamin Mugnier <benjamin.mugnier@foss.st.com>
Date:   Wed Jun 11 10:48:31 2025 +0200

    media: i2c: vd55g1: Fix return code in vd55g1_enable_streams error path
    
    commit 5931eed35cb632ff8b7e0b0cc91abc6014c64045 upstream.
    
    Enable stream was returning success even if an error occurred, fix it by
    modifying the err_rpm_put return value to -EINVAL.
    
    Signed-off-by: Benjamin Mugnier <benjamin.mugnier@foss.st.com>
    Fixes: e56616d7b23c ("media: i2c: Add driver for ST VD55G1 camera sensor")
    Cc: stable@vger.kernel.org
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

media: i2c: vd55g1: Setup sensor external clock before patching [+ + +]

Author: Benjamin Mugnier <benjamin.mugnier@foss.st.com>
Date:   Wed Jun 11 10:48:32 2025 +0200

    media: i2c: vd55g1: Setup sensor external clock before patching
    
    [ Upstream commit df2f8fd91bde57d5d5aca6adddf7e988f2e8c60e ]
    
    Proper clock configuration is required to advance through FSM states.
    Prior than this having a different clock value than default sensor's
    value was used (12 MHz) could prevent the sensor from booting.
    
    Signed-off-by: Benjamin Mugnier <benjamin.mugnier@foss.st.com>
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: ipu-bridge: Add _HID for OV5670 [+ + +]

Author: Daniel Scally <dan.scally@ideasonboard.com>
Date:   Tue May 20 14:17:46 2025 +0100

    media: ipu-bridge: Add _HID for OV5670
    
    [ Upstream commit 484f8bec3ddb453321ef0b8621c25de6ce3d0302 ]
    
    The OV5670 is found on Dell 7212 tablets paired with an IPU3 ISP
    and needs to be connected by the ipu-bridge. Add it to the list
    of supported devices.
    
    Signed-off-by: Daniel Scally <dan.scally@ideasonboard.com>
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: iris: Add handling for corrupt and drop frames [+ + +]

Author: Dikshita Agarwal <quic_dikshita@quicinc.com>
Date:   Fri May 9 14:09:03 2025 +0530

    media: iris: Add handling for corrupt and drop frames
    
    [ Upstream commit b791dcfcba3a0c46fb3e2decab31d2340c5dc313 ]
    
    Firmware attach DATACORRUPT/DROP buffer flags for the frames which
    needs to be dropped, handle it by setting VB2_BUF_STATE_ERROR for these
    buffers before calling buf_done.
    
    Reviewed-by: Bryan O'Donoghue <bryan.odonoghue@linaro.org>
    Acked-by: Vikash Garodia <quic_vgarodia@quicinc.com>
    Tested-by: Neil Armstrong <neil.armstrong@linaro.org> # on SM8550-QRD
    Tested-by: Neil Armstrong <neil.armstrong@linaro.org> # on SM8550-HDK
    Tested-by: Neil Armstrong <neil.armstrong@linaro.org> # on SM8650-QRD
    Tested-by: Neil Armstrong <neil.armstrong@linaro.org> # on SM8650-HDK
    Signed-off-by: Dikshita Agarwal <quic_dikshita@quicinc.com>
    Tested-by: Vikash Garodia <quic_vgarodia@quicinc.com> # on sa8775p-ride
    Signed-off-by: Bryan O'Donoghue <bod@kernel.org>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: raspberrypi: cfe: Fix min_reqbufs_allocation [+ + +]

Author: Tomi Valkeinen <tomi.valkeinen@ideasonboard.com>
Date:   Tue Oct 29 14:47:05 2024 +0200

    media: raspberrypi: cfe: Fix min_reqbufs_allocation
    
    [ Upstream commit 57b5a302b5d529db96ddc52fbccec005092ebb3d ]
    
    The driver checks if "vq->max_num_buffers + *nbuffers < 3", but
    vq->max_num_buffers is (by default) 32, so the check is never true. Nor
    does the check make sense.
    
    The original code in the BSP kernel was "vq->num_buffers + *nbuffers <
    3", but got mangled along the way to upstream. The intention was to make
    sure that at least 3 buffers are allocated.
    
    Fix this by removing the bad lines and setting q->min_reqbufs_allocation
    to three.
    
    Signed-off-by: Tomi Valkeinen <tomi.valkeinen@ideasonboard.com>
    Reviewed-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: tc358743: Check I2C succeeded during probe [+ + +]

Author: Dave Stevenson <dave.stevenson@raspberrypi.com>
Date:   Wed Jun 11 19:37:15 2025 +0100

    media: tc358743: Check I2C succeeded during probe
    
    [ Upstream commit 303d81635e1d9c949b370215cc94526ed81f2e3d ]
    
    The probe for the TC358743 reads the CHIPID register from
    the device and compares it to the expected value of 0.
    If the I2C request fails then that also returns 0, so
    the driver loads thinking that the device is there.
    
    Generally I2C communications are reliable so there is
    limited need to check the return value on every transfer,
    therefore only amend the one read during probe to check
    for I2C errors.
    
    Signed-off-by: Dave Stevenson <dave.stevenson@raspberrypi.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: tc358743: Increase FIFO trigger level to 374 [+ + +]

Author: Dave Stevenson <dave.stevenson@raspberrypi.com>
Date:   Wed Jun 11 19:37:14 2025 +0100

    media: tc358743: Increase FIFO trigger level to 374
    
    [ Upstream commit 86addd25314a1e77dbdcfddfeed0bab2f27da0e2 ]
    
    The existing fixed value of 16 worked for UYVY 720P60 over
    2 lanes at 594MHz, or UYVY 1080P60 over 4 lanes. (RGB888
    1080P60 needs 6 lanes at 594MHz).
    It doesn't allow for lower resolutions to work as the FIFO
    underflows.
    
    374 is required for 1080P24 or 1080P30 UYVY over 2 lanes @
    972Mbit/s, but >374 means that the FIFO underflows on 1080P50
    UYVY over 2 lanes @ 972Mbit/s.
    
    Whilst it would be nice to compute it, the required information
    isn't published by Toshiba.
    
    Signed-off-by: Dave Stevenson <dave.stevenson@raspberrypi.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: tc358743: Return an appropriate colorspace from tc358743_set_fmt [+ + +]

Author: Dave Stevenson <dave.stevenson@raspberrypi.com>
Date:   Wed Jun 11 19:37:16 2025 +0100

    media: tc358743: Return an appropriate colorspace from tc358743_set_fmt
    
    [ Upstream commit 377cc006a364dfdab2f3f221cfad63a9265200b8 ]
    
    When calling tc358743_set_fmt, the code was calling tc358743_get_fmt
    to choose a valid format. However that sets the colorspace
    based on information read back from the chip, not the colour
    format requested.
    
    The result was that if you called try or set format for UYVY
    when the current format was RGB3 then you would get told SRGB,
    and try RGB3 when current was UYVY and you would get told
    SMPTE170M.
    
    The value programmed in the VI_REP register for the colorspace
    is always set by this driver, therefore there is no need to read
    back the value, and never set to REC709.
    Return the colorspace based on the format set/tried instead.
    
    Signed-off-by: Dave Stevenson <dave.stevenson@raspberrypi.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: usb: hdpvr: disable zero-length read messages [+ + +]

Author: Wolfram Sang <wsa+renesas@sang-engineering.com>
Date:   Thu May 22 10:09:54 2025 +0200

    media: usb: hdpvr: disable zero-length read messages
    
    [ Upstream commit b5ae5a79825ba8037b0be3ef677a24de8c063abf ]
    
    This driver passes the length of an i2c_msg directly to
    usb_control_msg(). If the message is now a read and of length 0, it
    violates the USB protocol and a warning will be printed. Enable the
    I2C_AQ_NO_ZERO_LEN_READ quirk for this adapter thus forbidding 0-length
    read messages altogether.
    
    Signed-off-by: Wolfram Sang <wsa+renesas@sang-engineering.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: uvcvideo: Add quirk for HP Webcam HD 2300 [+ + +]

Author: Ricardo Ribalda <ribalda@chromium.org>
Date:   Mon Jun 2 18:21:57 2025 +0000

    media: uvcvideo: Add quirk for HP Webcam HD 2300
    
    [ Upstream commit 53b0b80e5240fec7c5a420bffb310edc83faf4fd ]
    
    HP Webcam HD 2300 does not seem to flip the FID bit according to spec.
    
    Device Descriptor:
      bLength                18
      bDescriptorType         1
      bcdUSB               2.00
      bDeviceClass          239 Miscellaneous Device
      bDeviceSubClass         2 [unknown]
      bDeviceProtocol         1 Interface Association
      bMaxPacketSize0        64
      idVendor           0x03f0 HP, Inc
      idProduct          0xe207 HP Webcam HD 2300
      bcdDevice           10.20
      iManufacturer           3 Hewlett Packard
      iProduct                1 HP Webcam HD 2300
      iSerial                 0
      bNumConfigurations      1
    
    Reported-by: Michaël Melchiore <rohel01@gmail.com>
    Closes: https://lore.kernel.org/linux-media/CA+q66aRvTigH15cUyfvzPJ2mfsDFMt=CjuYNwvAZb29w8b1KDA@mail.gmail.com
    Signed-off-by: Ricardo Ribalda <ribalda@chromium.org>
    Reviewed-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Link: https://lore.kernel.org/r/20250602-uvc-hp-quirk-v1-1-7047d94d679f@chromium.org
    Signed-off-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: uvcvideo: Do not mark valid metadata as invalid [+ + +]

Author: Ricardo Ribalda <ribalda@chromium.org>
Date:   Mon Jul 7 18:34:01 2025 +0000

    media: uvcvideo: Do not mark valid metadata as invalid
    
    commit bda2859bff0b9596a19648f3740c697ce4c71496 upstream.
    
    Currently, the driver performs a length check of the metadata buffer
    before the actual metadata size is known and before the metadata is
    decided to be copied. This results in valid metadata buffers being
    incorrectly marked as invalid.
    
    Move the length check to occur after the metadata size is determined and
    is decided to be copied.
    
    Cc: stable@vger.kernel.org
    Fixes: 088ead255245 ("media: uvcvideo: Add a metadata device node")
    Reviewed-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Reviewed-by: Hans de Goede <hansg@kernel.org>
    Signed-off-by: Ricardo Ribalda <ribalda@chromium.org>
    Link: https://lore.kernel.org/r/20250707-uvc-meta-v8-1-ed17f8b1218b@chromium.org
    Signed-off-by: Hans de Goede <hansg@kernel.org>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

media: uvcvideo: Fix 1-byte out-of-bounds read in uvc_parse_format() [+ + +]

Author: Youngjun Lee <yjjuny.lee@samsung.com>
Date:   Tue Jun 10 21:41:07 2025 +0900

    media: uvcvideo: Fix 1-byte out-of-bounds read in uvc_parse_format()
    
    commit 782b6a718651eda3478b1824b37a8b3185d2740c upstream.
    
    The buffer length check before calling uvc_parse_format() only ensured
    that the buffer has at least 3 bytes (buflen > 2), buf the function
    accesses buffer[3], requiring at least 4 bytes.
    
    This can lead to an out-of-bounds read if the buffer has exactly 3 bytes.
    
    Fix it by checking that the buffer has at least 4 bytes in
    uvc_parse_format().
    
    Signed-off-by: Youngjun Lee <yjjuny.lee@samsung.com>
    Reviewed-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Fixes: c0efd232929c ("V4L/DVB (8145a): USB Video Class driver")
    Cc: stable@vger.kernel.org
    Reviewed-by: Ricardo Ribalda <ribalda@chromium.org>
    Link: https://lore.kernel.org/r/20250610124107.37360-1-yjjuny.lee@samsung.com
    Signed-off-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

media: uvcvideo: Fix bandwidth issue for Alcor camera [+ + +]

Author: chenchangcheng <chenchangcheng@kylinos.cn>
Date:   Sat May 10 14:18:03 2025 +0800

    media: uvcvideo: Fix bandwidth issue for Alcor camera
    
    [ Upstream commit 9764401bf6f8a20eb11c2e78470f20fee91a9ea7 ]
    
    Some broken device return wrong dwMaxPayloadTransferSize fields as
    follows:
    
    [  218.632537] uvcvideo: Device requested 2752512 B/frame bandwidth.
    [  218.632598] uvcvideo: No fast enough alt setting for requested bandwidth.
    
    When dwMaxPayloadTransferSize is greater than maxpsize, it will prevent
    the camera from starting. So use the bandwidth of maxpsize.
    
    Signed-off-by: chenchangcheng <chenchangcheng@kylinos.cn>
    Reviewed-by: Ricardo Ribalda <ribalda@chromium.org>
    Reviewed-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Link: https://lore.kernel.org/r/20250510061803.811433-1-ccc194101@163.com
    Signed-off-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: uvcvideo: Set V4L2_CTRL_FLAG_DISABLED during queryctrl errors [+ + +]

Author: Ricardo Ribalda <ribalda@chromium.org>
Date:   Fri May 2 07:48:28 2025 +0000

    media: uvcvideo: Set V4L2_CTRL_FLAG_DISABLED during queryctrl errors
    
    [ Upstream commit 649c033711d7fd6e1d5d69e4cfc3fceca7de2867 ]
    
    To implement VIDIOC_QUERYCTRL, we need to know the minimum, maximum,
    step and flags of the control. For some of the controls, this involves
    querying the actual hardware.
    
    Some non-compliant cameras produce errors when we query them. These
    error can be triggered every time, sometimes, or when other controls do
    not have the "right value". Right now, we populate that error to userspace.
    When an error happens, the v4l2 framework does not copy the v4l2_queryctrl
    struct to userspace. Also, userspace apps are not ready to handle any
    other error than -EINVAL.
    
    One of the main usecases of VIDIOC_QUERYCTRL is enumerating the controls
    of a device. This is done using the V4L2_CTRL_FLAG_NEXT_CTRL flag. In
    that usecase, a non-compliant control will make it almost impossible to
    enumerate all controls of the device.
    
    A control with an invalid max/min/step/flags is better than non being
    able to enumerate the rest of the controls.
    
    This patch:
    - Retries for an extra attempt to read the control, to avoid spurious
      errors. More attempts do not seem to produce better results in the
      tested hardware.
    - Makes VIDIOC_QUERYCTRL return 0 for -EIO errors.
    - Introduces a warning in dmesg so we can have a trace of what has happened
      and sets the V4L2_CTRL_FLAG_DISABLED.
    - Makes sure we keep returning V4L2_CTRL_FLAG_DISABLED for all the next
      attempts to query that control (other operations have the same
      functionality as now).
    
    Reviewed-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Ricardo Ribalda <ribalda@chromium.org>
    Link: https://lore.kernel.org/r/20250502-uvc-eaccess-v8-1-0b8b58ac1142@chromium.org
    Signed-off-by: Hans de Goede <hansg@kernel.org>
    Signed-off-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: uvcvideo: Turn on the camera if V4L2_EVENT_SUB_FL_SEND_INITIAL [+ + +]

Author: Ricardo Ribalda <ribalda@chromium.org>
Date:   Tue Jul 1 06:45:17 2025 +0000

    media: uvcvideo: Turn on the camera if V4L2_EVENT_SUB_FL_SEND_INITIAL
    
    commit a03e32e60141058d46ea8cf4631654c43c740fdb upstream.
    
    If we subscribe to an event with V4L2_EVENT_SUB_FL_SEND_INITIAL, the
    driver needs to report back some values that require the camera to be
    powered on. But VIDIOC_SUBSCRIBE_EVENT is not part of the ioctls that
    turn on the camera.
    
    We could unconditionally turn on the camera during
    VIDIOC_SUBSCRIBE_EVENT, but it is more efficient to turn it on only
    during V4L2_EVENT_SUB_FL_SEND_INITIAL, which we believe is not a common
    usecase.
    
    To avoid a list_del if uvc_pm_get() fails, we move list_add_tail to the
    end of the function.
    
    Reviewed-by: Hans de Goede <hansg@kernel.org>
    Fixes: d1b618e79548 ("media: uvcvideo: Do not turn on the camera for some ioctls")
    Cc: stable@vger.kernel.org
    Reviewed-by: Laurent Pinchart <laurent.pinchart@ideasonboard.com>
    Signed-off-by: Ricardo Ribalda <ribalda@chromium.org>
    Link: https://lore.kernel.org/r/20250701-uvc-grannular-invert-v4-5-8003b9b89f68@chromium.org
    Signed-off-by: Hans de Goede <hansg@kernel.org>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

media: v4l2-common: Reduce warnings about missing V4L2_CID_LINK_FREQ control [+ + +]

Author: Niklas Söderlund <niklas.soderlund+renesas@ragnatech.se>
Date:   Thu May 8 10:37:45 2025 +0200

    media: v4l2-common: Reduce warnings about missing V4L2_CID_LINK_FREQ control
    
    [ Upstream commit 5a0abb8909b9dcf347fce1d201ac6686ac33fd64 ]
    
    When operating a pipeline with a missing V4L2_CID_LINK_FREQ control this
    two line warning is printed each time the pipeline is started. Reduce
    this excessive logging by only warning once for the missing control.
    
    Signed-off-by: Niklas Söderlund <niklas.soderlund+renesas@ragnatech.se>
    Signed-off-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

media: v4l2: Add support for NV12M tiled variants to v4l2_format_info() [+ + +]

Author: Marek Szyprowski <m.szyprowski@samsung.com>
Date:   Fri Jul 11 11:41:58 2025 +0200

    media: v4l2: Add support for NV12M tiled variants to v4l2_format_info()
    
    commit f7546da1d6eb8928efb89b7faacbd6c2f8f0de5c upstream.
    
    Commit 6f1466123d73 ("media: s5p-mfc: Add YV12 and I420 multiplanar
    format support") added support for the new formats to s5p-mfc driver,
    what in turn required some internal calls to the v4l2_format_info()
    function while setting up formats. This in turn broke support for the
    "old" tiled NV12MT* formats, which are not recognized by this function.
    Fix this by adding those variants of NV12M pixel format to
    v4l2_format_info() function database.
    
    Fixes: 6f1466123d73 ("media: s5p-mfc: Add YV12 and I420 multiplanar format support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Marek Szyprowski <m.szyprowski@samsung.com>
    Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

media: venus: Fix OOB read due to missing payload bound check [+ + +]

Author: Vedang Nagar <quic_vnagar@quicinc.com>
Date:   Mon May 19 12:42:22 2025 +0530

    media: venus: Fix OOB read due to missing payload bound check
    
    commit 06d6770ff0d8cc8dfd392329a8cc03e2a83e7289 upstream.
    
    Currently, The event_seq_changed() handler processes a variable number
    of properties sent by the firmware. The number of properties is indicated
    by the firmware and used to iterate over the payload. However, the
    payload size is not being validated against the actual message length.
    
    This can lead to out-of-bounds memory access if the firmware provides a
    property count that exceeds the data available in the payload. Such a
    condition can result in kernel crashes or potential information leaks if
    memory beyond the buffer is accessed.
    
    Fix this by properly validating the remaining size of the payload before
    each property access and updating bounds accordingly as properties are
    parsed.
    
    This ensures that property parsing is safely bounded within the received
    message buffer and protects against malformed or malicious firmware
    behavior.
    
    Fixes: 09c2845e8fe4 ("[media] media: venus: hfi: add Host Firmware Interface (HFI)")
    Cc: stable@vger.kernel.org
    Signed-off-by: Vedang Nagar <quic_vnagar@quicinc.com>
    Reviewed-by: Vikash Garodia <quic_vgarodia@quicinc.com>
    Reviewed-by: Bryan O'Donoghue <bryan.odonoghue@linaro.org>
    Co-developed-by: Dikshita Agarwal <quic_dikshita@quicinc.com>
    Signed-off-by: Dikshita Agarwal <quic_dikshita@quicinc.com>
    Signed-off-by: Bryan O'Donoghue <bod@kernel.org>
    Signed-off-by: Hans Verkuil <hverkuil@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mei: bus: Check for still connected devices in mei_cl_bus_dev_release() [+ + +]

Author: Hans de Goede <hansg@kernel.org>
Date:   Mon Jun 23 10:50:52 2025 +0200

    mei: bus: Check for still connected devices in mei_cl_bus_dev_release()
    
    [ Upstream commit 35e8a426b16adbecae7a4e0e3c00fc8d0273db53 ]
    
    mei_cl_bus_dev_release() also frees the mei-client (struct mei_cl)
    belonging to the device being released.
    
    If there are bugs like the just fixed bug in the ACE/CSI2 mei drivers,
    the mei-client being freed might still be part of the mei_device's
    file_list and iterating over this list after the freeing will then trigger
    a use-afer-free bug.
    
    Add a check to mei_cl_bus_dev_release() to make sure that the to-be-freed
    mei-client is not on the mei_device's file_list.
    
    Signed-off-by: Hans de Goede <hansg@kernel.org>
    Link: https://lore.kernel.org/r/20250623085052.12347-11-hansg@kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mfd: axp20x: Set explicit ID for AXP313 regulator [+ + +]

Author: Chen-Yu Tsai <wens@csie.org>
Date:   Fri Jun 20 01:32:07 2025 +0800

    mfd: axp20x: Set explicit ID for AXP313 regulator
    
    [ Upstream commit 88828c7e940dd45d139ad4a39d702b23840a37c5 ]
    
    On newer boards featuring the A523 SoC, the AXP323 (related to the
    AXP313) is paired with the AXP717 and serves as a secondary PMIC
    providing additional regulator outputs. However the MFD cells are all
    registered with PLATFORM_DEVID_NONE, which causes the regulator cells
    to conflict with each other.
    
    Commit e37ec3218870 ("mfd: axp20x: Allow multiple regulators") attempted
    to fix this by switching to PLATFORM_DEVID_AUTO so that the device names
    would all be different, however that broke IIO channel mapping, which is
    also tied to the device names. As a result the change was later reverted.
    
    Instead, here we attempt to make sure the AXP313/AXP323 regulator cell
    does not conflict by explicitly giving it an ID number. This was
    previously done for the AXP809+AXP806 pair used with the A80 SoC.
    
    Signed-off-by: Chen-Yu Tsai <wens@csie.org>
    Link: https://lore.kernel.org/r/20250619173207.3367126-1-wens@kernel.org
    Signed-off-by: Lee Jones <lee@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mfd: cros_ec: Separate charge-control probing from USB-PD [+ + +]

Author: Thomas Weißschuh <linux@weissschuh.net>
Date:   Wed May 21 16:42:51 2025 +0200

    mfd: cros_ec: Separate charge-control probing from USB-PD
    
    commit e40fc1160d491c3bcaf8e940ae0dde0a7c5e8e14 upstream.
    
    The charge-control subsystem in the ChromeOS EC is not strictly tied to
    its USB-PD subsystem.
    
    Since commit 7613bc0d116a ("mfd: cros_ec: Don't load charger with UCSI")
    the presence of EC_FEATURE_UCSI_PPM would inhibit the probing of the
    charge-control driver.
    
    Furthermore recent versions of the EC firmware in Framework laptops
    hard-disable EC_FEATURE_USB_PD to avoid probing cros-usbpd-charger,
    which then also breaks cros-charge-control.
    
    Instead use the dedicated EC_FEATURE_CHARGER.
    
    Cc: stable@vger.kernel.org
    Link: https://github.com/FrameworkComputer/EmbeddedController/commit/1d7bcf1d50137c8c01969eb65880bc83e424597e
    Fixes: 555b5fcdb844 ("mfd: cros_ec: Register charge control subdevice")
    Signed-off-by: Thomas Weißschuh <linux@weissschuh.net>
    Reviewed-by: Tzung-Bi Shih <tzungbi@kernel.org>
    Tested-by: Tom Vincent <linux@tlvince.com>
    Link: https://lore.kernel.org/r/20250521-cros-ec-mfd-chctl-probe-v1-1-6ebfe3a6efa7@weissschuh.net
    Signed-off-by: Lee Jones <lee@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

MIPS: Don't crash in stack_top() for tasks without ABI or vDSO [+ + +]

Author: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
Date:   Wed Jun 11 13:28:26 2025 +0200

    MIPS: Don't crash in stack_top() for tasks without ABI or vDSO
    
    [ Upstream commit e9f4a6b3421e936c3ee9d74710243897d74dbaa2 ]
    
    Not all tasks have an ABI associated or vDSO mapped,
    for example kthreads never do.
    If such a task ever ends up calling stack_top(), it will derefence the
    NULL ABI pointer and crash.
    
    This can for example happen when using kunit:
    
        mips_stack_top+0x28/0xc0
        arch_pick_mmap_layout+0x190/0x220
        kunit_vm_mmap_init+0xf8/0x138
        __kunit_add_resource+0x40/0xa8
        kunit_vm_mmap+0x88/0xd8
        usercopy_test_init+0xb8/0x240
        kunit_try_run_case+0x5c/0x1a8
        kunit_generic_run_threadfn_adapter+0x28/0x50
        kthread+0x118/0x240
        ret_from_kernel_thread+0x14/0x1c
    
    Only dereference the ABI point if it is set.
    
    The GIC page is also included as it is specific to the vDSO.
    Also move the randomization adjustment into the same conditional.
    
    Signed-off-by: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
    Reviewed-by: David Gow <davidgow@google.com>
    Reviewed-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Thomas Bogendoerfer <tsbogend@alpha.franken.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

MIPS: lantiq: falcon: sysctrl: fix request memory check logic [+ + +]

Author: Shiji Yang <yangshiji66@outlook.com>
Date:   Wed Jun 18 22:53:23 2025 +0800

    MIPS: lantiq: falcon: sysctrl: fix request memory check logic
    
    [ Upstream commit 9c9a7ff9882fc6ba7d2f4050697e8bb80383e8dc ]
    
    request_mem_region() will return NULL instead of error code
    when the memory request fails. Therefore, we should check if
    the return value is non-zero instead of less than zero. In
    this way, this patch also fixes the build warnings:
    
    arch/mips/lantiq/falcon/sysctrl.c:214:50: error: ordered comparison of pointer with integer zero [-Werror=extra]
      214 |                                 res_status.name) < 0) ||
          |                                                  ^
    arch/mips/lantiq/falcon/sysctrl.c:216:47: error: ordered comparison of pointer with integer zero [-Werror=extra]
      216 |                                 res_ebu.name) < 0) ||
          |                                               ^
    arch/mips/lantiq/falcon/sysctrl.c:219:50: error: ordered comparison of pointer with integer zero [-Werror=extra]
      219 |                                 res_sys[0].name) < 0) ||
          |                                                  ^
    arch/mips/lantiq/falcon/sysctrl.c:222:50: error: ordered comparison of pointer with integer zero [-Werror=extra]
      222 |                                 res_sys[1].name) < 0) ||
          |                                                  ^
    arch/mips/lantiq/falcon/sysctrl.c:225:50: error: ordered comparison of pointer with integer zero [-Werror=extra]
      225 |                                 res_sys[2].name) < 0))
          |
    
    Signed-off-by: Shiji Yang <yangshiji66@outlook.com>
    Signed-off-by: Thomas Bogendoerfer <tsbogend@alpha.franken.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

MIPS: vpe-mt: add missing prototypes for vpe_{alloc,start,stop,free} [+ + +]

Author: Shiji Yang <yangshiji66@outlook.com>
Date:   Thu Jul 3 21:06:32 2025 +0800

    MIPS: vpe-mt: add missing prototypes for vpe_{alloc,start,stop,free}
    
    [ Upstream commit 844615dd0f2d95c018ec66b943e08af22b62aff3 ]
    
    These functions are exported but their prototypes are not defined.
    This patch adds the missing function prototypes to fix the following
    compilation warnings:
    
    arch/mips/kernel/vpe-mt.c:180:7: error: no previous prototype for 'vpe_alloc' [-Werror=missing-prototypes]
      180 | void *vpe_alloc(void)
          |       ^~~~~~~~~
    arch/mips/kernel/vpe-mt.c:198:5: error: no previous prototype for 'vpe_start' [-Werror=missing-prototypes]
      198 | int vpe_start(void *vpe, unsigned long start)
          |     ^~~~~~~~~
    arch/mips/kernel/vpe-mt.c:208:5: error: no previous prototype for 'vpe_stop' [-Werror=missing-prototypes]
      208 | int vpe_stop(void *vpe)
          |     ^~~~~~~~
    arch/mips/kernel/vpe-mt.c:229:5: error: no previous prototype for 'vpe_free' [-Werror=missing-prototypes]
      229 | int vpe_free(void *vpe)
          |     ^~~~~~~~
    
    Signed-off-by: Shiji Yang <yangshiji66@outlook.com>
    Signed-off-by: Thomas Bogendoerfer <tsbogend@alpha.franken.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

misc: rtsx: usb: Ensure mmc child device is active when card is present [+ + +]

Author: Ricky Wu <ricky_wu@realtek.com>
Date:   Fri Jul 11 22:01:43 2025 +0800

    misc: rtsx: usb: Ensure mmc child device is active when card is present
    
    commit 966c5cd72be8989c8a559ddef8e8ff07a37c5eb0 upstream.
    
    When a card is present in the reader, the driver currently defers
    autosuspend by returning -EAGAIN during the suspend callback to
    trigger USB remote wakeup signaling. However, this does not guarantee
    that the mmc child device has been resumed, which may cause issues if
    it remains suspended while the card is accessible.
    This patch ensures that all child devices, including the mmc host
    controller, are explicitly resumed before returning -EAGAIN. This
    fixes a corner case introduced by earlier remote wakeup handling,
    improving reliability of runtime PM when a card is inserted.
    
    Fixes: 883a87ddf2f1 ("misc: rtsx_usb: Use USB remote wakeup signaling for card insertion detection")
    Cc: stable@vger.kernel.org
    Signed-off-by: Ricky Wu <ricky_wu@realtek.com>
    Reviewed-by: Ulf Hansson <ulf.hansson@linaro.org>
    Link: https://lore.kernel.org/r/20250711140143.2105224-1-ricky_wu@realtek.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm, slab: restore NUMA policy support for large kmalloc [+ + +]

Author: Vlastimil Babka <vbabka@suse.cz>
Date:   Mon Jun 2 13:02:12 2025 +0200

    mm, slab: restore NUMA policy support for large kmalloc
    
    commit e2d18cbf178775ad377ad88ee55e6e183c38d262 upstream.
    
    The slab allocator observes the task's NUMA policy in various places
    such as allocating slab pages. Large kmalloc() allocations used to do
    that too, until an unintended change by c4cab557521a ("mm/slab_common:
    cleanup kmalloc_large()") resulted in ignoring mempolicy and just
    preferring the local node. Restore the NUMA policy support.
    
    Fixes: c4cab557521a ("mm/slab_common: cleanup kmalloc_large()")
    Cc: <stable@vger.kernel.org>
    Acked-by: Christoph Lameter (Ampere) <cl@gentwo.org>
    Acked-by: Roman Gushchin <roman.gushchin@linux.dev>
    Reviewed-by: Harry Yoo <harry.yoo@oracle.com>
    Signed-off-by: Vlastimil Babka <vbabka@suse.cz>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/damon/core: commit damos->target_nid [+ + +]

Author: Bijan Tabatabai <bijantabatab@micron.com>
Date:   Tue Jul 8 19:47:29 2025 -0500

    mm/damon/core: commit damos->target_nid
    
    commit 579bd5006fe7f4a7abb32da0160d376476cab67d upstream.
    
    When committing new scheme parameters from the sysfs, the target_nid field
    of the damos struct would not be copied.  This would result in the
    target_nid field to retain its original value, despite being updated in
    the sysfs interface.
    
    This patch fixes this issue by copying target_nid in damos_commit().
    
    Link: https://lkml.kernel.org/r/20250709004729.17252-1-bijan311@gmail.com
    Fixes: 83dc7bbaecae ("mm/damon/sysfs: use damon_commit_ctx()")
    Signed-off-by: Bijan Tabatabai <bijantabatab@micron.com>
    Reviewed-by: SeongJae Park <sj@kernel.org>
    Cc: Jonathan Corbet <corbet@lwn.net>
    Cc: Ravi Shankar Jonnalagadda <ravis.opensrc@micron.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/huge_memory: don't ignore queried cachemode in vmf_insert_pfn_pud() [+ + +]

Author: David Hildenbrand <david@redhat.com>
Date:   Fri Jun 13 11:27:00 2025 +0200

    mm/huge_memory: don't ignore queried cachemode in vmf_insert_pfn_pud()
    
    commit 09fefdca80aebd1023e827cb0ee174983d829d18 upstream.
    
    Patch series "mm/huge_memory: vmf_insert_folio_*() and
    vmf_insert_pfn_pud() fixes", v3.
    
    While working on improving vm_normal_page() and friends, I stumbled over
    this issues: refcounted "normal" folios must not be marked using
    pmd_special() / pud_special().  Otherwise, we're effectively telling the
    system that these folios are no "normal", violating the rules we
    documented for vm_normal_page().
    
    Fortunately, there are not many pmd_special()/pud_special() users yet.  So
    far there doesn't seem to be serious damage.
    
    Tested using the ndctl tests ("ndctl:dax" suite).
    
    
    This patch (of 3):
    
    We set up the cache mode but ...  don't forward the updated pgprot to
    insert_pfn_pud().
    
    Only a problem on x86-64 PAT when mapping PFNs using PUDs that require a
    special cachemode.
    
    Fix it by using the proper pgprot where the cachemode was setup.
    
    It is unclear in which configurations we would get the cachemode wrong:
    through vfio seems possible.  Getting cachemodes wrong is usually ...
    bad.  As the fix is easy, let's backport it to stable.
    
    Identified by code inspection.
    
    Link: https://lkml.kernel.org/r/20250613092702.1943533-1-david@redhat.com
    Link: https://lkml.kernel.org/r/20250613092702.1943533-2-david@redhat.com
    Fixes: 7b806d229ef1 ("mm: remove vmf_insert_pfn_xxx_prot() for huge page-table entries")
    Signed-off-by: David Hildenbrand <david@redhat.com>
    Reviewed-by: Dan Williams <dan.j.williams@intel.com>
    Reviewed-by: Lorenzo Stoakes <lorenzo.stoakes@oracle.com>
    Reviewed-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Oscar Salvador <osalvador@suse.de>
    Tested-by: Dan Williams <dan.j.williams@intel.com>
    Cc: Alistair Popple <apopple@nvidia.com>
    Cc: Baolin Wang <baolin.wang@linux.alibaba.com>
    Cc: Dev Jain <dev.jain@arm.com>
    Cc: Liam Howlett <liam.howlett@oracle.com>
    Cc: Mariano Pache <npache@redhat.com>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Mike Rapoport <rppt@kernel.org>
    Cc: Ryan Roberts <ryan.roberts@arm.com>
    Cc: Suren Baghdasaryan <surenb@google.com>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Cc: Zi Yan <ziy@nvidia.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/kmemleak: avoid deadlock by moving pr_warn() outside kmemleak_lock [+ + +]

Author: Breno Leitao <leitao@debian.org>
Date:   Thu Jul 31 02:57:18 2025 -0700

    mm/kmemleak: avoid deadlock by moving pr_warn() outside kmemleak_lock
    
    commit 47b0f6d8f0d2be4d311a49e13d2fd5f152f492b2 upstream.
    
    When netpoll is enabled, calling pr_warn_once() while holding
    kmemleak_lock in mem_pool_alloc() can cause a deadlock due to lock
    inversion with the netconsole subsystem.  This occurs because
    pr_warn_once() may trigger netpoll, which eventually leads to
    __alloc_skb() and back into kmemleak code, attempting to reacquire
    kmemleak_lock.
    
    This is the path for the deadlock.
    
    mem_pool_alloc()
      -> raw_spin_lock_irqsave(&kmemleak_lock, flags);
          -> pr_warn_once()
              -> netconsole subsystem
                 -> netpoll
                     -> __alloc_skb
                       -> __create_object
                         -> raw_spin_lock_irqsave(&kmemleak_lock, flags);
    
    Fix this by setting a flag and issuing the pr_warn_once() after
    kmemleak_lock is released.
    
    Link: https://lkml.kernel.org/r/20250731-kmemleak_lock-v1-1-728fd470198f@debian.org
    Fixes: c5665868183f ("mm: kmemleak: use the memory pool for early allocations")
    Signed-off-by: Breno Leitao <leitao@debian.org>
    Reported-by: Jakub Kicinski <kuba@kernel.org>
    Acked-by: Catalin Marinas <catalin.marinas@arm.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/kmemleak: avoid soft lockup in __kmemleak_do_cleanup() [+ + +]

Author: Waiman Long <longman@redhat.com>
Date:   Mon Jul 28 15:02:48 2025 -0400

    mm/kmemleak: avoid soft lockup in __kmemleak_do_cleanup()
    
    commit d1534ae23c2b6be350c8ab060803fbf6e9682adc upstream.
    
    A soft lockup warning was observed on a relative small system x86-64
    system with 16 GB of memory when running a debug kernel with kmemleak
    enabled.
    
      watchdog: BUG: soft lockup - CPU#8 stuck for 33s! [kworker/8:1:134]
    
    The test system was running a workload with hot unplug happening in
    parallel.  Then kemleak decided to disable itself due to its inability to
    allocate more kmemleak objects.  The debug kernel has its
    CONFIG_DEBUG_KMEMLEAK_MEM_POOL_SIZE set to 40,000.
    
    The soft lockup happened in kmemleak_do_cleanup() when the existing
    kmemleak objects were being removed and deleted one-by-one in a loop via a
    workqueue.  In this particular case, there are at least 40,000 objects
    that need to be processed and given the slowness of a debug kernel and the
    fact that a raw_spinlock has to be acquired and released in
    __delete_object(), it could take a while to properly handle all these
    objects.
    
    As kmemleak has been disabled in this case, the object removal and
    deletion process can be further optimized as locking isn't really needed.
    However, it is probably not worth the effort to optimize for such an edge
    case that should rarely happen.  So the simple solution is to call
    cond_resched() at periodic interval in the iteration loop to avoid soft
    lockup.
    
    Link: https://lkml.kernel.org/r/20250728190248.605750-1-longman@redhat.com
    Signed-off-by: Waiman Long <longman@redhat.com>
    Acked-by: Catalin Marinas <catalin.marinas@arm.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/memory-tier: fix abstract distance calculation overflow [+ + +]

Author: Li Zhijian <lizhijian@fujitsu.com>
Date:   Tue Jun 10 14:27:51 2025 +0800

    mm/memory-tier: fix abstract distance calculation overflow
    
    commit cce35103135c7ffc7bebc32ebfc74fe1f2c3cb5d upstream.
    
    In mt_perf_to_adistance(), the calculation of abstract distance (adist)
    involves multiplying several int values including
    MEMTIER_ADISTANCE_DRAM.
    
    *adist = MEMTIER_ADISTANCE_DRAM *
                    (perf->read_latency + perf->write_latency) /
                    (default_dram_perf.read_latency + default_dram_perf.write_latency) *
                    (default_dram_perf.read_bandwidth + default_dram_perf.write_bandwidth) /
                    (perf->read_bandwidth + perf->write_bandwidth);
    
    Since these values can be large, the multiplication may exceed the
    maximum value of an int (INT_MAX) and overflow (Our platform did),
    leading to an incorrect adist.
    
    User-visible impact:
    The memory tiering subsystem will misinterpret slow memory (like CXL)
    as faster than DRAM, causing inappropriate demotion of pages from
    CXL (slow memory) to DRAM (fast memory).
    
    For example, we will see the following demotion chains from the dmesg, where
    Node0,1 are DRAM, and Node2,3 are CXL node:
     Demotion targets for Node 0: null
     Demotion targets for Node 1: null
     Demotion targets for Node 2: preferred: 0-1, fallback: 0-1
     Demotion targets for Node 3: preferred: 0-1, fallback: 0-1
    
    Change MEMTIER_ADISTANCE_DRAM to be a long constant by writing it with
    the 'L' suffix.  This prevents the overflow because the multiplication
    will then be done in the long type which has a larger range.
    
    Link: https://lkml.kernel.org/r/20250611023439.2845785-1-lizhijian@fujitsu.com
    Link: https://lkml.kernel.org/r/20250610062751.2365436-1-lizhijian@fujitsu.com
    Fixes: 3718c02dbd4c ("acpi, hmat: calculate abstract distance with HMAT")
    Signed-off-by: Li Zhijian <lizhijian@fujitsu.com>
    Reviewed-by: Huang Ying <ying.huang@linux.alibaba.com>
    Acked-by: Balbir Singh <balbirs@nvidia.com>
    Reviewed-by: Donet Tom <donettom@linux.ibm.com>
    Reviewed-by: Oscar Salvador <osalvador@suse.de>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/ptdump: take the memory hotplug lock inside ptdump_walk_pgd() [+ + +]

Author: Anshuman Khandual <anshuman.khandual@arm.com>
Date:   Fri Jun 20 10:54:27 2025 +0530

    mm/ptdump: take the memory hotplug lock inside ptdump_walk_pgd()
    
    commit 59305202c67fea50378dcad0cc199dbc13a0e99a upstream.
    
    Memory hot remove unmaps and tears down various kernel page table regions
    as required.  The ptdump code can race with concurrent modifications of
    the kernel page tables.  When leaf entries are modified concurrently, the
    dump code may log stale or inconsistent information for a VA range, but
    this is otherwise not harmful.
    
    But when intermediate levels of kernel page table are freed, the dump code
    will continue to use memory that has been freed and potentially
    reallocated for another purpose.  In such cases, the ptdump code may
    dereference bogus addresses, leading to a number of potential problems.
    
    To avoid the above mentioned race condition, platforms such as arm64,
    riscv and s390 take memory hotplug lock, while dumping kernel page table
    via the sysfs interface /sys/kernel/debug/kernel_page_tables.
    
    Similar race condition exists while checking for pages that might have
    been marked W+X via /sys/kernel/debug/kernel_page_tables/check_wx_pages
    which in turn calls ptdump_check_wx().  Instead of solving this race
    condition again, let's just move the memory hotplug lock inside generic
    ptdump_check_wx() which will benefit both the scenarios.
    
    Drop get_online_mems() and put_online_mems() combination from all existing
    platform ptdump code paths.
    
    Link: https://lkml.kernel.org/r/20250620052427.2092093-1-anshuman.khandual@arm.com
    Fixes: bbd6ec605c0f ("arm64/mm: Enable memory hot remove")
    Signed-off-by: Anshuman Khandual <anshuman.khandual@arm.com>
    Acked-by: David Hildenbrand <david@redhat.com>
    Reviewed-by: Dev Jain <dev.jain@arm.com>
    Acked-by: Alexander Gordeev <agordeev@linux.ibm.com>    [s390]
    Cc: Catalin Marinas <catalin.marinas@arm.com>
    Cc: Will Deacon <will@kernel.org>
    Cc: Ryan Roberts <ryan.roberts@arm.com>
    Cc: Paul Walmsley <paul.walmsley@sifive.com>
    Cc: Palmer Dabbelt <palmer@dabbelt.com>
    Cc: Alexander Gordeev <agordeev@linux.ibm.com>
    Cc: Gerald Schaefer <gerald.schaefer@linux.ibm.com>
    Cc: Heiko Carstens <hca@linux.ibm.com>
    Cc: Vasily Gorbik <gor@linux.ibm.com>
    Cc: Christian Borntraeger <borntraeger@linux.ibm.com>
    Cc: Sven Schnelle <svens@linux.ibm.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/shmem, swap: improve cached mTHP handling and fix potential hang [+ + +]

Author: Kairui Song <kasong@tencent.com>
Date:   Mon Jul 28 15:52:59 2025 +0800

    mm/shmem, swap: improve cached mTHP handling and fix potential hang
    
    commit 5c241ed8d031693dadf33dd98ed2e7cc363e9b66 upstream.
    
    The current swap-in code assumes that, when a swap entry in shmem mapping
    is order 0, its cached folios (if present) must be order 0 too, which
    turns out not always correct.
    
    The problem is shmem_split_large_entry is called before verifying the
    folio will eventually be swapped in, one possible race is:
    
        CPU1                          CPU2
    shmem_swapin_folio
    /* swap in of order > 0 swap entry S1 */
      folio = swap_cache_get_folio
      /* folio = NULL */
      order = xa_get_order
      /* order > 0 */
      folio = shmem_swap_alloc_folio
      /* mTHP alloc failure, folio = NULL */
      <... Interrupted ...>
                                     shmem_swapin_folio
                                     /* S1 is swapped in */
                                     shmem_writeout
                                     /* S1 is swapped out, folio cached */
      shmem_split_large_entry(..., S1)
      /* S1 is split, but the folio covering it has order > 0 now */
    
    Now any following swapin of S1 will hang: `xa_get_order` returns 0, and
    folio lookup will return a folio with order > 0.  The
    `xa_get_order(&mapping->i_pages, index) != folio_order(folio)` will always
    return false causing swap-in to return -EEXIST.
    
    And this looks fragile.  So fix this up by allowing seeing a larger folio
    in swap cache, and check the whole shmem mapping range covered by the
    swapin have the right swap value upon inserting the folio.  And drop the
    redundant tree walks before the insertion.
    
    This will actually improve performance, as it avoids two redundant Xarray
    tree walks in the hot path, and the only side effect is that in the
    failure path, shmem may redundantly reallocate a few folios causing
    temporary slight memory pressure.
    
    And worth noting, it may seems the order and value check before inserting
    might help reducing the lock contention, which is not true.  The swap
    cache layer ensures raced swapin will either see a swap cache folio or
    failed to do a swapin (we have SWAP_HAS_CACHE bit even if swap cache is
    bypassed), so holding the folio lock and checking the folio flag is
    already good enough for avoiding the lock contention.  The chance that a
    folio passes the swap entry value check but the shmem mapping slot has
    changed should be very low.
    
    Link: https://lkml.kernel.org/r/20250728075306.12704-1-ryncsn@gmail.com
    Link: https://lkml.kernel.org/r/20250728075306.12704-2-ryncsn@gmail.com
    Fixes: 809bc86517cc ("mm: shmem: support large folio swap out")
    Signed-off-by: Kairui Song <kasong@tencent.com>
    Reviewed-by: Kemeng Shi <shikemeng@huaweicloud.com>
    Reviewed-by: Baolin Wang <baolin.wang@linux.alibaba.com>
    Tested-by: Baolin Wang <baolin.wang@linux.alibaba.com>
    Cc: Baoquan He <bhe@redhat.com>
    Cc: Barry Song <baohua@kernel.org>
    Cc: Chris Li <chrisl@kernel.org>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
    Cc: Nhat Pham <nphamcs@gmail.com>
    Cc: Dev Jain <dev.jain@arm.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/smaps: fix race between smaps_hugetlb_range and migration [+ + +]

Author: Jinjiang Tu <tujinjiang@huawei.com>
Date:   Thu Jul 24 17:09:56 2025 +0800

    mm/smaps: fix race between smaps_hugetlb_range and migration
    
    [ Upstream commit 45d19b4b6c2d422771c29b83462d84afcbb33f01 ]
    
    smaps_hugetlb_range() handles the pte without holdling ptl, and may be
    concurrenct with migration, leaing to BUG_ON in pfn_swap_entry_to_page().
    The race is as follows.
    
    smaps_hugetlb_range              migrate_pages
      huge_ptep_get
                                       remove_migration_ptes
                                       folio_unlock
      pfn_swap_entry_folio
        BUG_ON
    
    To fix it, hold ptl lock in smaps_hugetlb_range().
    
    Link: https://lkml.kernel.org/r/20250724090958.455887-1-tujinjiang@huawei.com
    Link: https://lkml.kernel.org/r/20250724090958.455887-2-tujinjiang@huawei.com
    Fixes: 25ee01a2fca0 ("mm: hugetlb: proc: add hugetlb-related fields to /proc/PID/smaps")
    Signed-off-by: Jinjiang Tu <tujinjiang@huawei.com>
    Acked-by: David Hildenbrand <david@redhat.com>
    Cc: Andrei Vagin <avagin@gmail.com>
    Cc: Andrii Nakryiko <andrii@kernel.org>
    Cc: Baolin Wang <baolin.wang@linux.alibaba.com>
    Cc: Brahmajit Das <brahmajit.xyz@gmail.com>
    Cc: Catalin Marinas <catalin.marinas@arm.com>
    Cc: Christophe Leroy <christophe.leroy@csgroup.eu>
    Cc: David Rientjes <rientjes@google.com>
    Cc: Dev Jain <dev.jain@arm.com>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Joern Engel <joern@logfs.org>
    Cc: Kefeng Wang <wangkefeng.wang@huawei.com>
    Cc: Lorenzo Stoakes <lorenzo.stoakes@oracle.com>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Ryan Roberts <ryan.roberts@arm.com>
    Cc: Thiago Jung Bauermann <thiago.bauermann@linaro.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mmc: rtsx_usb_sdmmc: Fix error-path in sd_set_power_mode() [+ + +]

Author: Ulf Hansson <ulf.hansson@linaro.org>
Date:   Tue Jun 10 13:16:23 2025 +0200

    mmc: rtsx_usb_sdmmc: Fix error-path in sd_set_power_mode()
    
    [ Upstream commit 47a255f7d2eabee06cfbf5b1c2379749442fd01d ]
    
    In the error path of sd_set_power_mode() we don't update host->power_mode,
    which could lead to an imbalance of the runtime PM usage count. Fix this by
    always updating host->power_mode.
    
    Reviewed-by: Avri Altman <avri.altman@sandisk.com>
    Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>
    Acked-by: Ricky Wu <ricky_wu@realtek.com>
    Link: https://lore.kernel.org/r/20250610111633.504366-2-ulf.hansson@linaro.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mmc: sdhci-esdhc-imx: Don't change pinctrl in suspend if wakeup source [+ + +]

Author: Haibo Chen <haibo.chen@nxp.com>
Date:   Wed May 21 11:31:34 2025 +0800

    mmc: sdhci-esdhc-imx: Don't change pinctrl in suspend if wakeup source
    
    [ Upstream commit 031d9e30d569ca15ca32f64357c83eee6488e09d ]
    
    The pinctrl sleep state may config the pin mux to certain function to save
    power in system suspend. Unfortunately this doesn't work if usdhc is used
    as a wakeup source, like waking up on SDIO irqs or card-detect irqs. In
    these cases, we need pin mux to be configured to usdhc function pad.
    
    The issue is found on imx93-11x11-evk board, where WiFI over SDIO with
    in-band irqs fails to wakeup the system, because the DATA[1] pin has been
    set to GPIO function.
    
    To fix the problem, don't change the pinctrl state in suspend if there is a
    system wakeup enabled.
    
    Signed-off-by: Haibo Chen <haibo.chen@nxp.com>
    Signed-off-by: Luke Wang <ziniu.wang_1@nxp.com>
    Link: https://lore.kernel.org/r/20250521033134.112671-1-ziniu.wang_1@nxp.com
    Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mmc: sdhci-msm: Ensure SD card power isn't ON when card removed [+ + +]

Author: Sarthak Garg <quic_sartgarg@quicinc.com>
Date:   Tue Jul 1 15:36:59 2025 +0530

    mmc: sdhci-msm: Ensure SD card power isn't ON when card removed
    
    [ Upstream commit db58532188ebf51d52b1d7693d9e94c76b926e9f ]
    
    Many mobile phones feature multi-card tray designs, where the same
    tray is used for both SD and SIM cards. If the SD card is placed
    at the outermost location in the tray, the SIM card may come in
    contact with SD card power-supply while removing the tray, possibly
    resulting in SIM damage.
    
    To prevent that, make sure the SD card is really inserted by reading
    the Card Detect pin state. If it's not, turn off the power in
    sdhci_msm_check_power_status() and also set the BUS_FAIL power state
    on the controller as part of pwr_irq handling for BUS_ON request.
    
    Signed-off-by: Sarthak Garg <quic_sartgarg@quicinc.com>
    Acked-by: Adrian Hunter <adrian.hunter@intel.com>
    Link: https://lore.kernel.org/r/20250701100659.3310386-1-quic_sartgarg@quicinc.com
    Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

module: Prevent silent truncation of module name in delete_module(2) [+ + +]

Author: Petr Pavlu <petr.pavlu@suse.com>
Date:   Mon Jun 30 16:32:32 2025 +0200

    module: Prevent silent truncation of module name in delete_module(2)
    
    [ Upstream commit a6323bd4e611567913e23df5b58f2d4e4da06789 ]
    
    Passing a module name longer than MODULE_NAME_LEN to the delete_module
    syscall results in its silent truncation. This really isn't much of
    a problem in practice, but it could theoretically lead to the removal of an
    incorrect module. It is more sensible to return ENAMETOOLONG or ENOENT in
    such a case.
    
    Update the syscall to return ENOENT, as documented in the delete_module(2)
    man page to mean "No module by that name exists." This is appropriate
    because a module with a name longer than MODULE_NAME_LEN cannot be loaded
    in the first place.
    
    Signed-off-by: Petr Pavlu <petr.pavlu@suse.com>
    Reviewed-by: Daniel Gomez <da.gomez@samsung.com>
    Link: https://lore.kernel.org/r/20250630143535.267745-2-petr.pavlu@suse.com
    Signed-off-by: Daniel Gomez <da.gomez@samsung.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/mlx5e: Properly access RCU protected qdisc_sleeping variable [+ + +]

Author: Leon Romanovsky <leon@kernel.org>
Date:   Wed Jul 16 17:17:49 2025 +0300

    net/mlx5e: Properly access RCU protected qdisc_sleeping variable
    
    [ Upstream commit 2a601b2d35623065d31ebaf697b07502d54878c9 ]
    
    qdisc_sleeping variable is declared as "struct Qdisc __rcu" and
    as such needs proper annotation while accessing it.
    
    Without rtnl_dereference(), the following error is generated by sparse:
    
    drivers/net/ethernet/mellanox/mlx5/core/en/qos.c:377:40: warning:
      incorrect type in initializer (different address spaces)
    drivers/net/ethernet/mellanox/mlx5/core/en/qos.c:377:40:    expected
      struct Qdisc *qdisc
    drivers/net/ethernet/mellanox/mlx5/core/en/qos.c:377:40:    got struct
      Qdisc [noderef] __rcu *qdisc_sleeping
    
    Signed-off-by: Leon Romanovsky <leonro@nvidia.com>
    Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
    Reviewed-by: Michal Swiatkowski <michal.swiatkowski@linux.intel.com>
    Link: https://patch.msgid.link/1752675472-201445-4-git-send-email-tariqt@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/sched: ets: use old 'nbands' while purging unused classes [+ + +]

Author: Davide Caratti <dcaratti@redhat.com>
Date:   Tue Aug 12 18:40:29 2025 +0200

    net/sched: ets: use old 'nbands' while purging unused classes
    
    commit 87c6efc5ce9c126ae4a781bc04504b83780e3650 upstream.
    
    Shuang reported sch_ets test-case [1] crashing in ets_class_qlen_notify()
    after recent changes from Lion [2]. The problem is: in ets_qdisc_change()
    we purge unused DWRR queues; the value of 'q->nbands' is the new one, and
    the cleanup should be done with the old one. The problem is here since my
    first attempts to fix ets_qdisc_change(), but it surfaced again after the
    recent qdisc len accounting fixes. Fix it purging idle DWRR queues before
    assigning a new value of 'q->nbands', so that all purge operations find a
    consistent configuration:
    
     - old 'q->nbands' because it's needed by ets_class_find()
     - old 'q->nstrict' because it's needed by ets_class_is_strict()
    
     BUG: kernel NULL pointer dereference, address: 0000000000000000
     #PF: supervisor read access in kernel mode
     #PF: error_code(0x0000) - not-present page
     PGD 0 P4D 0
     Oops: Oops: 0000 [#1] SMP NOPTI
     CPU: 62 UID: 0 PID: 39457 Comm: tc Kdump: loaded Not tainted 6.12.0-116.el10.x86_64 #1 PREEMPT(voluntary)
     Hardware name: Dell Inc. PowerEdge R640/06DKY5, BIOS 2.12.2 07/09/2021
     RIP: 0010:__list_del_entry_valid_or_report+0x4/0x80
     Code: ff 4c 39 c7 0f 84 39 19 8e ff b8 01 00 00 00 c3 cc cc cc cc 66 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa <48> 8b 17 48 8b 4f 08 48 85 d2 0f 84 56 19 8e ff 48 85 c9 0f 84 ab
     RSP: 0018:ffffba186009f400 EFLAGS: 00010202
     RAX: 00000000000000d6 RBX: 0000000000000000 RCX: 0000000000000004
     RDX: ffff9f0fa29b69c0 RSI: 0000000000000000 RDI: 0000000000000000
     RBP: ffffffffc12c2400 R08: 0000000000000008 R09: 0000000000000004
     R10: ffffffffffffffff R11: 0000000000000004 R12: 0000000000000000
     R13: ffff9f0f8cfe0000 R14: 0000000000100005 R15: 0000000000000000
     FS:  00007f2154f37480(0000) GS:ffff9f269c1c0000(0000) knlGS:0000000000000000
     CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
     CR2: 0000000000000000 CR3: 00000001530be001 CR4: 00000000007726f0
     DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
     DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
     PKRU: 55555554
     Call Trace:
      <TASK>
      ets_class_qlen_notify+0x65/0x90 [sch_ets]
      qdisc_tree_reduce_backlog+0x74/0x110
      ets_qdisc_change+0x630/0xa40 [sch_ets]
      __tc_modify_qdisc.constprop.0+0x216/0x7f0
      tc_modify_qdisc+0x7c/0x120
      rtnetlink_rcv_msg+0x145/0x3f0
      netlink_rcv_skb+0x53/0x100
      netlink_unicast+0x245/0x390
      netlink_sendmsg+0x21b/0x470
      ____sys_sendmsg+0x39d/0x3d0
      ___sys_sendmsg+0x9a/0xe0
      __sys_sendmsg+0x7a/0xd0
      do_syscall_64+0x7d/0x160
      entry_SYSCALL_64_after_hwframe+0x76/0x7e
     RIP: 0033:0x7f2155114084
     Code: 89 02 b8 ff ff ff ff eb bb 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 f3 0f 1e fa 80 3d 25 f0 0c 00 00 74 13 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 89 54 24 1c 48 89
     RSP: 002b:00007fff1fd7a988 EFLAGS: 00000202 ORIG_RAX: 000000000000002e
     RAX: ffffffffffffffda RBX: 0000560ec063e5e0 RCX: 00007f2155114084
     RDX: 0000000000000000 RSI: 00007fff1fd7a9f0 RDI: 0000000000000003
     RBP: 00007fff1fd7aa60 R08: 0000000000000010 R09: 000000000000003f
     R10: 0000560ee9b3a010 R11: 0000000000000202 R12: 00007fff1fd7aae0
     R13: 000000006891ccde R14: 0000560ec063e5e0 R15: 00007fff1fd7aad0
      </TASK>
    
     [1] https://lore.kernel.org/netdev/e08c7f4a6882f260011909a868311c6e9b54f3e4.1639153474.git.dcaratti@redhat.com/
     [2] https://lore.kernel.org/netdev/d912cbd7-193b-4269-9857-525bee8bbb6a@gmail.com/
    
    Cc: stable@vger.kernel.org
    Fixes: 103406b38c60 ("net/sched: Always pass notifications when child class becomes empty")
    Fixes: c062f2a0b04d ("net/sched: sch_ets: don't remove idle classes from the round-robin list")
    Fixes: dcc68b4d8084 ("net: sch_ets: Add a new Qdisc")
    Reported-by: Li Shuang <shuali@redhat.com>
    Closes: https://issues.redhat.com/browse/RHEL-108026
    Reviewed-by: Petr Machata <petrm@nvidia.com>
    Co-developed-by: Ivan Vecera <ivecera@redhat.com>
    Signed-off-by: Ivan Vecera <ivecera@redhat.com>
    Signed-off-by: Davide Caratti <dcaratti@redhat.com>
    Link: https://patch.msgid.link/7928ff6d17db47a2ae7cc205c44777b1f1950545.1755016081.git.dcaratti@redhat.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: ag71xx: Add missing check after DMA map [+ + +]

Author: Thomas Fourier <fourier.thomas@gmail.com>
Date:   Wed Jul 16 11:57:25 2025 +0200

    net: ag71xx: Add missing check after DMA map
    
    [ Upstream commit 96a1e15e60216b52da0e6da5336b6d7f5b0188b0 ]
    
    The DMA map functions can fail and should be tested for errors.
    
    Signed-off-by: Thomas Fourier <fourier.thomas@gmail.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250716095733.37452-3-fourier.thomas@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: atlantic: add set_power to fw_ops for atl2 to fix wol [+ + +]

Author: Eric Work <work.eric@gmail.com>
Date:   Sat Jun 28 22:15:28 2025 -0700

    net: atlantic: add set_power to fw_ops for atl2 to fix wol
    
    [ Upstream commit fad9cf216597a71936ac87143d1618fbbcf97cbe ]
    
    Aquantia AQC113(C) using ATL2FW doesn't properly prepare the NIC for
    enabling wake-on-lan. The FW operation `set_power` was only implemented
    for `hw_atl` and not `hw_atl2`. Implement the `set_power` functionality
    for `hw_atl2`.
    
    Tested with both AQC113 and AQC113C devices. Confirmed you can shutdown
    the system and wake from S5 using magic packets. NIC was previously
    powered off when entering S5. If the NIC was configured for WOL by the
    Windows driver, loading the atlantic driver would disable WOL.
    
    Partially cherry-picks changes from commit,
    https://github.com/Aquantia/AQtion/commit/37bd5cc
    
    Attributing original authors from Marvell for the referenced commit.
    
    Closes: https://github.com/Aquantia/AQtion/issues/70
    Co-developed-by: Igor Russkikh <irusskikh@marvell.com>
    Co-developed-by: Mark Starovoitov <mstarovoitov@marvell.com>
    Co-developed-by: Dmitry Bogdanov <dbogdanov@marvell.com>
    Co-developed-by: Pavel Belous <pbelous@marvell.com>
    Co-developed-by: Nikita Danilov <ndanilov@marvell.com>
    Signed-off-by: Eric Work <work.eric@gmail.com>
    Reviewed-by: Igor Russkikh <irusskikh@marvell.com>
    Link: https://patch.msgid.link/20250629051535.5172-1-work.eric@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dpaa: fix device leak when querying time stamp info [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Fri Jul 25 19:12:09 2025 +0200

    net: dpaa: fix device leak when querying time stamp info
    
    commit 3fa840230f534385b34a4f39c8dd313fbe723f05 upstream.
    
    Make sure to drop the reference to the ptp device taken by
    of_find_device_by_node() when querying the time stamping capabilities.
    
    Note that holding a reference to the ptp device does not prevent its
    driver data from going away.
    
    Fixes: 17ae0b0ee9db ("dpaa_eth: add the get_ts_info interface for ethtool")
    Cc: stable@vger.kernel.org      # 4.19
    Cc: Yangbo Lu <yangbo.lu@nxp.com>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250725171213.880-2-johan@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: dsa: b53: ensure BCM5325 PHYs are enabled [+ + +]

Author: Álvaro Fernández Rojas <noltari@gmail.com>
Date:   Sat Jun 14 10:00:00 2025 +0200

    net: dsa: b53: ensure BCM5325 PHYs are enabled
    
    [ Upstream commit 966a83df36c6f27476ac3501771422e7852098bc ]
    
    According to the datasheet, BCM5325 uses B53_PD_MODE_CTRL_25 register to
    disable clocking to individual PHYs.
    Only ports 1-4 can be enabled or disabled and the datasheet is explicit
    about not toggling BIT(0) since it disables the PLL power and the switch.
    
    Signed-off-by: Álvaro Fernández Rojas <noltari@gmail.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250614080000.1884236-15-noltari@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: fix b53_imp_vlan_setup for BCM5325 [+ + +]

Author: Álvaro Fernández Rojas <noltari@gmail.com>
Date:   Sat Jun 14 09:59:59 2025 +0200

    net: dsa: b53: fix b53_imp_vlan_setup for BCM5325
    
    [ Upstream commit c00df1018791185ea398f78af415a2a0aaa0c79c ]
    
    CPU port should be B53_CPU_PORT instead of B53_CPU_PORT_25 for
    B53_PVLAN_PORT_MASK register.
    
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Signed-off-by: Álvaro Fernández Rojas <noltari@gmail.com>
    Link: https://patch.msgid.link/20250614080000.1884236-14-noltari@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: fix IP_MULTICAST_CTRL on BCM5325 [+ + +]

Author: Álvaro Fernández Rojas <noltari@gmail.com>
Date:   Sat Jun 14 09:59:54 2025 +0200

    net: dsa: b53: fix IP_MULTICAST_CTRL on BCM5325
    
    [ Upstream commit 044d5ce2788b165798bfd173548e61bf7b6baf4d ]
    
    BCM5325 doesn't implement B53_UC_FWD_EN, B53_MC_FWD_EN or B53_IPMC_FWD_EN.
    
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Signed-off-by: Álvaro Fernández Rojas <noltari@gmail.com>
    Link: https://patch.msgid.link/20250614080000.1884236-9-noltari@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: prevent DIS_LEARNING access on BCM5325 [+ + +]

Author: Álvaro Fernández Rojas <noltari@gmail.com>
Date:   Sat Jun 14 09:59:55 2025 +0200

    net: dsa: b53: prevent DIS_LEARNING access on BCM5325
    
    [ Upstream commit 800728abd9f83bda4de62a30ce62a8b41c242020 ]
    
    BCM5325 doesn't implement DIS_LEARNING register so we should avoid reading
    or writing it.
    
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Signed-off-by: Álvaro Fernández Rojas <noltari@gmail.com>
    Link: https://patch.msgid.link/20250614080000.1884236-10-noltari@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: prevent GMII_PORT_OVERRIDE_CTRL access on BCM5325 [+ + +]

Author: Álvaro Fernández Rojas <noltari@gmail.com>
Date:   Sat Jun 14 09:59:57 2025 +0200

    net: dsa: b53: prevent GMII_PORT_OVERRIDE_CTRL access on BCM5325
    
    [ Upstream commit 37883bbc45a8555d6eca88d3a9730504d2dac86c ]
    
    BCM5325 doesn't implement GMII_PORT_OVERRIDE_CTRL register so we should
    avoid reading or writing it.
    PORT_OVERRIDE_RX_FLOW and PORT_OVERRIDE_TX_FLOW aren't defined on BCM5325
    and we should use PORT_OVERRIDE_LP_FLOW_25 instead.
    
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Signed-off-by: Álvaro Fernández Rojas <noltari@gmail.com>
    Link: https://patch.msgid.link/20250614080000.1884236-12-noltari@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: b53: prevent SWITCH_CTRL access on BCM5325 [+ + +]

Author: Álvaro Fernández Rojas <noltari@gmail.com>
Date:   Sat Jun 14 09:59:53 2025 +0200

    net: dsa: b53: prevent SWITCH_CTRL access on BCM5325
    
    [ Upstream commit 22ccaaca43440e90a3b68d2183045b42247dc4be ]
    
    BCM5325 doesn't implement SWITCH_CTRL register so we should avoid reading
    or writing it.
    
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Signed-off-by: Álvaro Fernández Rojas <noltari@gmail.com>
    Link: https://patch.msgid.link/20250614080000.1884236-8-noltari@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: enetc: fix device and OF node leak at probe [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Fri Jul 25 19:12:10 2025 +0200

    net: enetc: fix device and OF node leak at probe
    
    commit 70458f8a6b44daf3ad39f0d9b6d1097c8a7780ed upstream.
    
    Make sure to drop the references to the IERB OF node and platform device
    taken by of_parse_phandle() and of_find_device_by_node() during probe.
    
    Fixes: e7d48e5fbf30 ("net: enetc: add a mini driver for the Integrated Endpoint Register Block")
    Cc: stable@vger.kernel.org      # 5.13
    Cc: Vladimir Oltean <vladimir.oltean@nxp.com>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250725171213.880-3-johan@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: enetc: separate 64-bit counters from enetc_port_counters [+ + +]

Author: Wei Fang <wei.fang@nxp.com>
Date:   Fri Jun 27 10:11:07 2025 +0800

    net: enetc: separate 64-bit counters from enetc_port_counters
    
    [ Upstream commit 9fe5f7145ad746e1b8e7522b8a955f642ff1b404 ]
    
    Some counters in enetc_port_counters are 32-bit registers, and some are
    64-bit registers. But in the current driver, they are all read through
    enetc_port_rd(), which can only read a 32-bit value. Therefore, separate
    64-bit counters (enetc_pm_counters) from enetc_port_counters and use
    enetc_port_rd64() to read the 64-bit statistics.
    
    Signed-off-by: Wei Fang <wei.fang@nxp.com>
    Reviewed-by: Claudiu Manoil <claudiu.manoil@nxp.com>
    Reviewed-by: Frank Li <Frank.Li@nxp.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250627021108.3359642-3-wei.fang@nxp.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: fec: allow disable coalescing [+ + +]

Author: Jonas Rebmann <jre@pengutronix.de>
Date:   Thu Jun 26 15:44:02 2025 +0200

    net: fec: allow disable coalescing
    
    [ Upstream commit b7ad21258f9e9a7f58b19595d5ceed2cde3bed68 ]
    
    In the current implementation, IP coalescing is always enabled and
    cannot be disabled.
    
    As setting maximum frames to 0 or 1, or setting delay to zero implies
    immediate delivery of single packets/IRQs, disable coalescing in
    hardware in these cases.
    
    This also guarantees that coalescing is never enabled with ICFT or ICTT
    set to zero, a configuration that could lead to unpredictable behaviour
    according to i.MX8MP reference manual.
    
    Signed-off-by: Jonas Rebmann <jre@pengutronix.de>
    Reviewed-by: Wei Fang <wei.fang@nxp.com>
    Link: https://patch.msgid.link/20250626-fec_deactivate_coalescing-v2-1-0b217f2e80da@pengutronix.de
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ftgmac100: fix potential NULL pointer access in ftgmac100_phy_disconnect [+ + +]

Author: Heiner Kallweit <hkallweit1@gmail.com>
Date:   Wed Jul 30 22:23:23 2025 +0200

    net: ftgmac100: fix potential NULL pointer access in ftgmac100_phy_disconnect
    
    commit e88fbc30dda1cb7438515303704ceddb3ade4ecd upstream.
    
    After the call to phy_disconnect() netdev->phydev is reset to NULL.
    So fixed_phy_unregister() would be called with a NULL pointer as argument.
    Therefore cache the phy_device before this call.
    
    Fixes: e24a6c874601 ("net: ftgmac100: Get link speed and duplex for NC-SI")
    Cc: stable@vger.kernel.org
    Signed-off-by: Heiner Kallweit <hkallweit1@gmail.com>
    Reviewed-by: Dawid Osuchowski <dawid.osuchowski@linux.intel.com>
    Link: https://patch.msgid.link/2b80a77a-06db-4dd7-85dc-3a8e0de55a1d@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: gianfar: fix device leak when querying time stamp info [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Fri Jul 25 19:12:11 2025 +0200

    net: gianfar: fix device leak when querying time stamp info
    
    commit da717540acd34e5056e3fa35791d50f6b3303f55 upstream.
    
    Make sure to drop the reference to the ptp device taken by
    of_find_device_by_node() when querying the time stamping capabilities.
    
    Note that holding a reference to the ptp device does not prevent its
    driver data from going away.
    
    Fixes: 7349a74ea75c ("net: ethernet: gianfar_ethtool: get phc index through drvdata")
    Cc: stable@vger.kernel.org      # 4.18
    Cc: Yangbo Lu <yangbo.lu@nxp.com>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250725171213.880-4-johan@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: hibmcge: fix rtnl deadlock issue [+ + +]

Author: Jijie Shao <shaojijie@huawei.com>
Date:   Wed Aug 6 18:27:56 2025 +0800

    net: hibmcge: fix rtnl deadlock issue
    
    [ Upstream commit c875503a9b9082928d7d3fc60b5400d16fbfae4e ]
    
    Currently, the hibmcge netdev acquires the rtnl_lock in
    pci_error_handlers.reset_prepare() and releases it in
    pci_error_handlers.reset_done().
    
    However, in the PCI framework:
    pci_reset_bus - __pci_reset_slot - pci_slot_save_and_disable_locked -
     pci_dev_save_and_disable - err_handler->reset_prepare(dev);
    
    In pci_slot_save_and_disable_locked():
            list_for_each_entry(dev, &slot->bus->devices, bus_list) {
                    if (!dev->slot || dev->slot!= slot)
                            continue;
                    pci_dev_save_and_disable(dev);
                    if (dev->subordinate)
                            pci_bus_save_and_disable_locked(dev->subordinate);
            }
    
    This will iterate through all devices under the current bus and execute
    err_handler->reset_prepare(), causing two devices of the hibmcge driver
    to sequentially request the rtnl_lock, leading to a deadlock.
    
    Since the driver now executes netif_device_detach()
    before the reset process, it will not concurrently with
    other netdev APIs, so there is no need to hold the rtnl_lock now.
    
    Therefore, this patch removes the rtnl_lock during the reset process and
    adjusts the position of HBG_NIC_STATE_RESETTING to ensure
    that multiple resets are not executed concurrently.
    
    Fixes: 3f5a61f6d504f ("net: hibmcge: Add reset supported in this module")
    Signed-off-by: Jijie Shao <shaojijie@huawei.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: hibmcge: fix the division by zero issue [+ + +]

Author: Jijie Shao <shaojijie@huawei.com>
Date:   Wed Aug 6 18:27:57 2025 +0800

    net: hibmcge: fix the division by zero issue
    
    [ Upstream commit 7004b26f0b64331143eb0b312e77a357a11427ce ]
    
    When the network port is down, the queue is released, and ring->len is 0.
    In debugfs, hbg_get_queue_used_num() will be called,
    which may lead to a division by zero issue.
    
    This patch adds a check, if ring->len is 0,
    hbg_get_queue_used_num() directly returns 0.
    
    Fixes: 40735e7543f9 ("net: hibmcge: Implement .ndo_start_xmit function")
    Signed-off-by: Jijie Shao <shaojijie@huawei.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: hibmcge: fix the np_link_fail error reporting issue [+ + +]

Author: Jijie Shao <shaojijie@huawei.com>
Date:   Wed Aug 6 18:27:58 2025 +0800

    net: hibmcge: fix the np_link_fail error reporting issue
    
    [ Upstream commit 62c50180ffda01468e640ac14925503796f255e2 ]
    
    Currently, after modifying device port mode, the np_link_ok state
    is immediately checked. At this point, the device may not yet ready,
    leading to the querying of an intermediate state.
    
    This patch will poll to check if np_link is ok after
    modifying device port mode, and only report np_link_fail upon timeout.
    
    Fixes: e0306637e85d ("net: hibmcge: Add support for mac link exception handling feature")
    Signed-off-by: Jijie Shao <shaojijie@huawei.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ieee8021q: fix insufficient table-size assertion [+ + +]

Author: RubenKelevra <rubenkelevra@gmail.com>
Date:   Thu Jun 26 22:59:07 2025 +0200

    net: ieee8021q: fix insufficient table-size assertion
    
    [ Upstream commit 21deb2d966920f0d4dd098ca6c3a55efbc0b2f23 ]
    
    _Static_assert(ARRAY_SIZE(map) != IEEE8021Q_TT_MAX - 1) rejects only a
    length of 7 and allows any other mismatch. Replace it with a strict
    equality test via a helper macro so that every mapping table must have
    exactly IEEE8021Q_TT_MAX (8) entries.
    
    Signed-off-by: RubenKelevra <rubenkelevra@gmail.com>
    Link: https://patch.msgid.link/20250626205907.1566384-1-rubenkelevra@gmail.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ipv4: fix incorrect MTU in broadcast routes [+ + +]

Author: Oscar Maes <oscmaes92@gmail.com>
Date:   Thu Jul 10 16:27:13 2025 +0200

    net: ipv4: fix incorrect MTU in broadcast routes
    
    [ Upstream commit 9e30ecf23b1b8f091f7d08b27968dea83aae7908 ]
    
    Currently, __mkroute_output overrules the MTU value configured for
    broadcast routes.
    
    This buggy behaviour can be reproduced with:
    
    ip link set dev eth1 mtu 9000
    ip route del broadcast 192.168.0.255 dev eth1 proto kernel scope link src 192.168.0.2
    ip route add broadcast 192.168.0.255 dev eth1 proto kernel scope link src 192.168.0.2 mtu 1500
    
    The maximum packet size should be 1500, but it is actually 8000:
    
    ping -b 192.168.0.255 -s 8000
    
    Fix __mkroute_output to allow MTU values to be configured for
    for broadcast routes (to support a mixed-MTU local-area-network).
    
    Signed-off-by: Oscar Maes <oscmaes92@gmail.com>
    Link: https://patch.msgid.link/20250710142714.12986-1-oscmaes92@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: kcm: Fix race condition in kcm_unattach() [+ + +]

Author: Sven Stegemann <sven@stegemann.de>
Date:   Tue Aug 12 21:18:03 2025 +0200

    net: kcm: Fix race condition in kcm_unattach()
    
    [ Upstream commit 52565a935213cd6a8662ddb8efe5b4219343a25d ]
    
    syzbot found a race condition when kcm_unattach(psock)
    and kcm_release(kcm) are executed at the same time.
    
    kcm_unattach() is missing a check of the flag
    kcm->tx_stopped before calling queue_work().
    
    If the kcm has a reserved psock, kcm_unattach() might get executed
    between cancel_work_sync() and unreserve_psock() in kcm_release(),
    requeuing kcm->tx_work right before kcm gets freed in kcm_done().
    
    Remove kcm->tx_stopped and replace it by the less
    error-prone disable_work_sync().
    
    Fixes: ab7ac4eb9832 ("kcm: Kernel Connection Multiplexor module")
    Reported-by: syzbot+e62c9db591c30e174662@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=e62c9db591c30e174662
    Reported-by: syzbot+d199b52665b6c3069b94@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=d199b52665b6c3069b94
    Reported-by: syzbot+be6b1fdfeae512726b4e@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=be6b1fdfeae512726b4e
    Signed-off-by: Sven Stegemann <sven@stegemann.de>
    Link: https://patch.msgid.link/20250812191810.27777-1-sven@stegemann.de
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: lapbether: ignore ops-locked netdevs [+ + +]

Author: Stanislav Fomichev <sdf@fomichev.me>
Date:   Wed Aug 6 14:37:25 2025 -0700

    net: lapbether: ignore ops-locked netdevs
    
    [ Upstream commit 53898ebabe843bfa7baea9dae152797d5d0563c9 ]
    
    Syzkaller managed to trigger lock dependency in xsk_notify via
    register_netdevice. As discussed in [0], using register_netdevice
    in the notifiers is problematic so skip adding lapbeth for ops-locked
    devices.
    
           xsk_notifier+0xa4/0x280 net/xdp/xsk.c:1645
           notifier_call_chain+0xbc/0x410 kernel/notifier.c:85
           call_netdevice_notifiers_info+0xbe/0x140 net/core/dev.c:2230
           call_netdevice_notifiers_extack net/core/dev.c:2268 [inline]
           call_netdevice_notifiers net/core/dev.c:2282 [inline]
           unregister_netdevice_many_notify+0xf9d/0x2700 net/core/dev.c:12077
           unregister_netdevice_many net/core/dev.c:12140 [inline]
           unregister_netdevice_queue+0x305/0x3f0 net/core/dev.c:11984
           register_netdevice+0x18f1/0x2270 net/core/dev.c:11149
           lapbeth_new_device drivers/net/wan/lapbether.c:420 [inline]
           lapbeth_device_event+0x5b1/0xbe0 drivers/net/wan/lapbether.c:462
           notifier_call_chain+0xbc/0x410 kernel/notifier.c:85
           call_netdevice_notifiers_info+0xbe/0x140 net/core/dev.c:2230
           call_netdevice_notifiers_extack net/core/dev.c:2268 [inline]
           call_netdevice_notifiers net/core/dev.c:2282 [inline]
           __dev_notify_flags+0x12c/0x2e0 net/core/dev.c:9497
           netif_change_flags+0x108/0x160 net/core/dev.c:9526
           dev_change_flags+0xba/0x250 net/core/dev_api.c:68
           devinet_ioctl+0x11d5/0x1f50 net/ipv4/devinet.c:1200
           inet_ioctl+0x3a7/0x3f0 net/ipv4/af_inet.c:1001
    
    0: https://lore.kernel.org/netdev/20250625140357.6203d0af@kernel.org/
    Fixes: 4c975fd70002 ("net: hold instance lock during NETDEV_REGISTER/UP")
    Suggested-by: Jakub Kicinski <kuba@kernel.org>
    Reported-by: syzbot+e67ea9c235b13b4f0020@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=e67ea9c235b13b4f0020
    Signed-off-by: Stanislav Fomichev <sdf@fomichev.me>
    Link: https://patch.msgid.link/20250806213726.1383379-1-sdf@fomichev.me
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mctp: Prevent duplicate binds [+ + +]

Author: Matt Johnston <matt@codeconstruct.com.au>
Date:   Thu Jul 10 16:55:55 2025 +0800

    net: mctp: Prevent duplicate binds
    
    [ Upstream commit 3954502377ec05a1b37e2dc9bef0bacd4bbd71b2 ]
    
    Disallow bind() calls that have the same arguments as existing bound
    sockets.  Previously multiple sockets could bind() to the same
    type/local address, with an arbitrary socket receiving matched messages.
    
    This is only a partial fix, a future commit will define precedence order
    for MCTP_ADDR_ANY versus specific EID bind(), which are allowed to exist
    together.
    
    Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>
    Link: https://patch.msgid.link/20250710-mctp-bind-v4-2-8ec2f6460c56@codeconstruct.com.au
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mdiobus: release reset_gpio in mdiobus_unregister_device() [+ + +]

Author: Buday Csaba <buday.csaba@prolan.hu>
Date:   Thu Aug 7 15:54:49 2025 +0200

    net: mdiobus: release reset_gpio in mdiobus_unregister_device()
    
    [ Upstream commit 8ea25274ebaf2f6be8be374633b2ed8348ec0e70 ]
    
    reset_gpio is claimed in mdiobus_register_device(), but it is not
    released in mdiobus_unregister_device(). It is instead only
    released when the whole MDIO bus is unregistered.
    When a device uses the reset_gpio property, it becomes impossible
    to unregister it and register it again, because the GPIO remains
    claimed.
    This patch resolves that issue.
    
    Fixes: bafbdd527d56 ("phylib: Add device reset GPIO support") # see notes
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Cc: Csókás Bence <csokas.bence@prolan.hu>
    [ csokas.bence: Resolve rebase conflict and clarify msg ]
    Signed-off-by: Buday Csaba <buday.csaba@prolan.hu>
    Link: https://patch.msgid.link/20250807135449.254254-2-csokas.bence@prolan.hu
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mtk_eth_soc: fix device leak at probe [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Fri Jul 25 19:12:12 2025 +0200

    net: mtk_eth_soc: fix device leak at probe
    
    commit 3e13274ca8750823e8b68181bdf185d238febe0d upstream.
    
    The reference count to the WED devices has already been incremented when
    looking them up using of_find_device_by_node() so drop the bogus
    additional reference taken during probe.
    
    Fixes: 804775dfc288 ("net: ethernet: mtk_eth_soc: add support for Wireless Ethernet Dispatch (WED)")
    Cc: stable@vger.kernel.org      # 5.19
    Cc: Felix Fietkau <nbd@nbd.name>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250725171213.880-5-johan@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: ncsi: Fix buffer overflow in fetching version id [+ + +]

Author: Hari Kalavakunta <kalavakunta.hari.prasad@gmail.com>
Date:   Tue Jun 10 12:33:38 2025 -0700

    net: ncsi: Fix buffer overflow in fetching version id
    
    [ Upstream commit 8e16170ae972c7fed132bc928914a2ffb94690fc ]
    
    In NC-SI spec v1.2 section 8.4.44.2, the firmware name doesn't
    need to be null terminated while its size occupies the full size
    of the field. Fix the buffer overflow issue by adding one
    additional byte for null terminator.
    
    Signed-off-by: Hari Kalavakunta <kalavakunta.hari.prasad@gmail.com>
    Reviewed-by: Paul Fertser <fercerpav@gmail.com>
    Link: https://patch.msgid.link/20250610193338.1368-1-kalavakunta.hari.prasad@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: page_pool: allow enabling recycling late, fix false positive warning [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Mon Aug 4 17:36:54 2025 -0700

    net: page_pool: allow enabling recycling late, fix false positive warning
    
    [ Upstream commit 64fdaa94bfe0cca3a0f4b2dd922486c5f59fe678 ]
    
    Page pool can have pages "directly" (locklessly) recycled to it,
    if the NAPI that owns the page pool is scheduled to run on the same CPU.
    To make this safe we check that the NAPI is disabled while we destroy
    the page pool. In most cases NAPI and page pool lifetimes are tied
    together so this happens naturally.
    
    The queue API expects the following order of calls:
     -> mem_alloc
        alloc new pp
     -> stop
        napi_disable
     -> start
        napi_enable
     -> mem_free
        free old pp
    
    Here we allocate the page pool in ->mem_alloc and free in ->mem_free.
    But the NAPIs are only stopped between ->stop and ->start. We created
    page_pool_disable_direct_recycling() to safely shut down the recycling
    in ->stop. This way the page_pool_destroy() call in ->mem_free doesn't
    have to worry about recycling any more.
    
    Unfortunately, the page_pool_disable_direct_recycling() is not enough
    to deal with failures which necessitate freeing the _new_ page pool.
    If we hit a failure in ->mem_alloc or ->stop the new page pool has
    to be freed while the NAPI is active (assuming driver attaches the
    page pool to an existing NAPI instance and doesn't reallocate NAPIs).
    
    Freeing the new page pool is technically safe because it hasn't been
    used for any packets, yet, so there can be no recycling. But the check
    in napi_assert_will_not_race() has no way of knowing that. We could
    check if page pool is empty but that'd make the check much less likely
    to trigger during development.
    
    Add page_pool_enable_direct_recycling(), pairing with
    page_pool_disable_direct_recycling(). It will allow us to create the new
    page pools in "disabled" state and only enable recycling when we know
    the reconfig operation will not fail.
    
    Coincidentally it will also let us re-enable the recycling for the old
    pool, if the reconfig failed:
    
     -> mem_alloc (new)
     -> stop (old)
        # disables direct recycling for old
     -> start (new)
        # fail!!
     -> start (old)
        # go back to old pp but direct recycling is lost :(
     -> mem_free (new)
    
    The new helper is idempotent to make the life easier for drivers,
    which can operate in HDS mode and support zero-copy Rx.
    The driver can call the helper twice whether there are two pools
    or it has multiple references to a single pool.
    
    Fixes: 40eca00ae605 ("bnxt_en: unlink page pool when stopping Rx queue")
    Tested-by: David Wei <dw@davidwei.uk>
    Link: https://patch.msgid.link/20250805003654.2944974-1-kuba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: pcs: xpcs: mask readl() return value to 16 bits [+ + +]

Author: Jack Ping CHNG <jchng@maxlinear.com>
Date:   Wed Jul 16 11:03:49 2025 +0800

    net: pcs: xpcs: mask readl() return value to 16 bits
    
    [ Upstream commit 2b0ba7b5b010455c4e43ab557860f8b1089e7424 ]
    
    readl() returns 32-bit value but Clause 22/45 registers are 16-bit wide.
    Masking with 0xFFFF avoids using garbage upper bits.
    
    Signed-off-by: Jack Ping CHNG <jchng@maxlinear.com>
    Reviewed-by: Maxime Chevallier <maxime.chevallier@bootlin.com>
    Link: https://patch.msgid.link/20250716030349.3796806-1-jchng@maxlinear.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: air_en8811h: Introduce resume/suspend and clk_restore_context to ensure correct CKO settings after network interface reinitialization. [+ + +]

Author: Lucien.Jheng <lucienzx159@gmail.com>
Date:   Mon Jun 30 23:41:47 2025 +0800

    net: phy: air_en8811h: Introduce resume/suspend and clk_restore_context to ensure correct CKO settings after network interface reinitialization.
    
    [ Upstream commit 6b9c9def95cb402374730d51a1f44927f467b774 ]
    
    If the user reinitializes the network interface, the PHY will reinitialize,
    and the CKO settings will revert to their initial configuration(be enabled).
    To prevent CKO from being re-enabled,
    en8811h_clk_restore_context and en8811h_resume were added
    to ensure the CKO settings remain correct.
    
    Signed-off-by: Lucien.Jheng <lucienzx159@gmail.com>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20250630154147.80388-1-lucienzx159@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: bcm54811: PHY initialization [+ + +]

Author: Kamil Horák - 2N <kamilh@axis.com>
Date:   Tue Jul 8 11:01:40 2025 +0200

    net: phy: bcm54811: PHY initialization
    
    [ Upstream commit 3117a11fff5af9e74f4946f07cb3ca083cbbdf4b ]
    
    Reset the bit 12 in PHY's LRE Control register upon initialization.
    According to the datasheet, this bit must be written to zero after
    every device reset.
    
    Signed-off-by: Kamil Horák - 2N <kamilh@axis.com>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20250708090140.61355-5-kamilh@axis.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: micrel: Add ksz9131_resume() [+ + +]

Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Fri Jul 11 06:40:21 2025 +0100

    net: phy: micrel: Add ksz9131_resume()
    
    [ Upstream commit f25a7eaa897f21396e99f90809af82ca553c9d14 ]
    
    The Renesas RZ/G3E SMARC EVK uses KSZ9131RNXC phy. On deep power state,
    PHY loses the power and on wakeup the rgmii delays are not reconfigured
    causing it to fail.
    
    Replace the callback kszphy_resume()->ksz9131_resume() for reconfiguring
    the rgmii_delay when it exits from PM suspend state.
    
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20250711054029.48536-1-biju.das.jz@bp.renesas.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: micrel: fix KSZ8081/KSZ8091 cable test [+ + +]

Author: Florian Larysch <fl@n621.de>
Date:   Thu Jul 24 00:20:42 2025 +0200

    net: phy: micrel: fix KSZ8081/KSZ8091 cable test
    
    commit 49db61c27c4bbd24364086dc0892bd3e14c1502e upstream.
    
    Commit 21b688dabecb ("net: phy: micrel: Cable Diag feature for lan8814
    phy") introduced cable_test support for the LAN8814 that reuses parts of
    the KSZ886x logic and introduced the cable_diag_reg and pair_mask
    parameters to account for differences between those chips.
    
    However, it did not update the ksz8081_type struct, so those members are
    now 0, causing no pairs to be tested in ksz886x_cable_test_get_status
    and ksz886x_cable_test_wait_for_completion to poll the wrong register
    for the affected PHYs (Basic Control/Reset, which is 0 in normal
    operation) and exit immediately.
    
    Fix this by setting both struct members accordingly.
    
    Fixes: 21b688dabecb ("net: phy: micrel: Cable Diag feature for lan8814 phy")
    Cc: stable@vger.kernel.org
    Signed-off-by: Florian Larysch <fl@n621.de>
    Link: https://patch.msgid.link/20250723222250.13960-1-fl@n621.de
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: phy: nxp-c45-tja11xx: fix the PHY ID mismatch issue when using C45 [+ + +]

Author: Clark Wang <xiaoning.wang@nxp.com>
Date:   Thu Aug 7 12:08:32 2025 +0800

    net: phy: nxp-c45-tja11xx: fix the PHY ID mismatch issue when using C45
    
    [ Upstream commit 8ee90742cf29427683294a6a80f1e2b7f4af1cff ]
    
    TJA1103/04/20/21 support both C22 and C45 accessing methods.
    
    The TJA11xx driver has implemented the match_phy_device() API.
    However, it does not handle the C45 ID. If C45 was used to access
    TJA11xx, match_phy_device() would always return false due to
    phydev->phy_id only used by C22 being empty, resulting in the
    generic phy driver being used for TJA11xx PHYs.
    
    Therefore, check phydev->c45_ids.device_ids[MDIO_MMD_PMAPMD] when
    using C45.
    
    Fixes: 1b76b2497aba ("net: phy: nxp-c45-tja11xx: simplify .match_phy_device OP")
    Signed-off-by: Clark Wang <xiaoning.wang@nxp.com>
    Link: https://patch.msgid.link/20250807040832.2455306-1-xiaoning.wang@nxp.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: realtek: add error handling to rtl8211f_get_wol [+ + +]

Author: Daniel Braunwarth <daniel.braunwarth@kuka.com>
Date:   Tue Jun 24 16:17:33 2025 +0200

    net: phy: realtek: add error handling to rtl8211f_get_wol
    
    [ Upstream commit a9b24b3583ae1da7dbda031f141264f2da260219 ]
    
    We should check if the WOL settings was successfully read from the PHY.
    
    In case this fails we cannot just use the error code and proceed.
    
    Signed-off-by: Daniel Braunwarth <daniel.braunwarth@kuka.com>
    Reported-by: Jon Hunter <jonathanh@nvidia.com>
    Closes: https://lore.kernel.org/baaa083b-9a69-460f-ab35-2a7cb3246ffd@nvidia.com
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20250624-realtek_fixes-v1-1-02a0b7c369bc@kuka.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: smsc: add proper reset flags for LAN8710A [+ + +]

Author: Buday Csaba <buday.csaba@prolan.hu>
Date:   Mon Jul 28 17:29:16 2025 +0200

    net: phy: smsc: add proper reset flags for LAN8710A
    
    [ Upstream commit 57ec5a8735dc5dccd1ee68afdb1114956a3fce0d ]
    
    According to the LAN8710A datasheet (Rev. B, section 3.8.5.1), a hardware
    reset is required after power-on, and the reference clock (REF_CLK) must be
    established before asserting reset.
    
    Signed-off-by: Buday Csaba <buday.csaba@prolan.hu>
    Cc: Csókás Bence <csokas.bence@prolan.hu>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20250728152916.46249-2-csokas.bence@prolan.hu
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: stmmac: thead: Get and enable APB clock on initialization [+ + +]

Author: Yao Zi <ziyao@disroot.org>
Date:   Fri Aug 8 09:36:55 2025 +0000

    net: stmmac: thead: Get and enable APB clock on initialization
    
    [ Upstream commit 4cc339ce482ba78589a2d5cbe1c84b735d263383 ]
    
    It's necessary to adjust the MAC TX clock when the linkspeed changes,
    but it's noted such adjustment always fails on TH1520 SoC, and reading
    back from APB glue registers that control clock generation results in
    garbage, causing broken link.
    
    With some testing, it's found a clock must be ungated for access to APB
    glue registers. Without any consumer, the clock is automatically
    disabled during late kernel startup. Let's get and enable it if it's
    described in devicetree.
    
    For backward compatibility with older devicetrees, probing won't fail if
    the APB clock isn't found. In this case, we emit a warning since the
    link will break if the speed changes.
    
    Fixes: 33a1a01e3afa ("net: stmmac: Add glue layer for T-HEAD TH1520 SoC")
    Signed-off-by: Yao Zi <ziyao@disroot.org>
    Tested-by: Drew Fustini <fustini@kernel.org>
    Reviewed-by: Drew Fustini <fustini@kernel.org>
    Link: https://patch.msgid.link/20250808093655.48074-4-ziyao@disroot.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: thunderbolt: Enable end-to-end flow control also in transmit [+ + +]

Author: zhangjianrong <zhangjianrong5@huawei.com>
Date:   Sat Jun 28 17:38:13 2025 +0800

    net: thunderbolt: Enable end-to-end flow control also in transmit
    
    [ Upstream commit a8065af3346ebd7c76ebc113451fb3ba94cf7769 ]
    
    According to USB4 specification, if E2E flow control is disabled for
    the Transmit Descriptor Ring, the Host Interface Adapter Layer shall
    not require any credits to be available before transmitting a Tunneled
    Packet from this Transmit Descriptor Ring, so e2e flow control should
    be enabled in both directions.
    
    Acked-by: Mika Westerberg <mika.westerberg@linux.intel.com>
    Link: https://lore.kernel.org/20250624153805.GC2824380@black.fi.intel.com
    Signed-off-by: zhangjianrong <zhangjianrong5@huawei.com>
    Link: https://patch.msgid.link/20250628093813.647005-1-zhangjianrong5@huawei.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: thunderbolt: Fix the parameter passing of tb_xdomain_enable_paths()/tb_xdomain_disable_paths() [+ + +]

Author: zhangjianrong <zhangjianrong5@huawei.com>
Date:   Sat Jun 28 17:49:20 2025 +0800

    net: thunderbolt: Fix the parameter passing of tb_xdomain_enable_paths()/tb_xdomain_disable_paths()
    
    [ Upstream commit 8ec31cb17cd355cea25cdb8496d9b3fbf1321647 ]
    
    According to the description of tb_xdomain_enable_paths(), the third
    parameter represents the transmit ring and the fifth parameter represents
    the receive ring. tb_xdomain_disable_paths() is the same case.
    
    [Jakub] Mika says: it works now because both rings ->hop is the same
    
    Acked-by: Mika Westerberg <mika.westerberg@linux.intel.com>
    Link: https://lore.kernel.org/20250625051149.GD2824380@black.fi.intel.com
    Signed-off-by: zhangjianrong <zhangjianrong5@huawei.com>
    Link: https://patch.msgid.link/20250628094920.656658-1-zhangjianrong5@huawei.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: thunderx: Fix format-truncation warning in bgx_acpi_match_id() [+ + +]

Author: Alok Tiwari <alok.a.tiwari@oracle.com>
Date:   Fri Jul 11 07:05:30 2025 -0700

    net: thunderx: Fix format-truncation warning in bgx_acpi_match_id()
    
    [ Upstream commit 53d20606c40678d425cc03f0978c614dca51f25e ]
    
    The buffer bgx_sel used in snprintf() was too small to safely hold
    the formatted string "BGX%d" for all valid bgx_id values. This caused
    a -Wformat-truncation warning with `Werror` enabled during build.
    
    Increase the buffer size from 5 to 7 and use `sizeof(bgx_sel)` in
    snprintf() to ensure safety and suppress the warning.
    
    Build warning:
      CC      drivers/net/ethernet/cavium/thunder/thunder_bgx.o
      drivers/net/ethernet/cavium/thunder/thunder_bgx.c: In function
    ‘bgx_acpi_match_id’:
      drivers/net/ethernet/cavium/thunder/thunder_bgx.c:1434:27: error: ‘%d’
    directive output may be truncated writing between 1 and 3 bytes into a
    region of size 2 [-Werror=format-truncation=]
        snprintf(bgx_sel, 5, "BGX%d", bgx->bgx_id);
                                 ^~
      drivers/net/ethernet/cavium/thunder/thunder_bgx.c:1434:23: note:
    directive argument in the range [0, 255]
        snprintf(bgx_sel, 5, "BGX%d", bgx->bgx_id);
                             ^~~~~~~
      drivers/net/ethernet/cavium/thunder/thunder_bgx.c:1434:2: note:
    ‘snprintf’ output between 5 and 7 bytes into a destination of size 5
        snprintf(bgx_sel, 5, "BGX%d", bgx->bgx_id);
    
    compiler warning due to insufficient snprintf buffer size.
    
    Signed-off-by: Alok Tiwari <alok.a.tiwari@oracle.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250711140532.2463602-1-alok.a.tiwari@oracle.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ti: icss-iep: fix device and OF node leaks at probe [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Fri Jul 25 19:12:13 2025 +0200

    net: ti: icss-iep: fix device and OF node leaks at probe
    
    commit e05c54974a05ab19658433545d6ced88d9075cf0 upstream.
    
    Make sure to drop the references to the IEP OF node and device taken by
    of_parse_phandle() and of_find_device_by_node() when looking up IEP
    devices during probe.
    
    Drop the bogus additional reference taken on successful lookup so that
    the device is released correctly by icss_iep_put().
    
    Fixes: c1e0230eeaab ("net: ti: icss-iep: Add IEP driver")
    Cc: stable@vger.kernel.org      # 6.6
    Cc: Roger Quadros <rogerq@kernel.org>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250725171213.880-6-johan@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: ti: icss-iep: Fix incorrect type for return value in extts_enable() [+ + +]

Author: Alok Tiwari <alok.a.tiwari@oracle.com>
Date:   Tue Aug 5 07:23:18 2025 -0700

    net: ti: icss-iep: Fix incorrect type for return value in extts_enable()
    
    [ Upstream commit 5f1d1d14db7dabce9c815e7d7cd351f8d58b8585 ]
    
    The variable ret in icss_iep_extts_enable() was incorrectly declared
    as u32, while the function returns int and may return negative error
    codes. This will cause sign extension issues and incorrect error
    propagation. Update ret to be int to fix error handling.
    
    This change corrects the declaration to avoid potential type mismatch.
    
    Fixes: c1e0230eeaab ("net: ti: icss-iep: Add IEP driver")
    Signed-off-by: Alok Tiwari <alok.a.tiwari@oracle.com>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20250805142323.1949406-1-alok.a.tiwari@oracle.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ti: icssg-prueth: Fix emac link speed handling [+ + +]

Author: MD Danish Anwar <danishanwar@ti.com>
Date:   Tue Aug 5 23:08:12 2025 +0530

    net: ti: icssg-prueth: Fix emac link speed handling
    
    [ Upstream commit 06feac15406f4f66f4c0c6ea60b10d44775d4133 ]
    
    When link settings are changed emac->speed is populated by
    emac_adjust_link(). The link speed and other settings are then written into
    the DRAM. However if both ports are brought down after this and brought up
    again or if the operating mode is changed and a firmware reload is needed,
    the DRAM is cleared by icssg_config(). As a result the link settings are
    lost.
    
    Fix this by calling emac_adjust_link() after icssg_config(). This re
    populates the settings in the DRAM after a new firmware load.
    
    Fixes: 9facce84f406 ("net: ti: icssg-prueth: Fix firmware load sequence.")
    Signed-off-by: MD Danish Anwar <danishanwar@ti.com>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Message-ID: <20250805173812.2183161-1-danishanwar@ti.com>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: usb: asix_devices: add phy_mask for ax88772 mdio bus [+ + +]

Author: Xu Yang <xu.yang_2@nxp.com>
Date:   Mon Aug 11 17:29:31 2025 +0800

    net: usb: asix_devices: add phy_mask for ax88772 mdio bus
    
    commit 4faff70959d51078f9ee8372f8cff0d7045e4114 upstream.
    
    Without setting phy_mask for ax88772 mdio bus, current driver may create
    at most 32 mdio phy devices with phy address range from 0x00 ~ 0x1f.
    DLink DUB-E100 H/W Ver B1 is such a device. However, only one main phy
    device will bind to net phy driver. This is creating issue during system
    suspend/resume since phy_polling_mode() in phy_state_machine() will
    directly deference member of phydev->drv for non-main phy devices. Then
    NULL pointer dereference issue will occur. Due to only external phy or
    internal phy is necessary, add phy_mask for ax88772 mdio bus to workarnoud
    the issue.
    
    Closes: https://lore.kernel.org/netdev/20250806082931.3289134-1-xu.yang_2@nxp.com
    Fixes: e532a096be0e ("net: usb: asix: ax88772: add phylib support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Xu Yang <xu.yang_2@nxp.com>
    Tested-by: Oleksij Rempel <o.rempel@pengutronix.de>
    Reviewed-by: Oleksij Rempel <o.rempel@pengutronix.de>
    Link: https://patch.msgid.link/20250811092931.860333-1-xu.yang_2@nxp.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: usb: cdc-ncm: check for filtering capability [+ + +]

Author: Oliver Neukum <oneukum@suse.com>
Date:   Thu Jul 17 14:06:17 2025 +0200

    net: usb: cdc-ncm: check for filtering capability
    
    [ Upstream commit 61c3e8940f2d8b5bfeaeec4bedc2f3e7d873abb3 ]
    
    If the decice does not support filtering, filtering
    must not be used and all packets delivered for the
    upper layers to sort.
    
    Signed-off-by: Oliver Neukum <oneukum@suse.com>
    Link: https://patch.msgid.link/20250717120649.2090929-1-oneukum@suse.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: usb: qmi_wwan: add Telit Cinterion FN990A w/audio composition [+ + +]

Author: Fabio Porcedda <fabio.porcedda@gmail.com>
Date:   Fri Aug 8 15:31:08 2025 +0200

    net: usb: qmi_wwan: add Telit Cinterion FN990A w/audio composition
    
    commit 61aaca8b89fb98be58b8df19f01181bb983cccff upstream.
    
    Add the following Telit Cinterion FN990A w/audio composition:
    
    0x1077: tty (diag) + adb + rmnet + audio + tty (AT/NMEA) + tty (AT) +
    tty (AT) + tty (AT)
    T:  Bus=01 Lev=01 Prnt=01 Port=09 Cnt=01 Dev#=  8 Spd=480 MxCh= 0
    D:  Ver= 2.10 Cls=00(>ifc ) Sub=00 Prot=00 MxPS=64 #Cfgs=  1
    P:  Vendor=1bc7 ProdID=1077 Rev=05.04
    S:  Manufacturer=Telit Wireless Solutions
    S:  Product=FN990
    S:  SerialNumber=67e04c35
    C:  #Ifs=10 Cfg#= 1 Atr=e0 MxPwr=500mA
    I:  If#= 0 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option
    E:  Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    I:  If#= 1 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=42 Prot=01 Driver=(none)
    E:  Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=82(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    I:  If#= 2 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=50 Driver=qmi_wwan
    E:  Ad=0f(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=83(I) Atr=03(Int.) MxPS=   8 Ivl=32ms
    E:  Ad=8e(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    I:  If#= 3 Alt= 0 #EPs= 0 Cls=01(audio) Sub=01 Prot=20 Driver=snd-usb-audio
    I:  If#= 4 Alt= 1 #EPs= 1 Cls=01(audio) Sub=02 Prot=20 Driver=snd-usb-audio
    E:  Ad=03(O) Atr=0d(Isoc) MxPS=  68 Ivl=1ms
    I:  If#= 5 Alt= 1 #EPs= 1 Cls=01(audio) Sub=02 Prot=20 Driver=snd-usb-audio
    E:  Ad=84(I) Atr=0d(Isoc) MxPS=  68 Ivl=1ms
    I:  If#= 6 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=60 Driver=option
    E:  Ad=04(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=86(I) Atr=03(Int.) MxPS=  10 Ivl=32ms
    I:  If#= 7 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option
    E:  Ad=05(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=87(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=88(I) Atr=03(Int.) MxPS=  10 Ivl=32ms
    I:  If#= 8 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option
    E:  Ad=06(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=89(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=8a(I) Atr=03(Int.) MxPS=  10 Ivl=32ms
    I:  If#= 9 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option
    E:  Ad=07(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=8b(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=8c(I) Atr=03(Int.) MxPS=  10 Ivl=32ms
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Fabio Porcedda <fabio.porcedda@gmail.com>
    Acked-by: Bjørn Mork <bjorn@mork.no>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: vlan: Make is_vlan_dev() a stub when VLAN is not configured [+ + +]

Author: Gal Pressman <gal@nvidia.com>
Date:   Mon Jun 16 16:26:24 2025 +0300

    net: vlan: Make is_vlan_dev() a stub when VLAN is not configured
    
    [ Upstream commit 2de1ba0887e5d3bf02d7c212f380039b34e10aa3 ]
    
    Add a stub implementation of is_vlan_dev() that returns false when
    VLAN support is not compiled in (CONFIG_VLAN_8021Q=n).
    
    This allows us to compile-out VLAN-dependent dead code when it is not
    needed.
    
    This also resolves the following compilation error when:
    * CONFIG_VLAN_8021Q=n
    * CONFIG_OBJTOOL=y
    * CONFIG_OBJTOOL_WERROR=y
    
    drivers/net/ethernet/mellanox/mlx5/core/mlx5_core.o: error: objtool: parse_mirred.isra.0+0x370: mlx5e_tc_act_vlan_add_push_action() missing __noreturn in .c/.h or NORETURN() in noreturns.h
    
    The error occurs because objtool cannot determine that unreachable BUG()
    (which doesn't return) calls in VLAN code paths are actually dead code
    when VLAN support is disabled.
    
    Signed-off-by: Gal Pressman <gal@nvidia.com>
    Link: https://patch.msgid.link/20250616132626.1749331-2-gal@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: vlan: Replace BUG() with WARN_ON_ONCE() in vlan_dev_* stubs [+ + +]

Author: Gal Pressman <gal@nvidia.com>
Date:   Mon Jun 16 16:26:25 2025 +0300

    net: vlan: Replace BUG() with WARN_ON_ONCE() in vlan_dev_* stubs
    
    [ Upstream commit 60a8b1a5d0824afda869f18dc0ecfe72f8dfda42 ]
    
    When CONFIG_VLAN_8021Q=n, a set of stub helpers are used, three of these
    helpers use BUG() unconditionally.
    
    This code should not be reached, as callers of these functions should
    always check for is_vlan_dev() first, but the usage of BUG() is not
    recommended, replace it with WARN_ON() instead.
    
    Reviewed-by: Alex Lazar <alazar@nvidia.com>
    Reviewed-by: Dragos Tatulea <dtatulea@nvidia.com>
    Signed-off-by: Gal Pressman <gal@nvidia.com>
    Link: https://patch.msgid.link/20250616132626.1749331-3-gal@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: ctnetlink: fix refcount leak on table dump [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Fri Aug 1 17:25:08 2025 +0200

    netfilter: ctnetlink: fix refcount leak on table dump
    
    [ Upstream commit de788b2e6227462b6dcd0e07474e72c089008f74 ]
    
    There is a reference count leak in ctnetlink_dump_table():
          if (res < 0) {
                    nf_conntrack_get(&ct->ct_general); // HERE
                    cb->args[1] = (unsigned long)ct;
                    ...
    
    While its very unlikely, its possible that ct == last.
    If this happens, then the refcount of ct was already incremented.
    This 2nd increment is never undone.
    
    This prevents the conntrack object from being released, which in turn
    keeps prevents cnet->count from dropping back to 0.
    
    This will then block the netns dismantle (or conntrack rmmod) as
    nf_conntrack_cleanup_net_list() will wait forever.
    
    This can be reproduced by running conntrack_resize.sh selftest in a loop.
    It takes ~20 minutes for me on a preemptible kernel on average before
    I see a runaway kworker spinning in nf_conntrack_cleanup_net_list.
    
    One fix would to change this to:
            if (res < 0) {
                    if (ct != last)
                            nf_conntrack_get(&ct->ct_general);
    
    But this reference counting isn't needed in the first place.
    We can just store a cookie value instead.
    
    A followup patch will do the same for ctnetlink_exp_dump_table,
    it looks to me as if this has the same problem and like
    ctnetlink_dump_table, we only need a 'skip hint', not the actual
    object so we can apply the same cookie strategy there as well.
    
    Fixes: d205dc40798d ("[NETFILTER]: ctnetlink: fix deadlock in table dumping")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: ctnetlink: remove refcounting in expectation dumpers [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Fri Aug 1 17:25:09 2025 +0200

    netfilter: ctnetlink: remove refcounting in expectation dumpers
    
    [ Upstream commit 1492e3dcb2be3aa46d1963da96aa9593e4e4db5a ]
    
    Same pattern as previous patch: do not keep the expectation object
    alive via refcount, only store a cookie value and then use that
    as the skip hint for dump resumption.
    
    AFAICS this has the same issue as the one resolved in the conntrack
    dumper, when we do
      if (!refcount_inc_not_zero(&exp->use))
    
    to increment the refcount, there is a chance that exp == last, which
    causes a double-increment of the refcount and subsequent memory leak.
    
    Fixes: cf6994c2b981 ("[NETFILTER]: nf_conntrack_netlink: sync expectation dumping with conntrack table dumping")
    Fixes: e844a928431f ("netfilter: ctnetlink: allow to dump expectation per master conntrack")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: nf_tables: reject duplicate device on updates [+ + +]

Author: Pablo Neira Ayuso <pablo@netfilter.org>
Date:   Wed Aug 13 02:38:50 2025 +0200

    netfilter: nf_tables: reject duplicate device on updates
    
    [ Upstream commit cf5fb87fcdaaaafec55dcc0dc5a9e15ead343973 ]
    
    A chain/flowtable update with duplicated devices in the same batch is
    possible. Unfortunately, netdev event path only removes the first
    device that is found, leaving unregistered the hook of the duplicated
    device.
    
    Check if a duplicated device exists in the transaction batch, bail out
    with EEXIST in such case.
    
    WARNING is hit when unregistering the hook:
    
     [49042.221275] WARNING: CPU: 4 PID: 8425 at net/netfilter/core.c:340 nf_hook_entry_head+0xaa/0x150
     [49042.221375] CPU: 4 UID: 0 PID: 8425 Comm: nft Tainted: G S                  6.16.0+ #170 PREEMPT(full)
     [...]
     [49042.221382] RIP: 0010:nf_hook_entry_head+0xaa/0x150
    
    Fixes: 78d9f48f7f44 ("netfilter: nf_tables: add devices to existing flowtable")
    Fixes: b9703ed44ffb ("netfilter: nf_tables: support for adding new devices to an existing netdev chain")
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: nft_set_pipapo: prefer kvmalloc for scratch maps [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed Jul 9 19:05:16 2025 +0200

    netfilter: nft_set_pipapo: prefer kvmalloc for scratch maps
    
    [ Upstream commit 897eefee2eb73ec6c119a0ca357d7b4a3e92c5ef ]
    
    The scratchmap size depends on the number of elements in the set.
    For huge sets, each scratch map can easily require very large
    allocations, e.g. for 100k entries each scratch map will require
    close to 64kbyte of memory.
    
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Reviewed-by: Stefano Brivio <sbrivio@redhat.com>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netlink: avoid infinite retry looping in netlink_unicast() [+ + +]

Author: Fedor Pchelkin <pchelkin@ispras.ru>
Date:   Mon Jul 28 11:06:47 2025 +0300

    netlink: avoid infinite retry looping in netlink_unicast()
    
    commit 759dfc7d04bab1b0b86113f1164dc1fec192b859 upstream.
    
    netlink_attachskb() checks for the socket's read memory allocation
    constraints. Firstly, it has:
    
      rmem < READ_ONCE(sk->sk_rcvbuf)
    
    to check if the just increased rmem value fits into the socket's receive
    buffer. If not, it proceeds and tries to wait for the memory under:
    
      rmem + skb->truesize > READ_ONCE(sk->sk_rcvbuf)
    
    The checks don't cover the case when skb->truesize + sk->sk_rmem_alloc is
    equal to sk->sk_rcvbuf. Thus the function neither successfully accepts
    these conditions, nor manages to reschedule the task - and is called in
    retry loop for indefinite time which is caught as:
    
      rcu: INFO: rcu_sched self-detected stall on CPU
      rcu:     0-....: (25999 ticks this GP) idle=ef2/1/0x4000000000000000 softirq=262269/262269 fqs=6212
      (t=26000 jiffies g=230833 q=259957)
      NMI backtrace for cpu 0
      CPU: 0 PID: 22 Comm: kauditd Not tainted 5.10.240 #68
      Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-4.fc42 04/01/2014
      Call Trace:
      <IRQ>
      dump_stack lib/dump_stack.c:120
      nmi_cpu_backtrace.cold lib/nmi_backtrace.c:105
      nmi_trigger_cpumask_backtrace lib/nmi_backtrace.c:62
      rcu_dump_cpu_stacks kernel/rcu/tree_stall.h:335
      rcu_sched_clock_irq.cold kernel/rcu/tree.c:2590
      update_process_times kernel/time/timer.c:1953
      tick_sched_handle kernel/time/tick-sched.c:227
      tick_sched_timer kernel/time/tick-sched.c:1399
      __hrtimer_run_queues kernel/time/hrtimer.c:1652
      hrtimer_interrupt kernel/time/hrtimer.c:1717
      __sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1113
      asm_call_irq_on_stack arch/x86/entry/entry_64.S:808
      </IRQ>
    
      netlink_attachskb net/netlink/af_netlink.c:1234
      netlink_unicast net/netlink/af_netlink.c:1349
      kauditd_send_queue kernel/audit.c:776
      kauditd_thread kernel/audit.c:897
      kthread kernel/kthread.c:328
      ret_from_fork arch/x86/entry/entry_64.S:304
    
    Restore the original behavior of the check which commit in Fixes
    accidentally missed when restructuring the code.
    
    Found by Linux Verification Center (linuxtesting.org).
    
    Fixes: ae8f160e7eb2 ("netlink: Fix wraparounds of sk->sk_rmem_alloc.")
    Cc: stable@vger.kernel.org
    Signed-off-by: Fedor Pchelkin <pchelkin@ispras.ru>
    Reviewed-by: Kuniyuki Iwashima <kuniyu@google.com>
    Link: https://patch.msgid.link/20250728080727.255138-1-pchelkin@ispras.ru
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

netmem: fix skb_frag_address_safe with unreadable skbs [+ + +]

Author: Mina Almasry <almasrymina@google.com>
Date:   Thu Jun 19 17:52:38 2025 +0000

    netmem: fix skb_frag_address_safe with unreadable skbs
    
    [ Upstream commit 4672aec56d2e8edabcb74c3e2320301d106a377e ]
    
    skb_frag_address_safe() needs a check that the
    skb_frag_page exists check similar to skb_frag_address().
    
    Cc: ap420073@gmail.com
    
    Signed-off-by: Mina Almasry <almasrymina@google.com>
    Acked-by: Stanislav Fomichev <sdf@fomichev.me>
    Link: https://patch.msgid.link/20250619175239.3039329-1-almasrymina@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

NFS: Fix the setting of capabilities when automounting a new filesystem [+ + +]

Author: Trond Myklebust <trond.myklebust@hammerspace.com>
Date:   Sun Aug 3 14:31:59 2025 -0700

    NFS: Fix the setting of capabilities when automounting a new filesystem
    
    commit b01f21cacde9f2878492cf318fee61bf4ccad323 upstream.
    
    Capabilities cannot be inherited when we cross into a new filesystem.
    They need to be reset to the minimal defaults, and then probed for
    again.
    
    Fixes: 54ceac451598 ("NFS: Share NFS superblocks per-protocol per-server per-FSID")
    Cc: stable@vger.kernel.org
    Reviewed-by: Benjamin Coddington <bcodding@redhat.com>
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

NFSD: detect mismatch of file handle and delegation stateid in OPEN op [+ + +]

Author: Dai Ngo <dai.ngo@oracle.com>
Date:   Tue Jun 10 08:35:28 2025 -0700

    NFSD: detect mismatch of file handle and delegation stateid in OPEN op
    
    commit 9c65001c57164033ad08b654c8b5ae35512ddf4a upstream.
    
    When the client sends an OPEN with claim type CLAIM_DELEG_CUR_FH or
    CLAIM_DELEGATION_CUR, the delegation stateid and the file handle
    must belong to the same file, otherwise return NFS4ERR_INVAL.
    
    Note that RFC8881, section 8.2.4, mandates the server to return
    NFS4ERR_BAD_STATEID if the selected table entry does not match the
    current filehandle. However returning NFS4ERR_BAD_STATEID in the
    OPEN causes the client to retry the operation and therefor get the
    client into a loop. To avoid this situation we return NFS4ERR_INVAL
    instead.
    
    Reported-by: Petro Pavlov <petro.pavlov@vastdata.com>
    Fixes: c44c5eeb2c02 ("[PATCH] nfsd4: add open state code for CLAIM_DELEGATE_CUR")
    Cc: stable@vger.kernel.org
    Signed-off-by: Dai Ngo <dai.ngo@oracle.com>
    Reviewed-by: Jeff Layton <jlayton@kernel.org>
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

nfsd: handle get_client_locked() failure in nfsd4_setclientid_confirm() [+ + +]

Author: Jeff Layton <jlayton@kernel.org>
Date:   Wed Jun 4 12:01:10 2025 -0400

    nfsd: handle get_client_locked() failure in nfsd4_setclientid_confirm()
    
    commit 908e4ead7f757504d8b345452730636e298cbf68 upstream.
    
    Lei Lu recently reported that nfsd4_setclientid_confirm() did not check
    the return value from get_client_locked(). a SETCLIENTID_CONFIRM could
    race with a confirmed client expiring and fail to get a reference. That
    could later lead to a UAF.
    
    Fix this by getting a reference early in the case where there is an
    extant confirmed client. If that fails then treat it as if there were no
    confirmed client found at all.
    
    In the case where the unconfirmed client is expiring, just fail and
    return the result from get_client_locked().
    
    Reported-by: lei lu <llfamsec@gmail.com>
    Closes: https://lore.kernel.org/linux-nfs/CAEBF3_b=UvqzNKdnfD_52L05Mqrqui9vZ2eFamgAbV0WG+FNWQ@mail.gmail.com/
    Fixes: d20c11d86d8f ("nfsd: Protect session creation and client confirm using client_lock")
    Cc: stable@vger.kernel.org
    Signed-off-by: Jeff Layton <jlayton@kernel.org>
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

nvme-pci: try function level reset on init failure [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Tue Jul 15 12:16:27 2025 -0700

    nvme-pci: try function level reset on init failure
    
    [ Upstream commit 5b2c214a95942f7997d1916a4c44017becbc3cac ]
    
    NVMe devices from multiple vendors appear to get stuck in a reset state
    that we can't get out of with an NVMe level Controller Reset. The kernel
    would report these with messages that look like:
    
      Device not ready; aborting reset, CSTS=0x1
    
    These have historically required a power cycle to make them usable
    again, but in many cases, a PCIe FLR is sufficient to restart operation
    without a power cycle. Try it if the initial controller reset fails
    during any nvme reset attempt.
    
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Reviewed-by: Chaitanya Kulkarni <kch@nvidia.com>
    Reviewed-by: Nitesh Shetty <nj.shetty@samsung.com>
    Signed-off-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme-tcp: log TLS handshake failures at error level [+ + +]

Author: Maurizio Lombardi <mlombard@redhat.com>
Date:   Wed Jul 2 16:06:29 2025 +0200

    nvme-tcp: log TLS handshake failures at error level
    
    [ Upstream commit 5a58ac9bfc412a58c3cf26c6a7e54d4308e9d109 ]
    
    Update the nvme_tcp_start_tls() function to use dev_err() instead of
    dev_dbg() when a TLS error is detected. This ensures that handshake
    failures are visible by default, aiding in debugging.
    
    Signed-off-by: Maurizio Lombardi <mlombard@redhat.com>
    Reviewed-by: Laurence Oberman <loberman@redhat.com>
    Reviewed-by: Hannes Reinecke <hare@suse.de>
    Signed-off-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ocfs2: reset folio to NULL when get folio fails [+ + +]

Author: Lizhi Xu <lizhi.xu@windriver.com>
Date:   Mon Jun 16 09:31:40 2025 +0800

    ocfs2: reset folio to NULL when get folio fails
    
    commit 2ae826799932ff89409f56636ad3c25578fe7cf5 upstream.
    
    The reproducer uses FAULT_INJECTION to make memory allocation fail, which
    causes __filemap_get_folio() to fail, when initializing w_folios[i] in
    ocfs2_grab_folios_for_write(), it only returns an error code and the value
    of w_folios[i] is the error code, which causes
    ocfs2_unlock_and_free_folios() to recycle the invalid w_folios[i] when
    releasing folios.
    
    Link: https://lkml.kernel.org/r/20250616013140.3602219-1-lizhi.xu@windriver.com
    Reported-by: syzbot+c2ea94ae47cd7e3881ec@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=c2ea94ae47cd7e3881ec
    Signed-off-by: Lizhi Xu <lizhi.xu@windriver.com>
    Reviewed-by: Joseph Qi <joseph.qi@linux.alibaba.com>
    Cc: Mark Fasheh <mark@fasheh.com>
    Cc: Joel Becker <jlbec@evilplan.org>
    Cc: Junxiao Bi <junxiao.bi@oracle.com>
    Cc: Changwei Ge <gechangwei@live.cn>
    Cc: Jun Piao <piaojun@huawei.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

parisc: Makefile: fix a typo in palo.conf [+ + +]

Author: Randy Dunlap <rdunlap@infradead.org>
Date:   Wed Jun 25 00:39:33 2025 -0700

    parisc: Makefile: fix a typo in palo.conf
    
    commit 963f1b20a8d2a098954606b9725cd54336a2a86c upstream.
    
    Correct "objree" to "objtree". "objree" is not defined.
    
    Fixes: 75dd47472b92 ("kbuild: remove src and obj from the top Makefile")
    Signed-off-by: Randy Dunlap <rdunlap@infradead.org>
    Cc: Masahiro Yamada <masahiroy@kernel.org>
    Cc: "James E.J. Bottomley" <James.Bottomley@HansenPartnership.com>
    Cc: Helge Deller <deller@gmx.de>
    Cc: linux-parisc@vger.kernel.org
    Signed-off-by: Helge Deller <deller@gmx.de>
    Cc: stable@vger.kernel.org # v5.3+
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

PCI/ACPI: Fix runtime PM ref imbalance on Hot-Plug Capable ports [+ + +]

Author: Lukas Wunner <lukas@wunner.de>
Date:   Sun Jul 13 16:31:01 2025 +0200

    PCI/ACPI: Fix runtime PM ref imbalance on Hot-Plug Capable ports
    
    commit 6cff20ce3b92ffbf2fc5eb9e5a030b3672aa414a upstream.
    
    pci_bridge_d3_possible() is called from both pcie_portdrv_probe() and
    pcie_portdrv_remove() to determine whether runtime power management shall
    be enabled (on probe) or disabled (on remove) on a PCIe port.
    
    The underlying assumption is that pci_bridge_d3_possible() always returns
    the same value, else a runtime PM reference imbalance would occur.  That
    assumption is not given if the PCIe port is inaccessible on remove due to
    hot-unplug:  pci_bridge_d3_possible() calls pciehp_is_native(), which
    accesses Config Space to determine whether the port is Hot-Plug Capable.
    An inaccessible port returns "all ones", which is converted to "all
    zeroes" by pcie_capability_read_dword().  Hence the port no longer seems
    Hot-Plug Capable on remove even though it was on probe.
    
    The resulting runtime PM ref imbalance causes warning messages such as:
    
      pcieport 0000:02:04.0: Runtime PM usage count underflow!
    
    Avoid the Config Space access (and thus the runtime PM ref imbalance) by
    caching the Hot-Plug Capable bit in struct pci_dev.
    
    The struct already contains an "is_hotplug_bridge" flag, which however is
    not only set on Hot-Plug Capable PCIe ports, but also Conventional PCI
    Hot-Plug bridges and ACPI slots.  The flag identifies bridges which are
    allocated additional MMIO and bus number resources to allow for hierarchy
    expansion.
    
    The kernel is somewhat sloppily using "is_hotplug_bridge" in a number of
    places to identify Hot-Plug Capable PCIe ports, even though the flag
    encompasses other devices.  Subsequent commits replace these occurrences
    with the new flag to clearly delineate Hot-Plug Capable PCIe ports from
    other kinds of hotplug bridges.
    
    Document the existing "is_hotplug_bridge" and the new "is_pciehp" flag
    and document the (non-obvious) requirement that pci_bridge_d3_possible()
    always returns the same value across the entire lifetime of a bridge,
    including its hot-removal.
    
    Fixes: 5352a44a561d ("PCI: pciehp: Make pciehp_is_native() stricter")
    Reported-by: Laurent Bigonville <bigon@bigon.be>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=220216
    Reported-by: Mario Limonciello <mario.limonciello@amd.com>
    Closes: https://lore.kernel.org/r/20250609020223.269407-3-superm1@kernel.org/
    Link: https://lore.kernel.org/all/20250620025535.3425049-3-superm1@kernel.org/T/#u
    Signed-off-by: Lukas Wunner <lukas@wunner.de>
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    Acked-by: Rafael J. Wysocki <rafael@kernel.org>
    Cc: stable@vger.kernel.org # v4.18+
    Link: https://patch.msgid.link/fe5dcc3b2e62ee1df7905d746bde161eb1b3291c.1752390101.git.lukas@wunner.de
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

PCI: dw-rockchip: Delay link training after hot reset in EP mode [+ + +]

Author: Wilfred Mallawa <wilfred.mallawa@wdc.com>
Date:   Fri Jun 13 12:19:09 2025 +0200

    PCI: dw-rockchip: Delay link training after hot reset in EP mode
    
    [ Upstream commit c0b93754547dde16c8370b8fdad5f396e7786647 ]
    
    RK3588 TRM, section "11.6.1.3.3 Hot Reset and Link-Down Reset" states that:
    
      If you want to delay link re-establishment (after reset) so that you can
      reprogram some registers through DBI, you must set app_ltssm_enable =0
      immediately after core_rst_n as shown in above. This can be achieved by
      enable the app_dly2_en, and end-up the delay by assert app_dly2_done.
    
    I.e. setting app_dly2_en will automatically deassert app_ltssm_enable on
    a hot reset, and setting app_dly2_done will re-assert app_ltssm_enable,
    re-enabling link training.
    
    When receiving a hot reset/link-down IRQ when running in EP mode, we will
    call dw_pcie_ep_linkdown(), which may update registers through DBI. Unless
    link training is inhibited, these register updates race with the link
    training.
    
    To avoid the race, set PCIE_LTSSM_APP_DLY2_EN so the controller never
    automatically trains the link after a link-down or hot reset interrupt.
    That way any DBI updates done in the dw_pcie_ep_linkdown() path will happen
    while the link is still down.  Then allow link training by setting
    PCIE_LTSSM_APP_DLY2_DONE
    
    Co-developed-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Wilfred Mallawa <wilfred.mallawa@wdc.com>
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Manivannan Sadhasivam <mani@kernel.org>
    [bhelgaas: commit log]
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    Link: https://patch.msgid.link/20250613101908.2182053-2-cassel@kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

PCI: Extend isolated function probing to LoongArch [+ + +]

Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Tue Jun 24 14:29:27 2025 +0800

    PCI: Extend isolated function probing to LoongArch
    
    commit a02fd05661d73a8507dd70dd820e9b984490c545 upstream.
    
    Like s390 and the jailhouse hypervisor, LoongArch's PCI architecture allows
    passing isolated PCI functions to a guest OS instance. So it is possible
    that there is a multi-function device without function 0 for the host or
    guest.
    
    Allow probing such functions by adding a IS_ENABLED(CONFIG_LOONGARCH) case
    in the hypervisor_isolated_pci_functions() helper.
    
    This is similar to commit 189c6c33ff42 ("PCI: Extend isolated function
    probing to s390").
    
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    Cc: stable@vger.kernel.org
    Link: https://patch.msgid.link/20250624062927.4037734-1-chenhuacai@loongson.cn
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

perf/arm: Add missing .suppress_bind_attrs [+ + +]

Author: Robin Murphy <robin.murphy@arm.com>
Date:   Wed Jul 2 15:08:13 2025 +0100

    perf/arm: Add missing .suppress_bind_attrs
    
    [ Upstream commit 860a831de138a7ad6bc86019adaf10eb84c02655 ]
    
    PMU drivers should set .suppress_bind_attrs so that userspace is denied
    the opportunity to pull the driver out from underneath an in-use PMU
    (with predictably unpleasant consequences). Somehow both the CMN and NI
    drivers have managed to miss this; put that right.
    
    Signed-off-by: Robin Murphy <robin.murphy@arm.com>
    Reviewed-by: Leo Yan <leo.yan@arm.com>
    Link: https://lore.kernel.org/r/acd48c341b33b96804a3969ee00b355d40c546e2.1751465293.git.robin.murphy@arm.com
    Signed-off-by: Will Deacon <will@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

perf/cxlpmu: Remove unintended newline from IRQ name format string [+ + +]

Author: Alok Tiwari <alok.a.tiwari@oracle.com>
Date:   Tue Jun 24 12:43:39 2025 -0700

    perf/cxlpmu: Remove unintended newline from IRQ name format string
    
    [ Upstream commit 3e870815ccf5bc75274158f0b5e234fce6f93229 ]
    
    The IRQ name format string used in devm_kasprintf() mistakenly included
    a newline character "\n".
    This could lead to confusing log output or misformatted names in sysfs
    or debug messages.
    
    This fix removes the newline to ensure proper IRQ naming.
    
    Signed-off-by: Alok Tiwari <alok.a.tiwari@oracle.com>
    Reviewed-by: Jonathan Cameron <jonathan.cameron@huawei.com>
    Link: https://lore.kernel.org/r/20250624194350.109790-3-alok.a.tiwari@oracle.com
    Signed-off-by: Will Deacon <will@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: rockchip-pcie: Enable all four lanes if required [+ + +]

Author: Valmantas Paliksa <walmis@gmail.com>
Date:   Mon Jun 30 19:25:14 2025 -0300

    phy: rockchip-pcie: Enable all four lanes if required
    
    [ Upstream commit c3fe7071e196e25789ecf90dbc9e8491a98884d7 ]
    
    Current code enables only Lane 0 because pwr_cnt will be incremented on
    first call to the function. Let's reorder the enablement code to enable
    all 4 lanes through GRF.
    
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Reviewed-by: Robin Murphy <robin.murphy@arm.com>
    
    Signed-off-by: Valmantas Paliksa <walmis@gmail.com>
    Signed-off-by: Geraldo Nascimento <geraldogabriel@gmail.com>
    Reviewed-by: Robin Murphy <robin.murphy@arm.com>
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Link: https://lore.kernel.org/r/16b610aab34e069fd31d9f57260c10df2a968f80.1751322015.git.geraldogabriel@gmail.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: rockchip-pcie: Properly disable TEST_WRITE strobe signal [+ + +]

Author: Geraldo Nascimento <geraldogabriel@gmail.com>
Date:   Mon Jun 30 19:25:28 2025 -0300

    phy: rockchip-pcie: Properly disable TEST_WRITE strobe signal
    
    [ Upstream commit 25facbabc3fc33c794ad09d73f73268c0f8cbc7d ]
    
    pcie_conf is used to touch TEST_WRITE strobe signal. This signal should
    be enabled, a little time waited, and then disabled. Current code clearly
    was copy-pasted and never disables the strobe signal. Adjust the define.
    While at it, remove PHY_CFG_RD_MASK which has been unused since
    64cdc0360811 ("phy: rockchip-pcie: remove unused phy_rd_cfg function").
    
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Signed-off-by: Geraldo Nascimento <geraldogabriel@gmail.com>
    Link: https://lore.kernel.org/r/d514d5d5627680caafa8b7548cbdfee4307f5440.1751322015.git.geraldogabriel@gmail.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pidfs: raise SB_I_NODEV and SB_I_NOEXEC [+ + +]

Author: Christian Brauner <brauner@kernel.org>
Date:   Wed Jun 18 22:53:35 2025 +0200

    pidfs: raise SB_I_NODEV and SB_I_NOEXEC
    
    [ Upstream commit 1a1ad73aa1a66787f05f7f10f686b74bab77be72 ]
    
    Similar to commit 1ed95281c0c7 ("anon_inode: raise SB_I_NODEV and SB_I_NOEXEC"):
    it shouldn't be possible to execute pidfds via
    execveat(fd_anon_inode, "", NULL, NULL, AT_EMPTY_PATH)
    so raise SB_I_NOEXEC so that no one gets any creative ideas.
    
    Also raise SB_I_NODEV as we don't expect or support any devices on pidfs.
    
    Link: https://lore.kernel.org/20250618-work-pidfs-persistent-v2-1-98f3456fd552@kernel.org
    Reviewed-by: Alexander Mikhalitsyn <aleksandr.mikhalitsyn@canonical.com>
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: stm32: Manage irq affinity settings [+ + +]

Author: Cheick Traore <cheick.traore@foss.st.com>
Date:   Tue Jun 10 16:30:39 2025 +0200

    pinctrl: stm32: Manage irq affinity settings
    
    [ Upstream commit 4c5cc2f65386e22166ce006efe515c667aa075e4 ]
    
    Trying to set the affinity of the interrupts associated to stm32
    pinctrl results in a write error.
    
    Fill struct irq_chip::irq_set_affinity to use the default helper
    function.
    
    Signed-off-by: Cheick Traore <cheick.traore@foss.st.com>
    Signed-off-by: Antonio Borneo <antonio.borneo@foss.st.com>
    Link: https://lore.kernel.org/20250610143042.295376-3-antonio.borneo@foss.st.com
    Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/chrome: cros_ec_sensorhub: Retries when a sensor is not ready [+ + +]

Author: Gwendal Grignou <gwendal@chromium.org>
Date:   Mon Jun 23 14:05:18 2025 -0700

    platform/chrome: cros_ec_sensorhub: Retries when a sensor is not ready
    
    [ Upstream commit 981d7f91aeda17424b29f033249f4fa7cd2a7556 ]
    
    When the EC/ISH starts, it can take a while for all the sensors to be up
    and running or declared broken.
    
    If the sensor stack return -EBUSY when checking for sensor information,
    retry up to 50 times.
    It has been observed 100ms wait time is enough to have valid sensors
    ready. It can take more time in case a sensor is really broken and is
    not coming up.
    
    Signed-off-by: Gwendal Grignou <gwendal@google.com>
    Link: https://lore.kernel.org/r/20250623210518.306740-1-gwendal@google.com
    Signed-off-by: Tzung-Bi Shih <tzungbi@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/chrome: cros_ec_typec: Defer probe on missing EC parent [+ + +]

Author: Tomasz Michalec <tmichalec@google.com>
Date:   Tue Jun 10 17:37:47 2025 +0200

    platform/chrome: cros_ec_typec: Defer probe on missing EC parent
    
    [ Upstream commit 8866f4e557eba43e991f99711515217a95f62d2e ]
    
    If cros_typec_probe is called before EC device is registered,
    cros_typec_probe will fail. It may happen when cros-ec-typec.ko is
    loaded before EC bus layer module (e.g. cros_ec_lpcs.ko,
    cros_ec_spi.ko).
    
    Return -EPROBE_DEFER when cros_typec_probe doesn't get EC device, so
    the probe function can be called again after EC device is registered.
    
    Signed-off-by: Tomasz Michalec <tmichalec@google.com>
    Reviewed-by: Abhishek Pandit-Subedi <abhishekpandit@chromium.org>
    Link: https://lore.kernel.org/r/20250610153748.1858519-1-tmichalec@google.com
    Signed-off-by: Tzung-Bi Shih <tzungbi@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86/amd: pmc: Add Lenovo Yoga 6 13ALC6 to pmc quirk list [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Fri Jul 18 12:23:05 2025 -0500

    platform/x86/amd: pmc: Add Lenovo Yoga 6 13ALC6 to pmc quirk list
    
    [ Upstream commit 4ff3aeb664f7dfe824ba91ffb0b203397a8d431e ]
    
    The Lenovo Yoga 6 13ACL6 82ND has a similar BIOS problem as other Lenovo
    laptops from that vintage that causes a rather long resume from suspend.
    
    Add it to the quirk list that manipulates the scratch register to avoid
    the issue.
    
    Reported-by: Adam Berglund <adam.f.berglund@hotmail.com>
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4434
    Tested-by: Adam Berglund <adam.f.berglund@hotmail.com>
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Reviewed-by: Hans de Goede <hansg@kernel.org>
    Link: https://lore.kernel.org/r/20250718172307.1928744-1-superm1@kernel.org
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: thinkpad_acpi: Handle KCOV __init vs inline mismatches [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Thu May 29 11:18:37 2025 -0700

    platform/x86: thinkpad_acpi: Handle KCOV __init vs inline mismatches
    
    [ Upstream commit 6418a8504187dc7f5b6f9d0649c03e362cb0664b ]
    
    When KCOV is enabled all functions get instrumented, unless the
    __no_sanitize_coverage attribute is used. To prepare for
    __no_sanitize_coverage being applied to __init functions[1], we have
    to handle differences in how GCC's inline optimizations get resolved.
    For thinkpad_acpi routines, this means forcing two functions to be
    inline with __always_inline.
    
    Link: https://lore.kernel.org/lkml/20250523043935.2009972-11-kees@kernel.org/ [1]
    Signed-off-by: Kees Cook <kees@kernel.org>
    Link: https://lore.kernel.org/r/20250529181831.work.439-kees@kernel.org
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

PM / devfreq: governor: Replace sscanf() with kstrtoul() in set_freq_store() [+ + +]

Author: Lifeng Zheng <zhenglifeng1@huawei.com>
Date:   Mon Apr 21 11:00:17 2025 +0800

    PM / devfreq: governor: Replace sscanf() with kstrtoul() in set_freq_store()
    
    [ Upstream commit 914cc799b28f17d369d5b4db3b941957d18157e8 ]
    
    Replace sscanf() with kstrtoul() in set_freq_store() and check the result
    to avoid invalid input.
    
    Signed-off-by: Lifeng Zheng <zhenglifeng1@huawei.com>
    Link: https://lore.kernel.org/lkml/20250421030020.3108405-2-zhenglifeng1@huawei.com/
    Signed-off-by: Chanwoo Choi <cw00.choi@samsung.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pm: cpupower: Fix the snapshot-order of tsc,mperf, clock in mperf_stop() [+ + +]

Author: Gautham R. Shenoy <gautham.shenoy@amd.com>
Date:   Thu Jun 12 17:53:54 2025 +0530

    pm: cpupower: Fix the snapshot-order of tsc,mperf, clock in mperf_stop()
    
    [ Upstream commit cda7ac8ce7de84cf32a3871ba5f318aa3b79381e ]
    
    In the function mperf_start(), mperf_monitor snapshots the time, tsc
    and finally the aperf,mperf MSRs. However, this order of snapshotting
    in is reversed in mperf_stop(). As a result, the C0 residency (which
    is computed as delta_mperf * 100 / delta_tsc) is under-reported on
    CPUs that is 100% busy.
    
    Fix this by snapshotting time, tsc and then aperf,mperf in
    mperf_stop() in the same order as in mperf_start().
    
    Link: https://lore.kernel.org/r/20250612122355.19629-2-gautham.shenoy@amd.com
    Signed-off-by: Gautham R. Shenoy <gautham.shenoy@amd.com>
    Signed-off-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

PM: runtime: Clear power.needs_force_resume in pm_runtime_reinit() [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Fri Jun 27 21:16:05 2025 +0200

    PM: runtime: Clear power.needs_force_resume in pm_runtime_reinit()
    
    [ Upstream commit 89d9cec3b1e9c49bae9375a2db6dc49bc7468af0 ]
    
    Clear power.needs_force_resume in pm_runtime_reinit() in case it has
    been set by pm_runtime_force_suspend() invoked from a driver remove
    callback.
    
    Suggested-by: Ulf Hansson <ulf.hansson@linaro.org>
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Ulf Hansson <ulf.hansson@linaro.org>
    Link: https://patch.msgid.link/9495163.CDJkKcVGEf@rjwysocki.net
    Signed-off-by: Sasha Levin <sashal@kernel.org>

PM: sleep: console: Fix the black screen issue [+ + +]

Author: tuhaowen <tuhaowen@uniontech.com>
Date:   Wed Jun 11 11:23:45 2025 +0800

    PM: sleep: console: Fix the black screen issue
    
    [ Upstream commit 4266e8fa56d3d982bf451d382a410b9db432015c ]
    
    When the computer enters sleep status without a monitor
    connected, the system switches the console to the virtual
    terminal tty63(SUSPEND_CONSOLE).
    
    If a monitor is subsequently connected before waking up,
    the system skips the required VT restoration process
    during wake-up, leaving the console on tty63 instead of
    switching back to tty1.
    
    To fix this issue, a global flag vt_switch_done is introduced
    to record whether the system has successfully switched to
    the suspend console via vt_move_to_console() during suspend.
    
    If the switch was completed, vt_switch_done is set to 1.
    Later during resume, this flag is checked to ensure that
    the original console is restored properly by calling
    vt_move_to_console(orig_fgconsole, 0).
    
    This prevents scenarios where the resume logic skips console
    restoration due to incorrect detection of the console state,
    especially when a monitor is reconnected before waking up.
    
    Signed-off-by: tuhaowen <tuhaowen@uniontech.com>
    Link: https://patch.msgid.link/20250611032345.29962-1-tuhaowen@uniontech.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pmdomain: ti: Select PM_GENERIC_DOMAINS [+ + +]

Author: Guillaume La Roque <glaroque@baylibre.com>
Date:   Tue Jul 15 10:50:08 2025 +0200

    pmdomain: ti: Select PM_GENERIC_DOMAINS
    
    [ Upstream commit fcddcb7e8f38a40db99f87a962c5d0a153a76566 ]
    
    Select PM_GENERIC_DOMAINS instead of depending on it to ensure
    it is always enabled when TI_SCI_PM_DOMAINS is selected.
    Since PM_GENERIC_DOMAINS is an implicit symbol, it can only be enabled
    through 'select' and cannot be explicitly enabled in configuration.
    This simplifies the dependency chain and prevents build issues
    
    Signed-off-by: Guillaume La Roque <glaroque@baylibre.com>
    Reviewed-by: Nishanth Menon <nm@ti.com>
    Link: https://lore.kernel.org/r/20250715-depspmdomain-v2-1-6f0eda3ce824@baylibre.com
    Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pNFS: Fix disk addr range check in block/scsi layout [+ + +]

Author: Sergey Bashirov <sergeybashirov@gmail.com>
Date:   Wed Jul 2 16:32:21 2025 +0300

    pNFS: Fix disk addr range check in block/scsi layout
    
    [ Upstream commit 7db6e66663681abda54f81d5916db3a3b8b1a13d ]
    
    At the end of the isect translation, disc_addr represents the physical
    disk offset. Thus, end calculated from disk_addr is also a physical disk
    offset. Therefore, range checking should be done using map->disk_offset,
    not map->start.
    
    Signed-off-by: Sergey Bashirov <sergeybashirov@gmail.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://lore.kernel.org/r/20250702133226.212537-1-sergeybashirov@gmail.com
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pNFS: Fix stripe mapping in block/scsi layout [+ + +]

Author: Sergey Bashirov <sergeybashirov@gmail.com>
Date:   Tue Jul 1 15:21:48 2025 +0300

    pNFS: Fix stripe mapping in block/scsi layout
    
    [ Upstream commit 81438498a285759f31e843ac4800f82a5ce6521f ]
    
    Because of integer division, we need to carefully calculate the
    disk offset. Consider the example below for a stripe of 6 volumes,
    a chunk size of 4096, and an offset of 70000.
    
    chunk = div_u64(offset, dev->chunk_size) = 70000 / 4096 = 17
    offset = chunk * dev->chunk_size = 17 * 4096 = 69632
    disk_offset_wrong = div_u64(offset, dev->nr_children) = 69632 / 6 = 11605
    disk_chunk = div_u64(chunk, dev->nr_children) = 17 / 6 = 2
    disk_offset = disk_chunk * dev->chunk_size = 2 * 4096 = 8192
    
    Signed-off-by: Sergey Bashirov <sergeybashirov@gmail.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://lore.kernel.org/r/20250701122341.199112-1-sergeybashirov@gmail.com
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pNFS: Fix uninited ptr deref in block/scsi layout [+ + +]

Author: Sergey Bashirov <sergeybashirov@gmail.com>
Date:   Mon Jun 30 21:35:26 2025 +0300

    pNFS: Fix uninited ptr deref in block/scsi layout
    
    [ Upstream commit 9768797c219326699778fba9cd3b607b2f1e7950 ]
    
    The error occurs on the third attempt to encode extents. When function
    ext_tree_prepare_commit() reallocates a larger buffer to retry encoding
    extents, the "layoutupdate_pages" page array is initialized only after the
    retry loop. But ext_tree_free_commitdata() is called on every iteration
    and tries to put pages in the array, thus dereferencing uninitialized
    pointers.
    
    An additional problem is that there is no limit on the maximum possible
    buffer_size. When there are too many extents, the client may create a
    layoutcommit that is larger than the maximum possible RPC size accepted
    by the server.
    
    During testing, we observed two typical scenarios. First, one memory page
    for extents is enough when we work with small files, append data to the
    end of the file, or preallocate extents before writing. But when we fill
    a new large file without preallocating, the number of extents can be huge,
    and counting the number of written extents in ext_tree_encode_commit()
    does not help much. Since this number increases even more between
    unlocking and locking of ext_tree, the reallocated buffer may not be
    large enough again and again.
    
    Co-developed-by: Konstantin Evtushenko <koevtushenko@yandex.com>
    Signed-off-by: Konstantin Evtushenko <koevtushenko@yandex.com>
    Signed-off-by: Sergey Bashirov <sergeybashirov@gmail.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://lore.kernel.org/r/20250630183537.196479-2-sergeybashirov@gmail.com
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pNFS: Handle RPC size limit for layoutcommits [+ + +]

Author: Sergey Bashirov <sergeybashirov@gmail.com>
Date:   Mon Jun 30 21:35:29 2025 +0300

    pNFS: Handle RPC size limit for layoutcommits
    
    [ Upstream commit d897d81671bc4615c80f4f3bd5e6b218f59df50c ]
    
    When there are too many block extents for a layoutcommit, they may not
    all fit into the maximum-sized RPC. This patch allows the generic pnfs
    code to properly handle -ENOSPC returned by the block/scsi layout driver
    and trigger additional layoutcommits if necessary.
    
    Co-developed-by: Konstantin Evtushenko <koevtushenko@yandex.com>
    Signed-off-by: Konstantin Evtushenko <koevtushenko@yandex.com>
    Signed-off-by: Sergey Bashirov <sergeybashirov@gmail.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://lore.kernel.org/r/20250630183537.196479-5-sergeybashirov@gmail.com
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

power: supply: qcom_battmgr: Add lithium-polymer entry [+ + +]

Author: Abel Vesa <abel.vesa@linaro.org>
Date:   Fri May 23 13:14:22 2025 +0300

    power: supply: qcom_battmgr: Add lithium-polymer entry
    
    [ Upstream commit 202ac22b8e2e015e6c196fd8113f3d2a62dd1afc ]
    
    On some Dell XPS 13 (9345) variants, the battery used is lithium-polymer
    based. Currently, this is reported as unknown technology due to the entry
    missing.
    
    [ 4083.135325] Unknown battery technology 'LIP'
    
    Add another check for lithium-polymer in the technology parsing callback
    and return that instead of unknown.
    
    Signed-off-by: Abel Vesa <abel.vesa@linaro.org>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250523-psy-qcom-battmgr-add-lipo-entry-v1-1-938c20a43a25@linaro.org
    Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

powerpc/thp: tracing: Hide hugepage events under CONFIG_PPC_BOOK3S_64 [+ + +]

Author: Steven Rostedt <rostedt@goodmis.org>
Date:   Thu Jun 12 10:12:59 2025 -0400

    powerpc/thp: tracing: Hide hugepage events under CONFIG_PPC_BOOK3S_64
    
    [ Upstream commit 43cf0e05089afe23dac74fa6e1e109d49f2903c4 ]
    
    The events hugepage_set_pmd, hugepage_set_pud, hugepage_update_pmd and
    hugepage_update_pud are only called when CONFIG_PPC_BOOK3S_64 is defined.
    As each event can take up to 5K regardless if they are used or not, it's
    best not to define them when they are not used. Add #ifdef around these
    events when they are not used.
    
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Cc: Andrew Morton <akpm@linux-foundation.org>
    Cc: Michael Ellerman <mpe@ellerman.id.au>
    Link: https://lore.kernel.org/20250612101259.0ad43e48@batman.local.home
    Acked-by: David Hildenbrand <david@redhat.com>
    Acked-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

powerpc: floppy: Add missing checks after DMA map [+ + +]

Author: Thomas Fourier <fourier.thomas@gmail.com>
Date:   Fri Jun 20 09:55:55 2025 +0200

    powerpc: floppy: Add missing checks after DMA map
    
    [ Upstream commit cf183c1730f2634245da35e9b5d53381b787d112 ]
    
    The DMA map functions can fail and should be tested for errors.
    
    Signed-off-by: Thomas Fourier <fourier.thomas@gmail.com>
    Reviewed-by: Christophe Leroy <christophe.leroy@csgroup.eu>
    Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Link: https://patch.msgid.link/20250620075602.12575-1-fourier.thomas@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pps: clients: gpio: fix interrupt handling order in remove path [+ + +]

Author: Eliav Farber <farbere@amazon.com>
Date:   Tue May 27 05:33:55 2025 +0000

    pps: clients: gpio: fix interrupt handling order in remove path
    
    [ Upstream commit 6bca1e955830808dc90e0506b2951b4256b81bbb ]
    
    The interrupt handler in pps_gpio_probe() is registered after calling
    pps_register_source() using devm_request_irq(). However, in the
    corresponding remove function, pps_unregister_source() is called before
    the IRQ is freed, since devm-managed resources are released after the
    remove function completes.
    
    This creates a potential race condition where an interrupt may occur
    after the PPS source is unregistered but before the handler is removed,
    possibly leading to a kernel panic.
    
    To prevent this, switch from devm-managed IRQ registration to manual
    management by using request_irq() and calling free_irq() explicitly in
    the remove path before unregistering the PPS source. This ensures the
    interrupt handler is safely removed before deactivating the PPS source.
    
    Signed-off-by: Eliav Farber <farbere@amazon.com>
    Link: https://lore.kernel.org/r/20250527053355.37185-1-farbere@amazon.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

printk: nbcon: Allow reacquire during panic [+ + +]

Author: John Ogness <john.ogness@linutronix.de>
Date:   Fri Jun 6 21:01:49 2025 +0206

    printk: nbcon: Allow reacquire during panic
    
    [ Upstream commit 571c1ea91a73db56bd94054fabecd0f070dc90db ]
    
    If a console printer is interrupted during panic, it will never
    be able to reacquire ownership in order to perform and cleanup.
    That in itself is not a problem, since the non-panic CPU will
    simply quiesce in an endless loop within nbcon_reacquire_nobuf().
    
    However, in this state, platforms that do not support a true NMI
    to interrupt the quiesced CPU will not be able to shutdown that
    CPU from within panic(). This then causes problems for such as
    being unable to load and run a kdump kernel.
    
    Fix this by allowing non-panic CPUs to reacquire ownership using
    a direct acquire. Then the non-panic CPUs can successfullyl exit
    the nbcon_reacquire_nobuf() loop and the console driver can
    perform any necessary cleanup. But more importantly, the CPU is
    no longer quiesced and is free to process any interrupts
    necessary for panic() to shutdown the CPU.
    
    All other forms of acquire are still not allowed for non-panic
    CPUs since it is safer to have them avoid gaining console
    ownership that is not strictly necessary.
    
    Reported-by: Michael Kelley <mhklinux@outlook.com>
    Closes: https://lore.kernel.org/r/SN6PR02MB4157A4C5E8CB219A75263A17D46DA@SN6PR02MB4157.namprd02.prod.outlook.com
    Signed-off-by: John Ogness <john.ogness@linutronix.de>
    Reviewed-by: Petr Mladek <pmladek@suse.com>
    Tested-by: Michael Kelley <mhklinux@outlook.com>
    Link: https://patch.msgid.link/20250606185549.900611-1-john.ogness@linutronix.de
    Signed-off-by: Petr Mladek <pmladek@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ptp: prevent possible ABBA deadlock in ptp_clock_freerun() [+ + +]

Author: Jeongjun Park <aha310510@gmail.com>
Date:   Mon Jul 28 15:26:49 2025 +0900

    ptp: prevent possible ABBA deadlock in ptp_clock_freerun()
    
    [ Upstream commit 2efe41234dbd0a83fdb7cd38226c2f70039a2cd3 ]
    
    syzbot reported the following ABBA deadlock:
    
           CPU0                           CPU1
           ----                           ----
      n_vclocks_store()
        lock(&ptp->n_vclocks_mux) [1]
            (physical clock)
                                         pc_clock_adjtime()
                                           lock(&clk->rwsem) [2]
                                            (physical clock)
                                           ...
                                           ptp_clock_freerun()
                                             ptp_vclock_in_use()
                                               lock(&ptp->n_vclocks_mux) [3]
                                                  (physical clock)
        ptp_clock_unregister()
          posix_clock_unregister()
            lock(&clk->rwsem) [4]
              (virtual clock)
    
    Since ptp virtual clock is registered only under ptp physical clock, both
    ptp_clock and posix_clock must be physical clocks for ptp_vclock_in_use()
    to lock &ptp->n_vclocks_mux and check ptp->n_vclocks.
    
    However, when unregistering vclocks in n_vclocks_store(), the locking
    ptp->n_vclocks_mux is a physical clock lock, but clk->rwsem of
    ptp_clock_unregister() called through device_for_each_child_reverse()
    is a virtual clock lock.
    
    Therefore, clk->rwsem used in CPU0 and clk->rwsem used in CPU1 are
    different locks, but in lockdep, a false positive occurs because the
    possibility of deadlock is determined through lock-class.
    
    To solve this, lock subclass annotation must be added to the posix_clock
    rwsem of the vclock.
    
    Reported-by: syzbot+7cfb66a237c4a5fb22ad@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=7cfb66a237c4a5fb22ad
    Fixes: 73f37068d540 ("ptp: support ptp physical/virtual clocks conversion")
    Signed-off-by: Jeongjun Park <aha310510@gmail.com>
    Acked-by: Richard Cochran <richardcochran@gmail.com>
    Reviewed-by: Vladimir Oltean <vladimir.oltean@nxp.com>
    Link: https://patch.msgid.link/20250728062649.469882-1-aha310510@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ptp: Use ratelimite for freerun error message [+ + +]

Author: Breno Leitao <leitao@debian.org>
Date:   Fri Jun 13 10:15:46 2025 -0700

    ptp: Use ratelimite for freerun error message
    
    [ Upstream commit e9a7795e75b78b56997fb0070c18d6e1057b6462 ]
    
    Replace pr_err() with pr_err_ratelimited() in ptp_clock_settime() to
    prevent log flooding when the physical clock is free running, which
    happens on some of my hosts. This ensures error messages are
    rate-limited and improves kernel log readability.
    
    Signed-off-by: Breno Leitao <leitao@debian.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20250613-ptp-v1-1-ee44260ce9e2@debian.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rcu/nocb: Fix possible invalid rdp's->nocb_cb_kthread pointer access [+ + +]

Author: Zqiang <qiang.zhang1211@gmail.com>
Date:   Wed May 7 19:26:05 2025 +0800

    rcu/nocb: Fix possible invalid rdp's->nocb_cb_kthread pointer access
    
    [ Upstream commit 1bba3900ca18bdae28d1b9fa10f16a8f8cb2ada1 ]
    
    In the preparation stage of CPU online, if the corresponding
    the rdp's->nocb_cb_kthread does not exist, will be created,
    there is a situation where the rdp's rcuop kthreads creation fails,
    and then de-offload this CPU's rdp, does not assign this CPU's
    rdp->nocb_cb_kthread pointer, but this rdp's->nocb_gp_rdp and
    rdp's->rdp_gp->nocb_gp_kthread is still valid.
    
    This will cause the subsequent re-offload operation of this offline
    CPU, which will pass the conditional check and the kthread_unpark()
    will access invalid rdp's->nocb_cb_kthread pointer.
    
    This commit therefore use rdp's->nocb_gp_kthread instead of
    rdp_gp's->nocb_gp_kthread for safety check.
    
    Signed-off-by: Zqiang <qiang.zhang1211@gmail.com>
    Reviewed-by: Frederic Weisbecker <frederic@kernel.org>
    Signed-off-by: Neeraj Upadhyay (AMD) <neeraj.upadhyay@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rcu: Fix racy re-initialization of irq_work causing hangs [+ + +]

Author: Frederic Weisbecker <frederic@kernel.org>
Date:   Fri Aug 8 19:03:22 2025 +0200

    rcu: Fix racy re-initialization of irq_work causing hangs
    
    commit 61399e0c5410567ef60cb1cda34cca42903842e3 upstream.
    
    RCU re-initializes the deferred QS irq work everytime before attempting
    to queue it. However there are situations where the irq work is
    attempted to be queued even though it is already queued. In that case
    re-initializing messes-up with the irq work queue that is about to be
    handled.
    
    The chances for that to happen are higher when the architecture doesn't
    support self-IPIs and irq work are then all lazy, such as with the
    following sequence:
    
    1) rcu_read_unlock() is called when IRQs are disabled and there is a
       grace period involving blocked tasks on the node. The irq work
       is then initialized and queued.
    
    2) The related tasks are unblocked and the CPU quiescent state
       is reported. rdp->defer_qs_iw_pending is reset to DEFER_QS_IDLE,
       allowing the irq work to be requeued in the future (note the previous
       one hasn't fired yet).
    
    3) A new grace period starts and the node has blocked tasks.
    
    4) rcu_read_unlock() is called when IRQs are disabled again. The irq work
       is re-initialized (but it's queued! and its node is cleared) and
       requeued. Which means it's requeued to itself.
    
    5) The irq work finally fires with the tick. But since it was requeued
       to itself, it loops and hangs.
    
    Fix this with initializing the irq work only once before the CPU boots.
    
    Fixes: b41642c87716 ("rcu: Fix rcu_read_unlock() deadloop due to IRQ work")
    Reported-by: kernel test robot <oliver.sang@intel.com>
    Closes: https://lore.kernel.org/oe-lkp/202508071303.c1134cce-lkp@intel.com
    Signed-off-by: Frederic Weisbecker <frederic@kernel.org>
    Reviewed-by: Joel Fernandes <joelagnelf@nvidia.com>
    Signed-off-by: Neeraj Upadhyay (AMD) <neeraj.upadhyay@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

rcu: Fix rcu_read_unlock() deadloop due to IRQ work [+ + +]

Author: Joel Fernandes <joelagnelf@nvidia.com>
Date:   Tue Jul 8 10:22:19 2025 -0400

    rcu: Fix rcu_read_unlock() deadloop due to IRQ work
    
    [ Upstream commit b41642c87716bbd09797b1e4ea7d904f06c39b7b ]
    
    During rcu_read_unlock_special(), if this happens during irq_exit(), we
    can lockup if an IPI is issued. This is because the IPI itself triggers
    the irq_exit() path causing a recursive lock up.
    
    This is precisely what Xiongfeng found when invoking a BPF program on
    the trace_tick_stop() tracepoint As shown in the trace below. Fix by
    managing the irq_work state correctly.
    
    irq_exit()
      __irq_exit_rcu()
        /* in_hardirq() returns false after this */
        preempt_count_sub(HARDIRQ_OFFSET)
        tick_irq_exit()
          tick_nohz_irq_exit()
                tick_nohz_stop_sched_tick()
                  trace_tick_stop()  /* a bpf prog is hooked on this trace point */
                       __bpf_trace_tick_stop()
                          bpf_trace_run2()
                                rcu_read_unlock_special()
                                  /* will send a IPI to itself */
                                  irq_work_queue_on(&rdp->defer_qs_iw, rdp->cpu);
    
    A simple reproducer can also be obtained by doing the following in
    tick_irq_exit(). It will hang on boot without the patch:
    
      static inline void tick_irq_exit(void)
      {
     +      rcu_read_lock();
     +      WRITE_ONCE(current->rcu_read_unlock_special.b.need_qs, true);
     +      rcu_read_unlock();
     +
    
    Reported-by: Xiongfeng Wang <wangxiongfeng2@huawei.com>
    Closes: https://lore.kernel.org/all/9acd5f9f-6732-7701-6880-4b51190aa070@huawei.com/
    Tested-by: Qi Xi <xiqi2@huawei.com>
    Signed-off-by: Joel Fernandes <joelagnelf@nvidia.com>
    Reviewed-by: "Paul E. McKenney" <paulmck@kernel.org>
    Reported-by: Linux Kernel Functional Testing <lkft@linaro.org>
    [neeraj: Apply Frederic's suggested fix for PREEMPT_RT]
    Signed-off-by: Neeraj Upadhyay (AMD) <neeraj.upadhyay@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rcu: Protect ->defer_qs_iw_pending from data race [+ + +]

Author: Paul E. McKenney <paulmck@kernel.org>
Date:   Thu Apr 24 16:49:53 2025 -0700

    rcu: Protect ->defer_qs_iw_pending from data race
    
    [ Upstream commit 90c09d57caeca94e6f3f87c49e96a91edd40cbfd ]
    
    On kernels built with CONFIG_IRQ_WORK=y, when rcu_read_unlock() is
    invoked within an interrupts-disabled region of code [1], it will invoke
    rcu_read_unlock_special(), which uses an irq-work handler to force the
    system to notice when the RCU read-side critical section actually ends.
    That end won't happen until interrupts are enabled at the soonest.
    
    In some kernels, such as those booted with rcutree.use_softirq=y, the
    irq-work handler is used unconditionally.
    
    The per-CPU rcu_data structure's ->defer_qs_iw_pending field is
    updated by the irq-work handler and is both read and updated by
    rcu_read_unlock_special().  This resulted in the following KCSAN splat:
    
    ------------------------------------------------------------------------
    
    BUG: KCSAN: data-race in rcu_preempt_deferred_qs_handler / rcu_read_unlock_special
    
    read to 0xffff96b95f42d8d8 of 1 bytes by task 90 on cpu 8:
     rcu_read_unlock_special+0x175/0x260
     __rcu_read_unlock+0x92/0xa0
     rt_spin_unlock+0x9b/0xc0
     __local_bh_enable+0x10d/0x170
     __local_bh_enable_ip+0xfb/0x150
     rcu_do_batch+0x595/0xc40
     rcu_cpu_kthread+0x4e9/0x830
     smpboot_thread_fn+0x24d/0x3b0
     kthread+0x3bd/0x410
     ret_from_fork+0x35/0x40
     ret_from_fork_asm+0x1a/0x30
    
    write to 0xffff96b95f42d8d8 of 1 bytes by task 88 on cpu 8:
     rcu_preempt_deferred_qs_handler+0x1e/0x30
     irq_work_single+0xaf/0x160
     run_irq_workd+0x91/0xc0
     smpboot_thread_fn+0x24d/0x3b0
     kthread+0x3bd/0x410
     ret_from_fork+0x35/0x40
     ret_from_fork_asm+0x1a/0x30
    
    no locks held by irq_work/8/88.
    irq event stamp: 200272
    hardirqs last  enabled at (200272): [<ffffffffb0f56121>] finish_task_switch+0x131/0x320
    hardirqs last disabled at (200271): [<ffffffffb25c7859>] __schedule+0x129/0xd70
    softirqs last  enabled at (0): [<ffffffffb0ee093f>] copy_process+0x4df/0x1cc0
    softirqs last disabled at (0): [<0000000000000000>] 0x0
    
    ------------------------------------------------------------------------
    
    The problem is that irq-work handlers run with interrupts enabled, which
    means that rcu_preempt_deferred_qs_handler() could be interrupted,
    and that interrupt handler might contain an RCU read-side critical
    section, which might invoke rcu_read_unlock_special().  In the strict
    KCSAN mode of operation used by RCU, this constitutes a data race on
    the ->defer_qs_iw_pending field.
    
    This commit therefore disables interrupts across the portion of the
    rcu_preempt_deferred_qs_handler() that updates the ->defer_qs_iw_pending
    field.  This suffices because this handler is not a fast path.
    
    Signed-off-by: Paul E. McKenney <paulmck@kernel.org>
    Reviewed-by: Frederic Weisbecker <frederic@kernel.org>
    Signed-off-by: Neeraj Upadhyay (AMD) <neeraj.upadhyay@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rcutorture: Fix rcutorture_one_extend_check() splat in RT kernels [+ + +]

Author: Zqiang <qiang.zhang1211@gmail.com>
Date:   Wed May 7 19:26:03 2025 +0800

    rcutorture: Fix rcutorture_one_extend_check() splat in RT kernels
    
    [ Upstream commit 8d71351d88e478d3c4e945e3218e97ec677fd807 ]
    
    For built with CONFIG_PREEMPT_RT=y kernels, running rcutorture
    tests resulted in the following splat:
    
    [   68.797425] rcutorture_one_extend_check during change: Current 0x1  To add 0x1  To remove 0x0  preempt_count() 0x0
    [   68.797533] WARNING: CPU: 2 PID: 512 at kernel/rcu/rcutorture.c:1993 rcutorture_one_extend_check+0x419/0x560 [rcutorture]
    [   68.797601] Call Trace:
    [   68.797602]  <TASK>
    [   68.797619]  ? lockdep_softirqs_off+0xa5/0x160
    [   68.797631]  rcutorture_one_extend+0x18e/0xcc0 [rcutorture 2466dbd2ff34dbaa36049cb323a80c3306ac997c]
    [   68.797646]  ? local_clock+0x19/0x40
    [   68.797659]  rcu_torture_one_read+0xf0/0x280 [rcutorture 2466dbd2ff34dbaa36049cb323a80c3306ac997c]
    [   68.797678]  ? __pfx_rcu_torture_one_read+0x10/0x10 [rcutorture 2466dbd2ff34dbaa36049cb323a80c3306ac997c]
    [   68.797804]  ? __pfx_rcu_torture_timer+0x10/0x10 [rcutorture 2466dbd2ff34dbaa36049cb323a80c3306ac997c]
    [   68.797815] rcu-torture: rcu_torture_reader task started
    [   68.797824] rcu-torture: Creating rcu_torture_reader task
    [   68.797824]  rcu_torture_reader+0x238/0x580 [rcutorture 2466dbd2ff34dbaa36049cb323a80c3306ac997c]
    [   68.797836]  ? kvm_sched_clock_read+0x15/0x30
    
    Disable BH does not change the SOFTIRQ corresponding bits in
    preempt_count() for RT kernels, this commit therefore use
    softirq_count() to check the if BH is disabled.
    
    Reviewed-by: Paul E. McKenney <paulmck@kernel.org>
    Signed-off-by: Zqiang <qiang.zhang1211@gmail.com>
    Signed-off-by: Joel Fernandes <joelagnelf@nvidia.com>
    Signed-off-by: Neeraj Upadhyay (AMD) <neeraj.upadhyay@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

RDMA/bnxt_re: Fix size of uverbs_copy_to() in BNXT_RE_METHOD_GET_TOGGLE_MEM [+ + +]

Author: Kalesh AP <kalesh-anakkur.purayil@broadcom.com>
Date:   Fri Jul 4 10:08:55 2025 +0530

    RDMA/bnxt_re: Fix size of uverbs_copy_to() in BNXT_RE_METHOD_GET_TOGGLE_MEM
    
    [ Upstream commit 09d231ab569ca97478445ccc1ad44ab026de39b1 ]
    
    Since both "length" and "offset" are of type u32, there is
    no functional issue here.
    
    Reviewed-by: Saravanan Vajravel <saravanan.vajravel@broadcom.com>
    Signed-off-by: Shravya KN <shravya.k-n@broadcom.com>
    Signed-off-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com>
    Link: https://patch.msgid.link/20250704043857.19158-2-kalesh-anakkur.purayil@broadcom.com
    Signed-off-by: Leon Romanovsky <leon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

RDMA/core: reduce stack using in nldev_stat_get_doit() [+ + +]

Author: Arnd Bergmann <arnd@arndb.de>
Date:   Fri Jun 20 13:33:26 2025 +0200

    RDMA/core: reduce stack using in nldev_stat_get_doit()
    
    [ Upstream commit 43163f4c30f94d2103c948a247cdf2cda5068ca7 ]
    
    In the s390 defconfig, gcc-10 and earlier end up inlining three functions
    into nldev_stat_get_doit(), and each of them uses some 600 bytes of stack.
    
    The result is a function with an overly large stack frame and a warning:
    
    drivers/infiniband/core/nldev.c:2466:1: error: the frame size of 1720 bytes is larger than 1280 bytes [-Werror=frame-larger-than=]
    
    Mark the three functions noinline_for_stack to prevent this, ensuring
    that only one copy of the nlattr array is on the stack of each function.
    
    Signed-off-by: Arnd Bergmann <arnd@arndb.de>
    Link: https://patch.msgid.link/20250620113335.3776965-1-arnd@kernel.org
    Signed-off-by: Leon Romanovsky <leon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

RDMA/siw: Fix the sendmsg byte count in siw_tcp_sendpages [+ + +]

Author: Pedro Falcato <pfalcato@suse.de>
Date:   Tue Jul 29 13:03:48 2025 +0100

    RDMA/siw: Fix the sendmsg byte count in siw_tcp_sendpages
    
    commit c18646248fed07683d4cee8a8af933fc4fe83c0d upstream.
    
    Ever since commit c2ff29e99a76 ("siw: Inline do_tcp_sendpages()"),
    we have been doing this:
    
    static int siw_tcp_sendpages(struct socket *s, struct page **page, int offset,
                                 size_t size)
    [...]
            /* Calculate the number of bytes we need to push, for this page
             * specifically */
            size_t bytes = min_t(size_t, PAGE_SIZE - offset, size);
            /* If we can't splice it, then copy it in, as normal */
            if (!sendpage_ok(page[i]))
                    msg.msg_flags &= ~MSG_SPLICE_PAGES;
            /* Set the bvec pointing to the page, with len $bytes */
            bvec_set_page(&bvec, page[i], bytes, offset);
            /* Set the iter to $size, aka the size of the whole sendpages (!!!) */
            iov_iter_bvec(&msg.msg_iter, ITER_SOURCE, &bvec, 1, size);
    try_page_again:
            lock_sock(sk);
            /* Sendmsg with $size size (!!!) */
            rv = tcp_sendmsg_locked(sk, &msg, size);
    
    This means we've been sending oversized iov_iters and tcp_sendmsg calls
    for a while. This has a been a benign bug because sendpage_ok() always
    returned true. With the recent slab allocator changes being slowly
    introduced into next (that disallow sendpage on large kmalloc
    allocations), we have recently hit out-of-bounds crashes, due to slight
    differences in iov_iter behavior between the MSG_SPLICE_PAGES and
    "regular" copy paths:
    
    (MSG_SPLICE_PAGES)
    skb_splice_from_iter
      iov_iter_extract_pages
        iov_iter_extract_bvec_pages
          uses i->nr_segs to correctly stop in its tracks before OoB'ing everywhere
      skb_splice_from_iter gets a "short" read
    
    (!MSG_SPLICE_PAGES)
    skb_copy_to_page_nocache copy=iov_iter_count
     [...]
       copy_from_iter
            /* this doesn't help */
            if (unlikely(iter->count < len))
                    len = iter->count;
              iterate_bvec
                ... and we run off the bvecs
    
    Fix this by properly setting the iov_iter's byte count, plus sending the
    correct byte count to tcp_sendmsg_locked.
    
    Link: https://patch.msgid.link/r/20250729120348.495568-1-pfalcato@suse.de
    Cc: stable@vger.kernel.org
    Fixes: c2ff29e99a76 ("siw: Inline do_tcp_sendpages()")
    Reported-by: kernel test robot <oliver.sang@intel.com>
    Closes: https://lore.kernel.org/oe-lkp/202507220801.50a7210-lkp@intel.com
    Reviewed-by: David Howells <dhowells@redhat.com>
    Signed-off-by: Pedro Falcato <pfalcato@suse.de>
    Acked-by: Bernard Metzler <bernard.metzler@linux.dev>
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

RDMA: hfi1: fix possible divide-by-zero in find_hw_thread_mask() [+ + +]

Author: Yury Norov [NVIDIA] <yury.norov@gmail.com>
Date:   Wed Jun 4 15:39:38 2025 -0400

    RDMA: hfi1: fix possible divide-by-zero in find_hw_thread_mask()
    
    [ Upstream commit 59f7d2138591ef8f0e4e4ab5f1ab674e8181ad3a ]
    
    The function divides number of online CPUs by num_core_siblings, and
    later checks the divider by zero. This implies a possibility to get
    and divide-by-zero runtime error. Fix it by moving the check prior to
    division. This also helps to save one indentation level.
    
    Signed-off-by: Yury Norov [NVIDIA] <yury.norov@gmail.com>
    Link: https://patch.msgid.link/20250604193947.11834-3-yury.norov@gmail.com
    Signed-off-by: Leon Romanovsky <leon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

regmap: irq: Free the regmap-irq mutex [+ + +]

Author: Mark Brown <broonie@kernel.org>
Date:   Thu Jul 31 21:38:18 2025 +0100

    regmap: irq: Free the regmap-irq mutex
    
    [ Upstream commit 1da33858af6250184d2ef907494d698af03283de ]
    
    We do not currently free the mutex allocated by regmap-irq, do so.
    
    Tested-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk>
    Reviewed-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk>
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Link: https://patch.msgid.link/20250731-regmap-irq-nesting-v1-1-98b4d1bf20f0@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

remoteproc: imx_rproc: skip clock enable when M-core is managed by the SCU [+ + +]

Author: Hiago De Franco <hiago.franco@toradex.com>
Date:   Sun Jun 29 14:25:11 2025 -0300

    remoteproc: imx_rproc: skip clock enable when M-core is managed by the SCU
    
    [ Upstream commit 496deecb020d14ba89ba7084fbc3024f91687023 ]
    
    For the i.MX8X and i.MX8 family SoCs, when the Cortex-M core is powered
    up and started by the Cortex-A core using the bootloader (e.g., via the
    U-Boot bootaux command), both M-core and Linux run within the same SCFW
    (System Controller Firmware) partition. With that, Linux has permission
    to control the M-core.
    
    But once the M-core is started by the bootloader, the SCFW automatically
    enables its clock and sets the clock rate. If Linux later attempts to
    enable the same clock via clk_prepare_enable(), the SCFW returns a
    'LOCKED' error, as the clock is already configured by the SCFW. This
    causes the probe function in imx_rproc.c to fail, leading to the M-core
    power domain being shut down while the core is still running. This
    results in a fault from the SCU (System Controller Unit) and triggers a
    system reset.
    
    To address this issue, ignore handling the clk for i.MX8X and i.MX8
    M-core, as SCFW already takes care of enabling and configuring the
    clock.
    
    Suggested-by: Peng Fan <peng.fan@nxp.com>
    Reviewed-by: Ulf Hansson <ulf.hansson@linaro.org>
    Reviewed-by: Peng Fan <peng.fan@nxp.com>
    Signed-off-by: Hiago De Franco <hiago.franco@toradex.com>
    Acked-by: Mathieu Poirier <mathieu.poirier@linaro.org>
    Link: https://lore.kernel.org/r/20250629172512.14857-3-hiagofranco@gmail.com
    Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

reset: brcmstb: Enable reset drivers for ARCH_BCM2835 [+ + +]

Author: Peter Robinson <pbrobinson@gmail.com>
Date:   Mon Jun 30 18:52:58 2025 +0100

    reset: brcmstb: Enable reset drivers for ARCH_BCM2835
    
    [ Upstream commit 1d99f92f71b6b4b2eee776562c991428490f71ef ]
    
    The BRCMSTB and BRCMSTB_RESCAL reset drivers are also
    used in the BCM2712, AKA the RPi5. The RPi platforms
    have typically used the ARCH_BCM2835, and the PCIe
    support for this SoC can use this config which depends
    on these drivers so enable building them when just that
    arch option is enabled to ensure the platform works as
    expected.
    
    Signed-off-by: Peter Robinson <pbrobinson@gmail.com>
    Acked-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://lore.kernel.org/r/20250630175301.846082-1-pbrobinson@gmail.com
    Signed-off-by: Philipp Zabel <p.zabel@pengutronix.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Revert "gpio: mlxbf3: only get IRQ for device instance 0" [+ + +]

Author: David Thompson <davthompson@nvidia.com>
Date:   Mon Aug 11 13:50:44 2025 -0400

    Revert "gpio: mlxbf3: only get IRQ for device instance 0"
    
    commit 56bdf7270ff4f870e2d4bfacdc00161e766dba2d upstream.
    
    This reverts commit 10af0273a35ab4513ca1546644b8c853044da134.
    
    While this change was merged, it is not the preferred solution.
    During review of a similar change to the gpio-mlxbf2 driver, the
    use of "platform_get_irq_optional" was identified as the preferred
    solution, so let's use it for gpio-mlxbf3 driver as well.
    
    Cc: stable@vger.kernel.org
    Fixes: 10af0273a35a ("gpio: mlxbf3: only get IRQ for device instance 0")
    Signed-off-by: David Thompson <davthompson@nvidia.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://lore.kernel.org/r/8d2b630c71b3742f2c74242cf7d602706a6108e6.1754928650.git.davthompson@nvidia.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Revert "gpio: pxa: Make irq_chip immutable" [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
Date:   Fri Aug 1 09:18:58 2025 +0200

    Revert "gpio: pxa: Make irq_chip immutable"
    
    commit b644c640923b625340c603cdb8d8f456406eb4de upstream.
    
    This reverts commit 20117cf426b6 ("gpio: pxa: Make irq_chip immutableas")
    as it caused a regression on samsung coreprimevelte and we've not been
    able to fix it so far.
    
    Cc: stable@vger.kernel.org # v6.16
    Fixes: 20117cf426b6 ("gpio: pxa: Make irq_chip immutableas")
    Reported-by: Duje Mihanović <duje@dujemihanovic.xyz>
    Closes: https://lore.kernel.org/all/3367665.aeNJFYEL58@radijator/
    Tested-by: Duje Mihanović <duje@dujemihanovic.xyz>
    Link: https://lore.kernel.org/r/20250801071858.7554-1-brgl@bgdev.pl
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Revert "leds: trigger: netdev: Configure LED blink interval for HW offload" [+ + +]

Author: Daniel Golle <daniel@makrotopia.org>
Date:   Sat Jul 12 16:39:21 2025 +0100

    Revert "leds: trigger: netdev: Configure LED blink interval for HW offload"
    
    commit 26f732791f2bcab18f59c61915bbe35225f30136 upstream.
    
    This reverts commit c629c972b310af41e9e072febb6dae9a299edde6.
    
    While .led_blink_set() would previously put an LED into an unconditional
    permanently blinking state, the offending commit now uses same operation
    to (also?) set the blink timing of the netdev trigger when offloading.
    
    This breaks many if not all of the existing PHY drivers which offer
    offloading LED operations, as those drivers would just put the LED into
    blinking state after .led_blink_set() has been called.
    
    Unfortunately the change even made it into stable kernels for unknown
    reasons, so it should be reverted there as well.
    
    Fixes: c629c972b310a ("leds: trigger: netdev: Configure LED blink interval for HW offload")
    Link: https://lore.kernel.org/linux-leds/c6134e26-2e45-4121-aa15-58aaef327201@lunn.ch/T/#m9d6fe81bbcb273e59f12bbedbd633edd32118387
    Suggested-by: Andrew Lunn <andrew@lunn.ch>
    Cc: stable@vger.kernel.org
    Signed-off-by: Daniel Golle <daniel@makrotopia.org>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://lore.kernel.org/r/6dcc77ee1c9676891d6250d8994850f521426a0f.1752334655.git.daniel@makrotopia.org
    Signed-off-by: Lee Jones <lee@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

riscv: dts: thead: Add APB clocks for TH1520 GMACs [+ + +]

Author: Yao Zi <ziyao@disroot.org>
Date:   Fri Aug 8 09:36:56 2025 +0000

    riscv: dts: thead: Add APB clocks for TH1520 GMACs
    
    [ Upstream commit a7f75e2883c4bd57b12c3be61bb926929adad9c0 ]
    
    Describe perisys-apb4-hclk as the APB clock for TH1520 SoC, which is
    essential for accessing GMAC glue registers.
    
    Fixes: 7e756671a664 ("riscv: dts: thead: Add TH1520 ethernet nodes")
    Signed-off-by: Yao Zi <ziyao@disroot.org>
    Reviewed-by: Drew Fustini <fustini@kernel.org>
    Tested-by: Drew Fustini <fustini@kernel.org>
    Link: https://patch.msgid.link/20250808093655.48074-5-ziyao@disroot.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rtc: ds1307: handle oscillator stop flag (OSF) for ds1341 [+ + +]

Author: Meagan Lloyd <meaganlloyd@linux.microsoft.com>
Date:   Wed Jun 11 11:14:16 2025 -0700

    rtc: ds1307: handle oscillator stop flag (OSF) for ds1341
    
    [ Upstream commit 523923cfd5d622b8f4ba893fdaf29fa6adeb8c3e ]
    
    In using CONFIG_RTC_HCTOSYS, rtc_hctosys() will sync the RTC time to the
    kernel time as long as rtc_read_time() succeeds. In some power loss
    situations, our supercapacitor-backed DS1342 RTC comes up with either an
    unpredictable future time or the default 01/01/00 from the datasheet.
    The oscillator stop flag (OSF) is set in these scenarios due to the
    power loss and can be used to determine the validity of the RTC data.
    
    This change expands the oscillator stop flag (OSF) handling that has
    already been implemented for some chips to the ds1341 chip (DS1341 and
    DS1342 share a datasheet). This handling manages the validity of the RTC
    data in .read_time and .set_time based on the OSF.
    
    Signed-off-by: Meagan Lloyd <meaganlloyd@linux.microsoft.com>
    Reviewed-by: Tyler Hicks <code@tyhicks.com>
    Acked-by: Rodolfo Giometti <giometti@enneenne.com>
    Link: https://lore.kernel.org/r/1749665656-30108-3-git-send-email-meaganlloyd@linux.microsoft.com
    Signed-off-by: Alexandre Belloni <alexandre.belloni@bootlin.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rtc: ds1307: remove clear of oscillator stop flag (OSF) in probe [+ + +]

Author: Meagan Lloyd <meaganlloyd@linux.microsoft.com>
Date:   Wed Jun 11 11:14:15 2025 -0700

    rtc: ds1307: remove clear of oscillator stop flag (OSF) in probe
    
    [ Upstream commit 48458654659c9c2e149c211d86637f1592470da5 ]
    
    In using CONFIG_RTC_HCTOSYS, rtc_hctosys() will sync the RTC time to the
    kernel time as long as rtc_read_time() succeeds. In some power loss
    situations, our supercapacitor-backed DS1342 RTC comes up with either an
    unpredictable future time or the default 01/01/00 from the datasheet.
    The oscillator stop flag (OSF) is set in these scenarios due to the
    power loss and can be used to determine the validity of the RTC data.
    
    Some chip types in the ds1307 driver already have OSF handling to
    determine whether .read_time provides valid RTC data or returns -EINVAL.
    
    This change removes the clear of the OSF in .probe as the OSF needs to
    be preserved to expand the OSF handling to the ds1341 chip type (note
    that DS1341 and DS1342 share a datasheet).
    
    Signed-off-by: Meagan Lloyd <meaganlloyd@linux.microsoft.com>
    Reviewed-by: Tyler Hicks <code@tyhicks.com>
    Acked-by: Rodolfo Giometti <giometti@enneenne.com>
    Link: https://lore.kernel.org/r/1749665656-30108-2-git-send-email-meaganlloyd@linux.microsoft.com
    Signed-off-by: Alexandre Belloni <alexandre.belloni@bootlin.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rust: kbuild: clean output before running `rustdoc` [+ + +]

Author: Miguel Ojeda <ojeda@kernel.org>
Date:   Sat Jul 26 15:34:35 2025 +0200

    rust: kbuild: clean output before running `rustdoc`
    
    commit 252fea131e15aba2cd487119d1a8f546471199e2 upstream.
    
    `rustdoc` can get confused when generating documentation into a folder
    that contains generated files from other `rustdoc` versions.
    
    For instance, running something like:
    
        rustup default 1.78.0
        make LLVM=1 rustdoc
        rustup default 1.88.0
        make LLVM=1 rustdoc
    
    may generate errors like:
    
        error: couldn't generate documentation: invalid template: last line expected to start with a comment
          |
          = note: failed to create or modify "./Documentation/output/rust/rustdoc/src-files.js"
    
    Thus just always clean the output folder before generating the
    documentation -- we are anyway regenerating it every time the `rustdoc`
    target gets called, at least for the time being.
    
    Cc: stable@vger.kernel.org # Needed in 6.12.y and later (Rust is pinned in older LTSs).
    Reported-by: Daniel Almeida <daniel.almeida@collabora.com>
    Closes: https://rust-for-linux.zulipchat.com/#narrow/channel/288089/topic/x/near/527201113
    Reviewed-by: Tamir Duberstein <tamird@kernel.org>
    Link: https://lore.kernel.org/r/20250726133435.2460085-1-ojeda@kernel.org
    Signed-off-by: Miguel Ojeda <ojeda@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

rust: workaround `rustdoc` target modifiers bug [+ + +]

Author: Miguel Ojeda <ojeda@kernel.org>
Date:   Sun Jul 27 11:23:17 2025 +0200

    rust: workaround `rustdoc` target modifiers bug
    
    commit abbf9a44944171ca99c150adad9361a2f517d3b6 upstream.
    
    Starting with Rust 1.88.0 (released 2025-06-26), `rustdoc` complains
    about a target modifier mismatch in configurations where `-Zfixed-x18`
    is passed:
    
        error: mixing `-Zfixed-x18` will cause an ABI mismatch in crate `rust_out`
          |
          = help: the `-Zfixed-x18` flag modifies the ABI so Rust crates compiled with different values of this flag cannot be used together safely
          = note: unset `-Zfixed-x18` in this crate is incompatible with `-Zfixed-x18=` in dependency `core`
          = help: set `-Zfixed-x18=` in this crate or unset `-Zfixed-x18` in `core`
          = help: if you are sure this will not cause problems, you may use `-Cunsafe-allow-abi-mismatch=fixed-x18` to silence this error
    
    The reason is that `rustdoc` was not passing the target modifiers when
    configuring the session options, and thus it would report a mismatch
    that did not exist as soon as a target modifier is used in a dependency.
    
    We did not notice it in the kernel until now because `-Zfixed-x18` has
    been a target modifier only since 1.88.0 (and it is the only one we use
    so far).
    
    The issue has been reported upstream [1] and a fix has been submitted
    [2], including a test similar to the kernel case.
    
      [ This is now fixed upstream (thanks Guillaume for the quick review),
        so it will be fixed in Rust 1.90.0 (expected 2025-09-18).
    
          - Miguel ]
    
    Meanwhile, conditionally pass `-Cunsafe-allow-abi-mismatch=fixed-x18`
    to workaround the issue on our side.
    
    Cc: stable@vger.kernel.org # Needed in 6.12.y and later (Rust is pinned in older LTSs).
    Reported-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Closes: https://lore.kernel.org/rust-for-linux/36cdc798-524f-4910-8b77-d7b9fac08d77@oss.qualcomm.com/
    Link: https://github.com/rust-lang/rust/issues/144521 [1]
    Link: https://github.com/rust-lang/rust/pull/144523 [2]
    Reviewed-by: Alice Ryhl <aliceryhl@google.com>
    Link: https://lore.kernel.org/r/20250727092317.2930617-1-ojeda@kernel.org
    Signed-off-by: Miguel Ojeda <ojeda@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

rv: Add #undef TRACE_INCLUDE_FILE [+ + +]

Author: Nam Cao <namcao@linutronix.de>
Date:   Wed Jul 9 21:21:12 2025 +0200

    rv: Add #undef TRACE_INCLUDE_FILE
    
    [ Upstream commit 2d088762631b212eb0809e112642843844ef64eb ]
    
    Without "#undef TRACE_INCLUDE_FILE", there could be a build error due to
    TRACE_INCLUDE_FILE being redefined. Therefore add it.
    
    Also fix a typo while at it.
    
    Cc: John Ogness <john.ogness@linutronix.de>
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Link: https://lore.kernel.org/f805e074581e927bb176c742c981fa7675b6ebe5.1752088709.git.namcao@linutronix.de
    Reviewed-by: Gabriele Monaco <gmonaco@redhat.com>
    Signed-off-by: Nam Cao <namcao@linutronix.de>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

s390/early: Copy last breaking event address to pt_regs [+ + +]

Author: Heiko Carstens <hca@linux.ibm.com>
Date:   Tue Jul 1 18:12:06 2025 +0200

    s390/early: Copy last breaking event address to pt_regs
    
    [ Upstream commit 7cf636c99b257c1b4b12066ab34fd5f06e8d892f ]
    
    In case of an early crash the early program check handler also prints the
    last breaking event address which is contained within the pt_regs
    structure. However it is not initialized, and therefore a more or less
    random value is printed in case of a crash.
    
    Copy the last breaking event address from lowcore to pt_regs in case of an
    early program check to address this. This also makes it easier to analyze
    early crashes.
    
    Reviewed-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Heiko Carstens <hca@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

s390/sclp: Use monotonic clock in sclp_sync_wait() [+ + +]

Author: Sven Schnelle <svens@linux.ibm.com>
Date:   Thu Jul 10 09:43:40 2025 +0200

    s390/sclp: Use monotonic clock in sclp_sync_wait()
    
    [ Upstream commit 925f0707a67cae0a974c4bd5b718f0263dc56824 ]
    
    sclp_sync_wait() should use the monotonic clock for the delay loop.
    Otherwise the code won't work correctly when the clock is changed.
    
    Signed-off-by: Sven Schnelle <svens@linux.ibm.com>
    Reviewed-by: Heiko Carstens <hca@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

s390/stp: Remove udelay from stp_sync_clock() [+ + +]

Author: Sven Schnelle <svens@linux.ibm.com>
Date:   Thu Jul 3 13:50:27 2025 +0200

    s390/stp: Remove udelay from stp_sync_clock()
    
    [ Upstream commit b367017cdac21781a74eff4e208d3d38e1f38d3f ]
    
    When an stp sync check is handled on a system with multiple
    cpus each cpu gets a machine check but only the first one
    actually handles the sync operation. All other CPUs spin
    waiting for the first one to finish with a short udelay().
    But udelay can't be used here as the first CPU modifies tod_clock_base
    before performing the sync op. During this timeframe
    get_tod_clock_monotonic() might return a non-monotonic time.
    
    The time spent waiting should be very short and udelay is a busy loop
    anyways, therefore simply remove the udelay.
    
    Reviewed-by: Heiko Carstens <hca@linux.ibm.com>
    Signed-off-by: Sven Schnelle <svens@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

s390/time: Use monotonic clock in get_cycles() [+ + +]

Author: Sven Schnelle <svens@linux.ibm.com>
Date:   Thu Jul 10 09:42:29 2025 +0200

    s390/time: Use monotonic clock in get_cycles()
    
    [ Upstream commit 09e7e29d2b49ba84bcefb3dc1657726d2de5bb24 ]
    
    Otherwise the code might not work correctly when the clock
    is changed.
    
    Signed-off-by: Sven Schnelle <svens@linux.ibm.com>
    Reviewed-by: Heiko Carstens <hca@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

samples/damon/mtier: support boot time enable setup [+ + +]

Author: SeongJae Park <sj@kernel.org>
Date:   Sun Jul 6 12:32:04 2025 -0700

    samples/damon/mtier: support boot time enable setup
    
    commit 964314344eab7bc43e38a32be281c5ea0609773b upstream.
    
    If 'enable' parameter of the 'mtier' DAMON sample module is set at boot
    time via the kernel command line, memory allocation is tried before the
    slab is initialized.  As a result kernel NULL pointer dereference BUG can
    happen.  Fix it by checking the initialization status.
    
    Link: https://lkml.kernel.org/r/20250706193207.39810-4-sj@kernel.org
    Fixes: 82a08bde3cf7 ("samples/damon: implement a DAMON module for memory tiering")
    Signed-off-by: SeongJae Park <sj@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

samples/damon/wsse: fix boot time enable handling [+ + +]

Author: SeongJae Park <sj@kernel.org>
Date:   Sun Jul 6 12:32:02 2025 -0700

    samples/damon/wsse: fix boot time enable handling
    
    commit 0ed1165c37277822b519f519d0982d36efc30006 upstream.
    
    Patch series "mm/damon: fix misc bugs in DAMON modules".
    
    From manual code review, I found below bugs in DAMON modules.
    
    DAMON sample modules crash if those are enabled at boot time, via kernel
    command line.  A similar issue was found and fixed on DAMON non-sample
    modules in the past, but we didn't check that for sample modules.
    
    DAMON non-sample modules are not setting 'enabled' parameters accordingly
    when real enabling is failed.  Honggyu found and fixed[1] this type of
    bugs in DAMON sample modules, and my inspection was motivated by the great
    work.  Kudos to Honggyu.
    
    Finally, DAMON_RECLIAM is mistakenly losing scheme internal status due to
    misuse of damon_commit_ctx().  DAMON_LRU_SORT has a similar misuse, but
    fortunately it is not causing real status loss.
    
    Fix the bugs.  Since these are similar patterns of bugs that were found in
    the past, it would be better to add tests or refactor the code, in future.
    
    
    This patch (of 6):
    
    If 'enable' parameter of the 'wsse' DAMON sample module is set at boot
    time via the kernel command line, memory allocation is tried before the
    slab is initialized.  As a result kernel NULL pointer dereference BUG can
    happen.  Fix it by checking the initialization status.
    
    Link: https://lkml.kernel.org/r/20250706193207.39810-1-sj@kernel.org
    Link: https://lkml.kernel.org/r/20250706193207.39810-2-sj@kernel.org
    Link: https://lore.kernel.org/20250702000205.1921-1-honggyu.kim@sk.com [1]
    Fixes: b757c6cfc696 ("samples/damon/wsse: start and stop DAMON as the user requests")
    Signed-off-by: SeongJae Park <sj@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

sched/deadline: Fix accounting after global limits change [+ + +]

Author: Juri Lelli <juri.lelli@redhat.com>
Date:   Fri Jun 27 13:51:16 2025 +0200

    sched/deadline: Fix accounting after global limits change
    
    [ Upstream commit 440989c10f4e32620e9e2717ca52c3ed7ae11048 ]
    
    A global limits change (sched_rt_handler() logic) currently leaves stale
    and/or incorrect values in variables related to accounting (e.g.
    extra_bw).
    
    Properly clean up per runqueue variables before implementing the change
    and rebuild scheduling domains (so that accounting is also properly
    restored) after such a change is complete.
    
    Reported-by: Marcel Ziswiler <marcel.ziswiler@codethink.co.uk>
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
    Tested-by: Marcel Ziswiler <marcel.ziswiler@codethink.co.uk> # nuc & rock5b
    Link: https://lore.kernel.org/r/20250627115118.438797-4-juri.lelli@redhat.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

sched/fair: Bump sd->max_newidle_lb_cost when newidle balance fails [+ + +]

Author: Chris Mason <clm@fb.com>
Date:   Thu Jun 26 07:39:10 2025 -0700

    sched/fair: Bump sd->max_newidle_lb_cost when newidle balance fails
    
    [ Upstream commit 155213a2aed42c85361bf4f5c817f5cb68951c3b ]
    
    schbench (https://github.com/masoncl/schbench.git) is showing a
    regression from previous production kernels that bisected down to:
    
    sched/fair: Remove sysctl_sched_migration_cost condition (c5b0a7eefc)
    
    The schbench command line was:
    
    schbench -L -m 4 -M auto -t 256 -n 0 -r 0 -s 0
    
    This creates 4 message threads pinned to CPUs 0-3, and 256x4 worker
    threads spread across the rest of the CPUs.  Neither the worker threads
    or the message threads do any work, they just wake each other up and go
    back to sleep as soon as possible.
    
    The end result is the first 4 CPUs are pegged waking up those 1024
    workers, and the rest of the CPUs are constantly banging in and out of
    idle.  If I take a v6.9 Linus kernel and revert that one commit,
    performance goes from 3.4M RPS to 5.4M RPS.
    
    schedstat shows there are ~100x  more new idle balance operations, and
    profiling shows the worker threads are spending ~20% of their CPU time
    on new idle balance.  schedstats also shows that almost all of these new
    idle balance attemps are failing to find busy groups.
    
    The fix used here is to crank up the cost of the newidle balance whenever it
    fails.  Since we don't want sd->max_newidle_lb_cost to grow out of
    control, this also changes update_newidle_cost() to use
    sysctl_sched_migration_cost as the upper limit on max_newidle_lb_cost.
    
    Signed-off-by: Chris Mason <clm@fb.com>
    Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
    Acked-by: Vincent Guittot <vincent.guittot@linaro.org>
    Link: https://lkml.kernel.org/r/20250626144017.1510594-2-clm@fb.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: aacraid: Stop using PCI_IRQ_AFFINITY [+ + +]

Author: John Garry <john.g.garry@oracle.com>
Date:   Tue Jul 15 11:15:35 2025 +0000

    scsi: aacraid: Stop using PCI_IRQ_AFFINITY
    
    [ Upstream commit dafeaf2c03e71255438ffe5a341d94d180e6c88e ]
    
    When PCI_IRQ_AFFINITY is set for calling pci_alloc_irq_vectors(), it
    means interrupts are spread around the available CPUs. It also means that
    the interrupts become managed, which means that an interrupt is shutdown
    when all the CPUs in the interrupt affinity mask go offline.
    
    Using managed interrupts in this way means that we should ensure that
    completions should not occur on HW queues where the associated interrupt
    is shutdown. This is typically achieved by ensuring only CPUs which are
    online can generate IO completion traffic to the HW queue which they are
    mapped to (so that they can also serve completion interrupts for that HW
    queue).
    
    The problem in the driver is that a CPU can generate completions to a HW
    queue whose interrupt may be shutdown, as the CPUs in the HW queue
    interrupt affinity mask may be offline. This can cause IOs to never
    complete and hang the system. The driver maintains its own CPU <-> HW
    queue mapping for submissions, see aac_fib_vector_assign(), but this does
    not reflect the CPU <-> HW queue interrupt affinity mapping.
    
    Commit 9dc704dcc09e ("scsi: aacraid: Reply queue mapping to CPUs based on
    IRQ affinity") tried to remedy this issue may mapping CPUs properly to HW
    queue interrupts. However this was later reverted in commit c5becf57dd56
    ("Revert "scsi: aacraid: Reply queue mapping to CPUs based on IRQ
    affinity") - it seems that there were other reports of hangs. I guess
    that this was due to some implementation issue in the original commit or
    maybe a HW issue.
    
    Fix the very original hang by just not using managed interrupts by not
    setting PCI_IRQ_AFFINITY.  In this way, all CPUs will be in each HW queue
    affinity mask, so should not create completion problems if any CPUs go
    offline.
    
    Signed-off-by: John Garry <john.g.garry@oracle.com>
    Link: https://lore.kernel.org/r/20250715111535.499853-1-john.g.garry@oracle.com
    Closes: https://lore.kernel.org/linux-scsi/20250618192427.3845724-1-jmeneghi@redhat.com/
    Reviewed-by: John Meneghini <jmeneghi@redhat.com>
    Tested-by: John Meneghini <jmeneghi@redhat.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: bfa: Double-free fix [+ + +]

Author: jackysliu <1972843537@qq.com>
Date:   Tue Jun 24 19:58:24 2025 +0800

    scsi: bfa: Double-free fix
    
    [ Upstream commit add4c4850363d7c1b72e8fce9ccb21fdd2cf5dc9 ]
    
    When the bfad_im_probe() function fails during initialization, the memory
    pointed to by bfad->im is freed without setting bfad->im to NULL.
    
    Subsequently, during driver uninstallation, when the state machine enters
    the bfad_sm_stopping state and calls the bfad_im_probe_undo() function,
    it attempts to free the memory pointed to by bfad->im again, thereby
    triggering a double-free vulnerability.
    
    Set bfad->im to NULL if probing fails.
    
    Signed-off-by: jackysliu <1972843537@qq.com>
    Link: https://lore.kernel.org/r/tencent_3BB950D6D2D470976F55FC879206DE0B9A09@qq.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: Fix sas_user_scan() to handle wildcard and multi-channel scans [+ + +]

Author: Ranjan Kumar <ranjan.kumar@broadcom.com>
Date:   Tue Jun 24 11:46:49 2025 +0530

    scsi: Fix sas_user_scan() to handle wildcard and multi-channel scans
    
    [ Upstream commit 37c4e72b0651e7697eb338cd1fb09feef472cc1a ]
    
    sas_user_scan() did not fully process wildcard channel scans
    (SCAN_WILD_CARD) when a transport-specific user_scan() callback was
    present. Only channel 0 would be scanned via user_scan(), while the
    remaining channels were skipped, potentially missing devices.
    
    user_scan() invokes updated sas_user_scan() for channel 0, and if
    successful, iteratively scans remaining channels (1 to
    shost->max_channel) via scsi_scan_host_selected().  This ensures complete
    wildcard scanning without affecting transport-specific scanning behavior.
    
    Signed-off-by: Ranjan Kumar <ranjan.kumar@broadcom.com>
    Link: https://lore.kernel.org/r/20250624061649.17990-1-ranjan.kumar@broadcom.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: libiscsi: Initialize iscsi_conn->dd_data only if memory is allocated [+ + +]

Author: Showrya M N <showrya@chelsio.com>
Date:   Fri Jun 27 16:53:29 2025 +0530

    scsi: libiscsi: Initialize iscsi_conn->dd_data only if memory is allocated
    
    [ Upstream commit 3ea3a256ed81f95ab0f3281a0e234b01a9cae605 ]
    
    In case of an ib_fast_reg_mr allocation failure during iSER setup, the
    machine hits a panic because iscsi_conn->dd_data is initialized
    unconditionally, even when no memory is allocated (dd_size == 0).  This
    leads invalid pointer dereference during connection teardown.
    
    Fix by setting iscsi_conn->dd_data only if memory is actually allocated.
    
    Panic trace:
    ------------
     iser: iser_create_fastreg_desc: Failed to allocate ib_fast_reg_mr err=-12
     iser: iser_alloc_rx_descriptors: failed allocating rx descriptors / data buffers
     BUG: unable to handle page fault for address: fffffffffffffff8
     RIP: 0010:swake_up_locked.part.5+0xa/0x40
     Call Trace:
      complete+0x31/0x40
      iscsi_iser_conn_stop+0x88/0xb0 [ib_iser]
      iscsi_stop_conn+0x66/0xc0 [scsi_transport_iscsi]
      iscsi_if_stop_conn+0x14a/0x150 [scsi_transport_iscsi]
      iscsi_if_rx+0x1135/0x1834 [scsi_transport_iscsi]
      ? netlink_lookup+0x12f/0x1b0
      ? netlink_deliver_tap+0x2c/0x200
      netlink_unicast+0x1ab/0x280
      netlink_sendmsg+0x257/0x4f0
      ? _copy_from_user+0x29/0x60
      sock_sendmsg+0x5f/0x70
    
    Signed-off-by: Showrya M N <showrya@chelsio.com>
    Signed-off-by: Potnuri Bharat Teja <bharat@chelsio.com>
    Link: https://lore.kernel.org/r/20250627112329.19763-1-showrya@chelsio.com
    Reviewed-by: Chris Leech <cleech@redhat.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: lpfc: Check for hdwq null ptr when cleaning up lpfc_vport structure [+ + +]

Author: Justin Tee <justin.tee@broadcom.com>
Date:   Wed Jun 18 12:21:28 2025 -0700

    scsi: lpfc: Check for hdwq null ptr when cleaning up lpfc_vport structure
    
    [ Upstream commit 6698796282e828733cde3329c887b4ae9e5545e9 ]
    
    If a call to lpfc_sli4_read_rev() from lpfc_sli4_hba_setup() fails, the
    resultant cleanup routine lpfc_sli4_vport_delete_fcp_xri_aborted() may
    occur before sli4_hba.hdwqs are allocated.  This may result in a null
    pointer dereference when attempting to take the abts_io_buf_list_lock for
    the first hardware queue.  Fix by adding a null ptr check on
    phba->sli4_hba.hdwq and early return because this situation means there
    must have been an error during port initialization.
    
    Signed-off-by: Justin Tee <justin.tee@broadcom.com>
    Link: https://lore.kernel.org/r/20250618192138.124116-4-justintee8345@gmail.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: lpfc: Ensure HBA_SETUP flag is used only for SLI4 in dev_loss_tmo_callbk [+ + +]

Author: Justin Tee <justin.tee@broadcom.com>
Date:   Wed Jun 18 12:21:33 2025 -0700

    scsi: lpfc: Ensure HBA_SETUP flag is used only for SLI4 in dev_loss_tmo_callbk
    
    [ Upstream commit 1cced5779e7a3ff7ec025fc47c76a7bd3bb38877 ]
    
    For SLI3, the HBA_SETUP flag is never set so the lpfc_dev_loss_tmo_callbk
    always early returns.  Add a phba->sli_rev check for SLI4 mode so that
    the SLI3 path can flow through the original dev_loss_tmo worker thread
    design to lpfc_dev_loss_tmo_handler instead of early return.
    
    Signed-off-by: Justin Tee <justin.tee@broadcom.com>
    Link: https://lore.kernel.org/r/20250618192138.124116-9-justintee8345@gmail.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: lpfc: Remove redundant assignment to avoid memory leak [+ + +]

Author: Jiasheng Jiang <jiashengjiangcool@gmail.com>
Date:   Fri Aug 1 18:52:02 2025 +0000

    scsi: lpfc: Remove redundant assignment to avoid memory leak
    
    [ Upstream commit eea6cafb5890db488fce1c69d05464214616d800 ]
    
    Remove the redundant assignment if kzalloc() succeeds to avoid memory
    leak.
    
    Fixes: bd2cdd5e400f ("scsi: lpfc: NVME Initiator: Add debugfs support")
    Signed-off-by: Jiasheng Jiang <jiashengjiangcool@gmail.com>
    Link: https://lore.kernel.org/r/20250801185202.42631-1-jiashengjiangcool@gmail.com
    Reviewed-by: Justin Tee <justin.tee@broadcom.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: mpi3mr: Correctly handle ATA device errors [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Fri Jun 6 14:27:46 2025 +0900

    scsi: mpi3mr: Correctly handle ATA device errors
    
    [ Upstream commit 04caad5a7ba86e830d04750417a15bad8ac2613c ]
    
    With the ATA error model, an NCQ command failure always triggers an abort
    (termination) of all NCQ commands queued on the device. In such case, the
    SAT or the host must handle the failed command according to the command
    sense data and immediately retry all other NCQ commands that were aborted
    due to the failed NCQ command.
    
    For SAS HBAs controlled by the mpi3mr driver, NCQ command aborts are not
    handled by the HBA SAT and sent back to the host, with an ioc log
    information equal to 0x31080000 (IOC_LOGINFO_PREFIX_PL with the PL code
    PL_LOGINFO_CODE_SATA_NCQ_FAIL_ALL_CMDS_AFTR_ERR). The function
    mpi3mr_process_op_reply_desc() always forces a retry of commands
    terminated with the status MPI3_IOCSTATUS_SCSI_IOC_TERMINATED using the
    SCSI result DID_SOFT_ERROR, regardless of the ioc_loginfo for the
    command. This correctly forces the retry of collateral NCQ abort
    commands, but with the retry counter for the command being incremented.
    If a command to an ATA device is subject to too many retries due to other
    NCQ commands failing (e.g. read commands trying to access unreadable
    sectors), the collateral NCQ abort commands may be terminated with an
    error as they run out of retries. This violates the SAT specification and
    causes hard-to-debug command errors.
    
    Solve this issue by modifying the handling of the
    MPI3_IOCSTATUS_SCSI_IOC_TERMINATED status to check if a command is for an
    ATA device and if the command ioc_loginfo indicates an NCQ collateral
    abort. If that is the case, force the command retry using the SCSI result
    DID_IMM_RETRY to avoid incrementing the command retry count.
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Link: https://lore.kernel.org/r/20250606052747.742998-2-dlemoal@kernel.org
    Tested-by: Yafang Shao <laoar.shao@gmail.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: mpt3sas: Correctly handle ATA device errors [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Fri Jun 6 14:27:47 2025 +0900

    scsi: mpt3sas: Correctly handle ATA device errors
    
    [ Upstream commit 15592a11d5a5c8411ac8494ec49736b658f6fbff ]
    
    With the ATA error model, an NCQ command failure always triggers an abort
    (termination) of all NCQ commands queued on the device. In such case, the
    SAT or the host must handle the failed command according to the command
    sense data and immediately retry all other NCQ commands that were aborted
    due to the failed NCQ command.
    
    For SAS HBAs controlled by the mpt3sas driver, NCQ command aborts are not
    handled by the HBA SAT and sent back to the host, with an ioc log
    information equal to 0x31080000 (IOC_LOGINFO_PREFIX_PL with the PL code
    PL_LOGINFO_CODE_SATA_NCQ_FAIL_ALL_CMDS_AFTR_ERR). The function
    _scsih_io_done() always forces a retry of commands terminated with the
    status MPI2_IOCSTATUS_SCSI_IOC_TERMINATED using the SCSI result
    DID_SOFT_ERROR, regardless of the log_info for the command.  This
    correctly forces the retry of collateral NCQ abort commands, but with the
    retry counter for the command being incremented. If a command to an ATA
    device is subject to too many retries due to other NCQ commands failing
    (e.g. read commands trying to access unreadable sectors), the collateral
    NCQ abort commands may be terminated with an error as they run out of
    retries. This violates the SAT specification and causes hard-to-debug
    command errors.
    
    Solve this issue by modifying the handling of the
    MPI2_IOCSTATUS_SCSI_IOC_TERMINATED status to check if a command is for an
    ATA device and if the command loginfo indicates an NCQ collateral
    abort. If that is the case, force the command retry using the SCSI result
    DID_IMM_RETRY to avoid incrementing the command retry count.
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Link: https://lore.kernel.org/r/20250606052747.742998-3-dlemoal@kernel.org
    Tested-by: Yafang Shao <laoar.shao@gmail.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: pm80xx: Free allocated tags after failure [+ + +]

Author: Francisco Gutierrez <frankramirez@google.com>
Date:   Tue Jun 17 21:04:43 2025 +0000

    scsi: pm80xx: Free allocated tags after failure
    
    [ Upstream commit 258a0a19621793b811356fc9d1849f950629d669 ]
    
    This change frees resources after an error is detected.
    
    Signed-off-by: Francisco Gutierrez <frankramirez@google.com>
    Link: https://lore.kernel.org/r/20250617210443.989058-1-frankramirez@google.com
    Acked-by: Jack Wang <jinpu.wang@ionos.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: target: core: Generate correct identifiers for PR OUT transport IDs [+ + +]

Author: Maurizio Lombardi <mlombard@redhat.com>
Date:   Mon Jul 14 15:37:38 2025 +0200

    scsi: target: core: Generate correct identifiers for PR OUT transport IDs
    
    [ Upstream commit 6e0f6aa44b68335df404a2df955055f416b5f2aa ]
    
    Fix target_parse_pr_out_transport_id() to return a string representing
    the transport ID in a human-readable format (e.g., naa.xxxxxxxx...)  for
    various SCSI protocol types (SAS, FCP, SRP, SBP).
    
    Previously, the function returned a pointer to the raw binary buffer,
    which was incorrectly compared against human-readable strings, causing
    comparisons to fail.  Now, the function writes a properly formatted
    string into a buffer provided by the caller.  The output format depends
    on the transport protocol:
    
    * SAS: 64-bit identifier, "naa." prefix.
    * FCP: 64-bit identifier, colon separated values.
    * SBP: 64-bit identifier, no prefix.
    * SRP: 128-bit identifier, "0x" prefix.
    * iSCSI: IQN string.
    
    Signed-off-by: Maurizio Lombardi <mlombard@redhat.com>
    Link: https://lore.kernel.org/r/20250714133738.11054-1-mlombard@redhat.com
    Reviewed-by: Dmitry Bogdanov <d.bogdanov@yadro.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

scsi: ufs: core: Fix interrupt handling for MCQ Mode [+ + +]

Author: Nitin Rawat <quic_nitirawa@quicinc.com>
Date:   Tue Jul 29 04:27:11 2025 +0530

    scsi: ufs: core: Fix interrupt handling for MCQ Mode
    
    [ Upstream commit 034d319c8899e8c5c0a35c6692c7fc7e8c12c374 ]
    
    Commit 3c7ac40d7322 ("scsi: ufs: core: Delegate the interrupt service
    routine to a threaded IRQ handler") introduced a regression where the UFS
    interrupt status register (IS) was not cleared in ufshcd_intr() when
    operating in MCQ mode. As a result, the IS register remained uncleared.
    
    This led to a persistent issue during UIC interrupts:
    ufshcd_is_auto_hibern8_error() consistently returned true because the
    UFSHCD_UIC_HIBERN8_MASK bit was set, while the active command was neither
    UIC_CMD_DME_HIBER_ENTER nor UIC_CMD_DME_HIBER_EXIT. This caused
    continuous auto hibern8 enter errors and device failed to boot.
    
    To fix this, ensure that the interrupt status register is properly
    cleared in the ufshcd_intr() function for both MCQ mode with ESI enabled.
    
    [    4.553226] ufshcd-qcom 1d84000.ufs: ufshcd_check_errors: Auto
    Hibern8 Enter failed - status: 0x00000040, upmcrs: 0x00000001
    [    4.553229] ufshcd-qcom 1d84000.ufs: ufshcd_check_errors: saved_err
    0x40 saved_uic_err 0x0
    [    4.553311] host_regs: 00000000: d5c7033f 20e0071f 00000400 00000000
    [    4.553312] host_regs: 00000010: 01000000 00010217 00000c96 00000000
    [    4.553314] host_regs: 00000020: 00000440 00170ef5 00000000 00000000
    [    4.553316] host_regs: 00000030: 0000010f 00000001 00000000 00000000
    [    4.553317] host_regs: 00000040: 00000000 00000000 00000000 00000000
    [    4.553319] host_regs: 00000050: fffdf000 0000000f 00000000 00000000
    [    4.553320] host_regs: 00000060: 00000001 80000000 00000000 00000000
    [    4.553322] host_regs: 00000070: fffde000 0000000f 00000000 00000000
    [    4.553323] host_regs: 00000080: 00000001 00000000 00000000 00000000
    [    4.553325] host_regs: 00000090: 00000002 d0020000 00000000 01930200
    
    Fixes: 3c7ac40d7322 ("scsi: ufs: core: Delegate the interrupt service routine to a threaded IRQ handler")
    Co-developed-by: Palash Kambar <quic_pkambar@quicinc.com>
    Signed-off-by: Palash Kambar <quic_pkambar@quicinc.com>
    Signed-off-by: Nitin Rawat <quic_nitirawa@quicinc.com>
    Link: https://lore.kernel.org/r/20250728225711.29273-1-quic_nitirawa@quicinc.com
    Tested-by: Neil Armstrong <neil.armstrong@linaro.org> # on SM8650-QRD
    Reviewed-by: Bart Van Assche <bvanassche@acm.org>
    Reviewed-by: Peter Wang <peter.wang@mediatek.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

sctp: linearize cloned gso packets in sctp_rcv [+ + +]

Author: Xin Long <lucien.xin@gmail.com>
Date:   Thu Aug 7 15:40:11 2025 -0400

    sctp: linearize cloned gso packets in sctp_rcv
    
    [ Upstream commit fd60d8a086191fe33c2d719732d2482052fa6805 ]
    
    A cloned head skb still shares these frag skbs in fraglist with the
    original head skb. It's not safe to access these frag skbs.
    
    syzbot reported two use-of-uninitialized-memory bugs caused by this:
    
      BUG: KMSAN: uninit-value in sctp_inq_pop+0x15b7/0x1920 net/sctp/inqueue.c:211
       sctp_inq_pop+0x15b7/0x1920 net/sctp/inqueue.c:211
       sctp_assoc_bh_rcv+0x1a7/0xc50 net/sctp/associola.c:998
       sctp_inq_push+0x2ef/0x380 net/sctp/inqueue.c:88
       sctp_backlog_rcv+0x397/0xdb0 net/sctp/input.c:331
       sk_backlog_rcv+0x13b/0x420 include/net/sock.h:1122
       __release_sock+0x1da/0x330 net/core/sock.c:3106
       release_sock+0x6b/0x250 net/core/sock.c:3660
       sctp_wait_for_connect+0x487/0x820 net/sctp/socket.c:9360
       sctp_sendmsg_to_asoc+0x1ec1/0x1f00 net/sctp/socket.c:1885
       sctp_sendmsg+0x32b9/0x4a80 net/sctp/socket.c:2031
       inet_sendmsg+0x25a/0x280 net/ipv4/af_inet.c:851
       sock_sendmsg_nosec net/socket.c:718 [inline]
    
    and
    
      BUG: KMSAN: uninit-value in sctp_assoc_bh_rcv+0x34e/0xbc0 net/sctp/associola.c:987
       sctp_assoc_bh_rcv+0x34e/0xbc0 net/sctp/associola.c:987
       sctp_inq_push+0x2a3/0x350 net/sctp/inqueue.c:88
       sctp_backlog_rcv+0x3c7/0xda0 net/sctp/input.c:331
       sk_backlog_rcv+0x142/0x420 include/net/sock.h:1148
       __release_sock+0x1d3/0x330 net/core/sock.c:3213
       release_sock+0x6b/0x270 net/core/sock.c:3767
       sctp_wait_for_connect+0x458/0x820 net/sctp/socket.c:9367
       sctp_sendmsg_to_asoc+0x223a/0x2260 net/sctp/socket.c:1886
       sctp_sendmsg+0x3910/0x49f0 net/sctp/socket.c:2032
       inet_sendmsg+0x269/0x2a0 net/ipv4/af_inet.c:851
       sock_sendmsg_nosec net/socket.c:712 [inline]
    
    This patch fixes it by linearizing cloned gso packets in sctp_rcv().
    
    Fixes: 90017accff61 ("sctp: Add GSO support")
    Reported-by: syzbot+773e51afe420baaf0e2b@syzkaller.appspotmail.com
    Reported-by: syzbot+70a42f45e76bede082be@syzkaller.appspotmail.com
    Signed-off-by: Xin Long <lucien.xin@gmail.com>
    Reviewed-by: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com>
    Link: https://patch.msgid.link/dd7dc337b99876d4132d0961f776913719f7d225.1754595611.git.lucien.xin@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

securityfs: don't pin dentries twice, once is enough... [+ + +]

Author: Al Viro <viro@zeniv.linux.org.uk>
Date:   Thu May 8 23:38:01 2025 -0400

    securityfs: don't pin dentries twice, once is enough...
    
    [ Upstream commit 27cd1bf1240d482e4f02ca4f9812e748f3106e4f ]
    
    incidentally, securityfs_recursive_remove() is broken without that -
    it leaks dentries, since simple_recursive_removal() does not expect
    anything of that sort.  It could be worked around by dput() in
    remove_one() callback, but it's easier to just drop that double-get
    stuff.
    
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests/bpf: Fix a user_ringbuf failure with arm64 64KB page size [+ + +]

Author: Yonghong Song <yonghong.song@linux.dev>
Date:   Fri Jun 6 18:36:26 2025 -0700

    selftests/bpf: Fix a user_ringbuf failure with arm64 64KB page size
    
    [ Upstream commit bbc7bd658ddc662083639b9e9a280b90225ecd9a ]
    
    The ringbuf max_entries must be PAGE_ALIGNED. See kernel function
    ringbuf_map_alloc(). So for arm64 64KB page size, adjust max_entries
    properly.
    
    Signed-off-by: Yonghong Song <yonghong.song@linux.dev>
    Link: https://lore.kernel.org/r/20250607013626.1553001-1-yonghong.song@linux.dev
    Signed-off-by: Alexei Starovoitov <ast@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests/bpf: Fix ringbuf/ringbuf_write test failure with arm64 64KB page size [+ + +]

Author: Yonghong Song <yonghong.song@linux.dev>
Date:   Fri Jun 6 18:36:21 2025 -0700

    selftests/bpf: Fix ringbuf/ringbuf_write test failure with arm64 64KB page size
    
    [ Upstream commit 8c8c5e3c854a2593ec90dacd868f3066b67de1c4 ]
    
    The ringbuf max_entries must be PAGE_ALIGNED. See kernel function
    ringbuf_map_alloc(). So for arm64 64KB page size, adjust max_entries
    and other related metrics properly.
    
    Signed-off-by: Yonghong Song <yonghong.song@linux.dev>
    Link: https://lore.kernel.org/r/20250607013621.1552332-1-yonghong.song@linux.dev
    Signed-off-by: Alexei Starovoitov <ast@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests/futex: Define SYS_futex on 32-bit architectures with 64-bit time_t [+ + +]

Author: Cynthia Huang <cynthia@andestech.com>
Date:   Thu Jul 10 18:36:30 2025 +0800

    selftests/futex: Define SYS_futex on 32-bit architectures with 64-bit time_t
    
    [ Upstream commit 04850819c65c8242072818655d4341e70ae998b5 ]
    
    The kernel does not provide sys_futex() on 32-bit architectures that do not
    support 32-bit time representations, such as riscv32.
    
    As a result, glibc cannot define SYS_futex, causing compilation failures in
    tests that rely on this syscall. Define SYS_futex as SYS_futex_time64 in
    such cases to ensure successful compilation and compatibility.
    
    Signed-off-by: Cynthia Huang <cynthia@andestech.com>
    Signed-off-by: Ben Zong-You Xie <ben717@andestech.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Reviewed-by: Muhammad Usama Anjum <usama.anjum@collabora.com>
    Link: https://lore.kernel.org/all/20250710103630.3156130-1-ben717@andestech.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests/kexec: fix test_kexec_jump build [+ + +]

Author: Moon Hee Lee <moonhee.lee.ca@gmail.com>
Date:   Wed Jul 2 10:17:05 2025 -0700

    selftests/kexec: fix test_kexec_jump build
    
    [ Upstream commit 661e9cd196598c7d2502260ebbe60970546cca35 ]
    
    The test_kexec_jump program builds correctly when invoked from the top-level
    selftests/Makefile, which explicitly sets the OUTPUT variable. However,
    building directly in tools/testing/selftests/kexec fails with:
    
      make: *** No rule to make target '/test_kexec_jump', needed by 'test_kexec_jump.sh'.  Stop.
    
    This failure occurs because the Makefile rule relies on $(OUTPUT), which is
    undefined in direct builds.
    
    Fix this by listing test_kexec_jump in TEST_GEN_PROGS, the standard way to
    declare generated test binaries in the kselftest framework. This ensures the
    binary is built regardless of invocation context and properly removed by
    make clean.
    
    Link: https://lore.kernel.org/r/20250702171704.22559-2-moonhee.lee.ca@gmail.com
    Acked-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Moon Hee Lee <moonhee.lee.ca@gmail.com>
    Acked-by: Baoquan He <bhe@redhat.com>
    Acked-by: David Woodhouse <dwmw@amazon.co.uk>
    Signed-off-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests: netfilter: Enable CONFIG_INET_SCTP_DIAG [+ + +]

Author: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
Date:   Mon Jun 30 17:44:25 2025 +0200

    selftests: netfilter: Enable CONFIG_INET_SCTP_DIAG
    
    [ Upstream commit ba71a6e58b38aa6f86865d4e18579cb014903692 ]
    
    The config snippet specifies CONFIG_SCTP_DIAG. This was never an option.
    
    Replace CONFIG_SCTP_DIAG with the intended CONFIG_INET_SCTP_DIAG.
    
    Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests: tracing: Use mutex_unlock for testing glob filter [+ + +]

Author: Masami Hiramatsu (Google) <mhiramat@kernel.org>
Date:   Thu Jul 3 13:26:43 2025 +0900

    selftests: tracing: Use mutex_unlock for testing glob filter
    
    [ Upstream commit a089bb2822a49b0c5777a8936f82c1f8629231fb ]
    
    Since commit c5b6ababd21a ("locking/mutex: implement
    mutex_trylock_nested") makes mutex_trylock() as an inlined
    function if CONFIG_DEBUG_LOCK_ALLOC=y, we can not use
    mutex_trylock() for testing the glob filter of ftrace.
    
    Use mutex_unlock instead.
    
    Link: https://lore.kernel.org/r/175151680309.2149615.9795104805153538717.stgit@mhiramat.tok.corp.google.com
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Acked-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests: vDSO: vdso_test_getrandom: Always print TAP header [+ + +]

Author: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
Date:   Wed Jun 11 12:33:58 2025 +0200

    selftests: vDSO: vdso_test_getrandom: Always print TAP header
    
    [ Upstream commit 1158220b24674edaf885433153deb4f0e5c7d331 ]
    
    The TAP specification requires that the output begins with a header line.
    If vgetrandom_init() fails and skips the test, that header line is missing.
    
    Call vgetrandom_init() after ksft_print_header().
    
    Signed-off-by: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Reviewed-by: Muhammad Usama Anjum <usama.anjum@collabora.com>
    Link: https://lore.kernel.org/all/20250611-selftests-vdso-fixes-v3-8-e62e37a6bcf5@linutronix.de
    Signed-off-by: Sasha Levin <sashal@kernel.org>

smb/server: avoid deadlock when linking with ReplaceIfExists [+ + +]

Author: NeilBrown <neil@brown.name>
Date:   Mon Jun 9 09:35:09 2025 +1000

    smb/server: avoid deadlock when linking with ReplaceIfExists
    
    [ Upstream commit d5fc1400a34b4ea5e8f2ce296ea12bf8c8421694 ]
    
    If smb2_create_link() is called with ReplaceIfExists set and the name
    does exist then a deadlock will happen.
    
    ksmbd_vfs_kern_path_locked() will return with success and the parent
    directory will be locked.  ksmbd_vfs_remove_file() will then remove the
    file.  ksmbd_vfs_link() will then be called while the parent is still
    locked.  It will try to lock the same parent and will deadlock.
    
    This patch moves the ksmbd_vfs_kern_path_unlock() call to *before*
    ksmbd_vfs_link() and then simplifies the code, removing the file_present
    flag variable.
    
    Signed-off-by: NeilBrown <neil@brown.name>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

smb3: fix for slab out of bounds on mount to ksmbd [+ + +]

Author: Steve French <stfrench@microsoft.com>
Date:   Mon Aug 11 23:14:55 2025 -0500

    smb3: fix for slab out of bounds on mount to ksmbd
    
    commit 7d34ec36abb84fdfb6632a0f2cbda90379ae21fc upstream.
    
    With KASAN enabled, it is possible to get a slab out of bounds
    during mount to ksmbd due to missing check in parse_server_interfaces()
    (see below):
    
     BUG: KASAN: slab-out-of-bounds in
     parse_server_interfaces+0x14ee/0x1880 [cifs]
     Read of size 4 at addr ffff8881433dba98 by task mount/9827
    
     CPU: 5 UID: 0 PID: 9827 Comm: mount Tainted: G
     OE       6.16.0-rc2-kasan #2 PREEMPT(voluntary)
     Tainted: [O]=OOT_MODULE, [E]=UNSIGNED_MODULE
     Hardware name: Dell Inc. Precision Tower 3620/0MWYPT,
     BIOS 2.13.1 06/14/2019
     Call Trace:
      <TASK>
     dump_stack_lvl+0x9f/0xf0
     print_report+0xd1/0x670
     __virt_addr_valid+0x22c/0x430
     ? parse_server_interfaces+0x14ee/0x1880 [cifs]
     ? kasan_complete_mode_report_info+0x2a/0x1f0
     ? parse_server_interfaces+0x14ee/0x1880 [cifs]
       kasan_report+0xd6/0x110
       parse_server_interfaces+0x14ee/0x1880 [cifs]
       __asan_report_load_n_noabort+0x13/0x20
       parse_server_interfaces+0x14ee/0x1880 [cifs]
     ? __pfx_parse_server_interfaces+0x10/0x10 [cifs]
     ? trace_hardirqs_on+0x51/0x60
     SMB3_request_interfaces+0x1ad/0x3f0 [cifs]
     ? __pfx_SMB3_request_interfaces+0x10/0x10 [cifs]
     ? SMB2_tcon+0x23c/0x15d0 [cifs]
     smb3_qfs_tcon+0x173/0x2b0 [cifs]
     ? __pfx_smb3_qfs_tcon+0x10/0x10 [cifs]
     ? cifs_get_tcon+0x105d/0x2120 [cifs]
     ? do_raw_spin_unlock+0x5d/0x200
     ? cifs_get_tcon+0x105d/0x2120 [cifs]
     ? __pfx_smb3_qfs_tcon+0x10/0x10 [cifs]
     cifs_mount_get_tcon+0x369/0xb90 [cifs]
     ? dfs_cache_find+0xe7/0x150 [cifs]
     dfs_mount_share+0x985/0x2970 [cifs]
     ? check_path.constprop.0+0x28/0x50
     ? save_trace+0x54/0x370
     ? __pfx_dfs_mount_share+0x10/0x10 [cifs]
     ? __lock_acquire+0xb82/0x2ba0
     ? __kasan_check_write+0x18/0x20
     cifs_mount+0xbc/0x9e0 [cifs]
     ? __pfx_cifs_mount+0x10/0x10 [cifs]
     ? do_raw_spin_unlock+0x5d/0x200
     ? cifs_setup_cifs_sb+0x29d/0x810 [cifs]
     cifs_smb3_do_mount+0x263/0x1990 [cifs]
    
    Reported-by: Namjae Jeon <linkinjeon@kernel.org>
    Tested-by: Namjae Jeon <linkinjeon@kernel.org>
    Cc: stable@vger.kernel.org
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: don't call init_waitqueue_head(&info->conn_wait) twice in _smbd_get_connection [+ + +]

Author: Stefan Metzmacher <metze@samba.org>
Date:   Thu Aug 7 18:12:13 2025 +0200

    smb: client: don't call init_waitqueue_head(&info->conn_wait) twice in _smbd_get_connection
    
    [ Upstream commit 550a194c5998e4e77affc6235e80d3766dc2d27e ]
    
    It is already called long before we may hit this cleanup code path.
    
    Cc: Steve French <smfrench@gmail.com>
    Cc: Tom Talpey <tom@talpey.com>
    Cc: Long Li <longli@microsoft.com>
    Cc: linux-cifs@vger.kernel.org
    Cc: samba-technical@lists.samba.org
    Signed-off-by: Stefan Metzmacher <metze@samba.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

smb: client: don't wait for info->send_pending == 0 on error [+ + +]

Author: Stefan Metzmacher <metze@samba.org>
Date:   Tue Aug 12 18:45:06 2025 +0200

    smb: client: don't wait for info->send_pending == 0 on error
    
    commit 8c48e1c7520321cc87ff651e96093e2f412785fb upstream.
    
    We already called ib_drain_qp() before and that makes sure
    send_done() was called with IB_WC_WR_FLUSH_ERR, but
    didn't called atomic_dec_and_test(&sc->send_io.pending.count)
    
    So we may never reach the info->send_pending == 0 condition.
    
    Cc: Steve French <smfrench@gmail.com>
    Cc: Tom Talpey <tom@talpey.com>
    Cc: Long Li <longli@microsoft.com>
    Cc: linux-cifs@vger.kernel.org
    Cc: samba-technical@lists.samba.org
    Fixes: 5349ae5e05fa ("smb: client: let send_done() cleanup before calling smbd_disconnect_rdma_connection()")
    Signed-off-by: Stefan Metzmacher <metze@samba.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: fix session setup against servers that require SPN [+ + +]

Author: Paulo Alcantara <pc@manguebit.org>
Date:   Fri Jul 25 00:04:44 2025 -0300

    smb: client: fix session setup against servers that require SPN
    
    [ Upstream commit 33cfdd726381828b9907a61c038a9f48b6690a31 ]
    
    Some servers might enforce the SPN to be set in the target info
    blob (AV pairs) when sending NTLMSSP_AUTH message.  In Windows Server,
    this could be enforced with SmbServerNameHardeningLevel set to 2.
    
    Fix this by always appending SPN (cifs/<hostname>) to the existing
    list of target infos when setting up NTLMv2 response blob.
    
    Cc: linux-cifs@vger.kernel.org
    Cc: David Howells <dhowells@redhat.com>
    Reported-by: Pierguido Lambri <plambri@redhat.com>
    Signed-off-by: Paulo Alcantara (Red Hat) <pc@manguebit.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

smb: client: let send_done() cleanup before calling smbd_disconnect_rdma_connection() [+ + +]

Author: Stefan Metzmacher <metze@samba.org>
Date:   Mon Aug 4 14:10:12 2025 +0200

    smb: client: let send_done() cleanup before calling smbd_disconnect_rdma_connection()
    
    commit 5349ae5e05fa37409fd48a1eb483b199c32c889b upstream.
    
    We should call ib_dma_unmap_single() and mempool_free() before calling
    smbd_disconnect_rdma_connection().
    
    And smbd_disconnect_rdma_connection() needs to be the last function to
    call as all other state might already be gone after it returns.
    
    Cc: Steve French <smfrench@gmail.com>
    Cc: Tom Talpey <tom@talpey.com>
    Cc: Long Li <longli@microsoft.com>
    Cc: linux-cifs@vger.kernel.org
    Cc: samba-technical@lists.samba.org
    Fixes: f198186aa9bb ("CIFS: SMBD: Establish SMB Direct connection")
    Signed-off-by: Stefan Metzmacher <metze@samba.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: remove redundant lstrp update in negotiate protocol [+ + +]

Author: Wang Zhaolong <wangzhaolong@huaweicloud.com>
Date:   Fri Aug 1 17:07:24 2025 +0800

    smb: client: remove redundant lstrp update in negotiate protocol
    
    commit e19d8dd694d261ac26adb2a26121a37c107c81ad upstream.
    
    Commit 34331d7beed7 ("smb: client: fix first command failure during
    re-negotiation") addressed a race condition by updating lstrp before
    entering negotiate state. However, this approach may have some unintended
    side effects.
    
    The lstrp field is documented as "when we got last response from this
    server", and updating it before actually receiving a server response
    could potentially affect other mechanisms that rely on this timestamp.
    For example, the SMB echo detection logic also uses lstrp as a reference
    point. In scenarios with frequent user operations during reconnect states,
    the repeated calls to cifs_negotiate_protocol() might continuously
    update lstrp, which could interfere with the echo detection timing.
    
    Additionally, commit 266b5d02e14f ("smb: client: fix race condition in
    negotiate timeout by using more precise timing") introduced a dedicated
    neg_start field specifically for tracking negotiate start time. This
    provides a more precise solution for the original race condition while
    preserving the intended semantics of lstrp.
    
    Since the race condition is now properly handled by the neg_start
    mechanism, the lstrp update in cifs_negotiate_protocol() is no longer
    necessary and can be safely removed.
    
    Fixes: 266b5d02e14f ("smb: client: fix race condition in negotiate timeout by using more precise timing")
    Cc: stable@vger.kernel.org
    Acked-by: Paulo Alcantara (Red Hat) <pc@manguebit.org>
    Signed-off-by: Wang Zhaolong <wangzhaolong@huaweicloud.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

soc: qcom: mdt_loader: Actually use the e_phoff [+ + +]

Author: Bjorn Andersson <bjorn.andersson@oss.qualcomm.com>
Date:   Tue Jun 10 21:58:30 2025 -0500

    soc: qcom: mdt_loader: Actually use the e_phoff
    
    [ Upstream commit 47e339cac89143709e84a3b71ba8bd9b2fdd2368 ]
    
    Rather than relying/assuming that the tools generating the firmware
    places the program headers immediately following the ELF header, use
    e_phoff as intended to find the program headers.
    
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Bjorn Andersson <bjorn.andersson@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250610-mdt-loader-validation-and-fixes-v2-3-f7073e9ab899@oss.qualcomm.com
    Signed-off-by: Bjorn Andersson <andersson@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

soc: qcom: rpmh-rsc: Add RSC version 4 support [+ + +]

Author: Maulik Shah <maulik.shah@oss.qualcomm.com>
Date:   Mon Jun 23 11:19:43 2025 +0530

    soc: qcom: rpmh-rsc: Add RSC version 4 support
    
    [ Upstream commit 84684c57c9cd47b86c883a7170dd68222d97ef13 ]
    
    Register offsets for v3 and v4 versions are backward compatible. Assign v3
    offsets for v4 and all higher versions to avoid end up using v2 offsets.
    
    Signed-off-by: Maulik Shah <maulik.shah@oss.qualcomm.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Link: https://lore.kernel.org/r/20250623-rsc_v4-v1-1-275b27bc5e3c@oss.qualcomm.com
    Signed-off-by: Bjorn Andersson <andersson@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

soundwire: amd: cancel pending slave status handling workqueue during remove sequence [+ + +]

Author: Vijendar Mukunda <Vijendar.Mukunda@amd.com>
Date:   Fri May 30 11:13:41 2025 +0530

    soundwire: amd: cancel pending slave status handling workqueue during remove sequence
    
    [ Upstream commit f93b697ed98e3c85d1973ea170d4f4e7a6b2b45d ]
    
    During remove sequence, cancel the pending slave status update workqueue.
    
    Signed-off-by: Vijendar Mukunda <Vijendar.Mukunda@amd.com>
    Link: https://lore.kernel.org/r/20250530054447.1645807-4-Vijendar.Mukunda@amd.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

soundwire: amd: serialize amd manager resume sequence during pm_prepare [+ + +]

Author: Vijendar Mukunda <Vijendar.Mukunda@amd.com>
Date:   Fri May 30 11:13:40 2025 +0530

    soundwire: amd: serialize amd manager resume sequence during pm_prepare
    
    [ Upstream commit 03837341790039d6f1cbf7a1ae7dfa2cb77ef0a4 ]
    
    During pm_prepare callback, pm_request_resume() delays SoundWire manager D0
    entry sequence. Synchronize runtime resume sequence for amd_manager
    instance prior to invoking child devices resume sequence for both the amd
    power modes(ClockStop Mode and Power off mode).
    Change the power_mode_mask check and use pm_runtime_resume() in
    amd_pm_prepare() callback.
    
    Signed-off-by: Vijendar Mukunda <Vijendar.Mukunda@amd.com>
    Link: https://lore.kernel.org/r/20250530054447.1645807-3-Vijendar.Mukunda@amd.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

soundwire: Move handle_nested_irq outside of sdw_dev_lock [+ + +]

Author: Charles Keepax <ckeepax@opensource.cirrus.com>
Date:   Mon Jun 9 15:30:40 2025 +0100

    soundwire: Move handle_nested_irq outside of sdw_dev_lock
    
    [ Upstream commit ccb7bb13c00bcc3178d270da052635c56148bc16 ]
    
    The sdw_dev_lock protects the SoundWire driver callbacks against
    the probed flag, which is used to skip the callbacks if the
    driver gets removed. For more information see commit bd29c00edd0a
    ("soundwire: revisit driver bind/unbind and callbacks").
    
    However, this lock is a frequent source of mutex inversions.
    Many audio operations eventually hit the hardware resulting in a
    SoundWire callback, this means that typically the driver has the
    locking order ALSA/ASoC locks -> sdw_dev_lock. Conversely, the IRQ
    comes in directly from the SoundWire hardware, but then will often
    want to access ALSA/ASoC, such as updating something in DAPM or
    an ALSA control. This gives the other lock order sdw_dev_lock ->
    ALSA/ASoC locks.
    
    When the IRQ handling was initially added to SoundWire this was
    through a callback mechanism. As such it required being covered by
    the lock because the callbacks are part of the sdw_driver structure
    and are thus present regardless of if the driver is currently
    probed.
    
    Since then a newer mechanism using the IRQ framework has been
    added, which is currently covered by the same lock but this isn't
    actually required. Handlers for the IRQ framework are registered in
    probe and should by released during remove, thus the IRQ framework
    will have already unbound the IRQ before the slave driver is
    removed. Avoid the aforementioned mutex inversion by moving the
    handle_nested_irq call outside of the sdw_dev_lock.
    
    Signed-off-by: Charles Keepax <ckeepax@opensource.cirrus.com>
    Link: https://lore.kernel.org/r/20250609143041.495049-3-ckeepax@opensource.cirrus.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

sphinx: kernel_abi: fix performance regression with O= [+ + +]

Author: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
Date:   Thu Jul 17 13:37:19 2025 +0200

    sphinx: kernel_abi: fix performance regression with O=<dir>
    
    [ Upstream commit 2b16b71a05a7f056221751b906c13f8809656b1f ]
    
    The logic there which adds a dependency note to Sphinx cache
    is not taking into account that the build dir may not be
    the source dir. This causes a performance regression:
    
    $ time make O=/tmp/foo SPHINXDIRS=admin-guide htmldocs
    
            [OUTDATED]
            Added: set()
            Changed: {'abi-obsolete', 'abi-removed', 'abi-stable-files', 'abi-obsolete-files', 'abi-stable', 'abi', 'abi-removed-files', 'abi-testing-files', 'abi-testing', 'gpio/index', 'gpio/obsolete'}
            Removed: set()
            All docs count: 385
            Found docs count: 385
    
            real    0m11,324s
            user    0m15,783s
            sys     0m1,164s
    
    To get the root cause of the problem (ABI files reported as changed),
    I used this changeset:
    
    #       diff --git a/Documentation/conf.py b/Documentation/conf.py
    #       index e8766e689c1b..ab486623bd8b 100644
    #       --- a/Documentation/conf.py
    #       +++ b/Documentation/conf.py
    #       @@ -571,3 +571,16 @@ def setup(app):
    #            """Patterns need to be updated at init time on older Sphinx versions"""
    #
    #            app.connect('config-inited', update_patterns)
    #       +    app.connect('env-get-outdated', on_outdated)
    #       +
    #       +def on_outdated(app, env, added, changed, removed):
    #       +    """Track cache outdated due to added/changed/removed files"""
    #       +    print("\n[OUTDATED]")
    #       +    print(f"Added: {added}")
    #       +    print(f"Changed: {changed}")
    #       +    print(f"Removed: {removed}")
    #       +    print(f"All docs count: {len(env.all_docs)}")
    #       +    print(f"Found docs count: {len(env.found_docs)}")
    #       +
    #       +    # Just return what we have
    #       +    return added | changed | removed
    
    Reported-by: Akira Yokosawa <akiyks@gmail.com>
    Closes: https://lore.kernel.org/linux-doc/c174f7c5-ec21-4eae-b1c3-f643cca90d9d@gmail.com/
    Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>
    Tested-by: Akira Yokosawa <akiyks@gmail.com>
    Signed-off-by: Jonathan Corbet <corbet@lwn.net>
    Link: https://lore.kernel.org/r/e25673d87357457bc54ee863e97ff8f75956580d.1752752211.git.mchehab+huawei@kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

staging: gpib: Add init response codes for new ni-usb-hs+ [+ + +]

Author: Dave Penkler <dpenkler@gmail.com>
Date:   Tue Jul 22 18:48:10 2025 +0200

    staging: gpib: Add init response codes for new ni-usb-hs+
    
    [ Upstream commit f50d5e0c1f80d004510bf77cb0e1759103585c00 ]
    
    A new version of a bona fide genuine NI-USB-HS+ adaptor
    sends new response codes to the initialization sequence.
    
    Add the checking for these response codes to suppress
    console warning messages.
    
    Signed-off-by: Dave Penkler <dpenkler@gmail.com>
    Link: https://lore.kernel.org/r/20250722164810.2621-1-dpenkler@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

sunvdc: Balance device refcount in vdc_port_mpgroup_check [+ + +]

Author: Ma Ke <make24@iscas.ac.cn>
Date:   Sat Jul 19 15:58:56 2025 +0800

    sunvdc: Balance device refcount in vdc_port_mpgroup_check
    
    commit 63ce53724637e2e7ba51fe3a4f78351715049905 upstream.
    
    Using device_find_child() to locate a probed virtual-device-port node
    causes a device refcount imbalance, as device_find_child() internally
    calls get_device() to increment the device’s reference count before
    returning its pointer. vdc_port_mpgroup_check() directly returns true
    upon finding a matching device without releasing the reference via
    put_device(). We should call put_device() to decrement refcount.
    
    As comment of device_find_child() says, 'NOTE: you will need to drop
    the reference with put_device() after use'.
    
    Found by code review.
    
    Cc: stable@vger.kernel.org
    Fixes: 3ee70591d6c4 ("sunvdc: prevent sunvdc panic when mpgroup disk added to guest domain")
    Signed-off-by: Ma Ke <make24@iscas.ac.cn>
    Link: https://lore.kernel.org/r/20250719075856.3447953-1-make24@iscas.ac.cn
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

thermal/drivers/qcom-spmi-temp-alarm: Enable stage 2 shutdown when required [+ + +]

Author: David Collins <david.collins@oss.qualcomm.com>
Date:   Thu Jul 10 15:45:51 2025 -0700

    thermal/drivers/qcom-spmi-temp-alarm: Enable stage 2 shutdown when required
    
    [ Upstream commit f8e157ff2df46ddabd930815d196895976227831 ]
    
    Certain TEMP_ALARM GEN2 PMIC peripherals need over-temperature stage 2
    automatic PMIC partial shutdown. This will ensure that in the event of
    reaching the hotter stage 3 over-temperature threshold, repeated faults
    will be avoided during the automatic PMIC hardware full shutdown.
    Modify the stage 2 shutdown control logic to ensure that stage 2
    shutdown is enabled on all affected PMICs. Read the digital major
    and minor revision registers to identify these PMICs.
    
    Signed-off-by: David Collins <david.collins@oss.qualcomm.com>
    Signed-off-by: Anjelique Melendez <anjelique.melendez@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20250710224555.3047790-2-anjelique.melendez@oss.qualcomm.com
    Signed-off-by: Daniel Lezcano <daniel.lezcano@linaro.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

thermal: sysfs: Return ENODATA instead of EAGAIN for reads [+ + +]

Author: Hsin-Te Yuan <yuanhsinte@chromium.org>
Date:   Fri Jun 20 10:41:43 2025 +0000

    thermal: sysfs: Return ENODATA instead of EAGAIN for reads
    
    [ Upstream commit 1a4aabc27e95674837f2e25f4ef340c0469e6203 ]
    
    According to POSIX spec, EAGAIN returned by read with O_NONBLOCK set
    means the read would block. Hence, the common implementation in
    nonblocking model will poll the file when the nonblocking read returns
    EAGAIN. However, when the target file is thermal zone, this mechanism
    will totally malfunction because thermal zone doesn't implement sysfs
    notification and thus the poll will never return.
    
    For example, the read in Golang implemnts such method and sometimes
    hangs at reading some thermal zones via sysfs.
    
    Change to return -ENODATA instead of -EAGAIN to userspace.
    
    Signed-off-by: Hsin-Te Yuan <yuanhsinte@chromium.org>
    Link: https://patch.msgid.link/20250620-temp-v3-1-6becc6aeb66c@chromium.org
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

thunderbolt: Fix copy+paste error in match_service_id() [+ + +]

Author: Eric Biggers <ebiggers@kernel.org>
Date:   Sun Jul 20 22:01:36 2025 -0700

    thunderbolt: Fix copy+paste error in match_service_id()
    
    commit 5cc1f66cb23cccc704e3def27ad31ed479e934a5 upstream.
    
    The second instance of TBSVC_MATCH_PROTOCOL_VERSION seems to have been
    intended to be TBSVC_MATCH_PROTOCOL_REVISION.
    
    Fixes: d1ff70241a27 ("thunderbolt: Add support for XDomain discovery protocol")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Eric Biggers <ebiggers@kernel.org>
    Link: https://lore.kernel.org/r/20250721050136.30004-1-ebiggers@kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

tls: handle data disappearing from under the TLS ULP [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Thu Aug 7 16:29:06 2025 -0700

    tls: handle data disappearing from under the TLS ULP
    
    [ Upstream commit 6db015fc4b5d5f63a64a193f65d98da3a7fc811d ]
    
    TLS expects that it owns the receive queue of the TCP socket.
    This cannot be guaranteed in case the reader of the TCP socket
    entered before the TLS ULP was installed, or uses some non-standard
    read API (eg. zerocopy ones). Replace the WARN_ON() and a buggy
    early exit (which leaves anchor pointing to a freed skb) with real
    error handling. Wipe the parsing state and tell the reader to retry.
    
    We already reload the anchor every time we (re)acquire the socket lock,
    so the only condition we need to avoid is an out of bounds read
    (not having enough bytes in the socket for previously parsed record len).
    
    If some data was read from under TLS but there's enough in the queue
    we'll reload and decrypt what is most likely not a valid TLS record.
    Leading to some undefined behavior from TLS perspective (corrupting
    a stream? missing an alert? missing an attack?) but no kernel crash
    should take place.
    
    Reported-by: William Liu <will@willsroot.io>
    Reported-by: Savino Dicanosa <savy@syst3mfailure.io>
    Link: https://lore.kernel.org/tFjq_kf7sWIG3A7CrCg_egb8CVsT_gsmHAK0_wxDPJXfIzxFAMxqmLwp3MlU5EHiet0AwwJldaaFdgyHpeIUCS-3m3llsmRzp9xIOBR4lAI=@syst3mfailure.io
    Fixes: 84c61fe1a75b ("tls: rx: do not use the standard strparser")
    Reviewed-by: Eric Dumazet <edumazet@google.com>
    Link: https://patch.msgid.link/20250807232907.600366-1-kuba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tools/build: Fix s390(x) cross-compilation with clang [+ + +]

Author: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
Date:   Fri Jun 20 13:00:27 2025 +0200

    tools/build: Fix s390(x) cross-compilation with clang
    
    [ Upstream commit a40f0cdce78be8a559ee8a85c908049c65a410b2 ]
    
    The heuristic to derive a clang target triple from a GCC one does not work
    for s390. GCC uses "s390-linux" while clang expects "s390x-linux" or
    "powerz-linux".
    
    Add an explicit override.
    
    Signed-off-by: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
    Link: https://lore.kernel.org/r/20250620-tools-cross-s390-v2-1-ecda886e00e5@linutronix.de
    Signed-off-by: Thomas Weißschuh <linux@weissschuh.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tools/nolibc: define time_t in terms of __kernel_old_time_t [+ + +]

Author: Thomas Weißschuh <linux@weissschuh.net>
Date:   Sat Jul 12 11:00:55 2025 +0200

    tools/nolibc: define time_t in terms of __kernel_old_time_t
    
    [ Upstream commit d5094bcb5bfdfea2cf0de8aaf77cc65db56cbdb5 ]
    
    Nolibc assumes that the kernel ABI is using a time values that are as
    large as a long integer. For most ABIs this holds true.
    But for x32 this is not correct, as it uses 32bit longs but 64bit times.
    
    Also the 'struct stat' implementation of nolibc relies on timespec::tv_sec
    and time_t being the same type. While timespec::tv_sec comes from the
    kernel and is of type __kernel_old_time_t, time_t is defined within nolibc.
    
    Switch to the __kernel_old_time_t to always get the correct type.
    
    Signed-off-by: Thomas Weißschuh <linux@weissschuh.net>
    Link: https://lore.kernel.org/r/20250712-nolibc-x32-v1-1-6d81cb798710@weissschuh.net
    Acked-by: Willy Tarreau <w@1wt.eu>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tools/nolibc: fix spelling of FD_SETBITMASK in FD_* macros [+ + +]

Author: Willy Tarreau <w@1wt.eu>
Date:   Thu Jun 19 11:30:51 2025 +0200

    tools/nolibc: fix spelling of FD_SETBITMASK in FD_* macros
    
    commit a477629baa2a0e9991f640af418e8c973a1c08e3 upstream.
    
    While nolibc-test does test syscalls, it doesn't test as much the rest
    of the macros, and a wrong spelling of FD_SETBITMASK in commit
    feaf75658783a broke programs using either FD_SET() or FD_CLR() without
    being noticed. Let's fix these macros.
    
    Fixes: feaf75658783a ("nolibc: fix fd_set type")
    Cc: stable@vger.kernel.org # v6.2+
    Acked-by: Thomas Weißschuh <linux@weissschuh.net>
    Signed-off-by: Willy Tarreau <w@1wt.eu>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

tools/power turbostat: Fix build with musl [+ + +]

Author: Calvin Owens <calvin@wbinvd.org>
Date:   Fri Jun 13 09:54:23 2025 -0700

    tools/power turbostat: Fix build with musl
    
    [ Upstream commit 6ea0ec1b958a84aff9f03fb0ae4613a4d5bed3ea ]
    
    turbostat.c: In function 'parse_int_file':
        turbostat.c:5567:19: error: 'PATH_MAX' undeclared (first use in this function)
         5567 |         char path[PATH_MAX];
              |                   ^~~~~~~~
    
        turbostat.c: In function 'probe_graphics':
        turbostat.c:6787:19: error: 'PATH_MAX' undeclared (first use in this function)
         6787 |         char path[PATH_MAX];
              |                   ^~~~~~~~
    
    Signed-off-by: Calvin Owens <calvin@wbinvd.org>
    Reviewed-by: Artem Bityutskiy <artem.bityutskiy@linux.intel.com>
    Signed-off-by: Len Brown <len.brown@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tools/power turbostat: Handle cap_get_proc() ENOSYS [+ + +]

Author: Calvin Owens <calvin@wbinvd.org>
Date:   Fri Jun 13 19:20:28 2025 -0700

    tools/power turbostat: Handle cap_get_proc() ENOSYS
    
    [ Upstream commit d34fe509f5f76d9dc36291242d67c6528027ebbd ]
    
    Kernels configured with CONFIG_MULTIUSER=n have no cap_get_proc().
    Check for ENOSYS to recognize this case, and continue on to
    attempt to access the requested MSRs (such as temperature).
    
    Signed-off-by: Calvin Owens <calvin@wbinvd.org>
    Signed-off-by: Len Brown <len.brown@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tools/power turbostat: Handle non-root legacy-uncore sysfs permissions [+ + +]

Author: Len Brown <len.brown@intel.com>
Date:   Sat Aug 9 16:31:31 2025 -0400

    tools/power turbostat: Handle non-root legacy-uncore sysfs permissions
    
    [ Upstream commit e60a13bcef206795d3ddf82f130fe8f570176d06 ]
    
    /sys/devices/system/cpu/intel_uncore_frequency/package_X_die_Y/
    may be readable by all, but
    /sys/devices/system/cpu/intel_uncore_frequency/package_X_die_Y/current_freq_khz
    may be readable only by root.
    
    Non-root turbostat users see complaints in this scenario.
    
    Fail probe of the interface if we can't read current_freq_khz.
    
    Reported-by: Artem Bityutskiy <artem.bityutskiy@linux.intel.com>
    Original-patch-by: Zhang Rui <rui.zhang@intel.com>
    Signed-off-by: Len Brown <len.brown@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tpm: Check for completion after timeout [+ + +]

Author: Jarkko Sakkinen <jarkko@kernel.org>
Date:   Wed Jul 23 02:21:24 2025 +0300

    tpm: Check for completion after timeout
    
    [ Upstream commit d4640c394f23b202a89512346cf28f6622a49031 ]
    
    The current implementation of timeout detection works in the following
    way:
    
    1. Read completion status. If completed, return the data
    2. Sleep for some time (usleep_range)
    3. Check for timeout using current jiffies value. Return an error if
       timed out
    4. Goto 1
    
    usleep_range doesn't guarantee it's always going to wake up strictly in
    (min, max) range, so such a situation is possible:
    
    1. Driver reads completion status. No completion yet
    2. Process sleeps indefinitely. In the meantime, TPM responds
    3. We check for timeout without checking for the completion again.
       Result is lost.
    
    Such a situation also happens for the guest VMs: if vCPU goes to sleep
    and doesn't get scheduled for some time, the guest TPM driver will
    timeout instantly after waking up without checking for the completion
    (which may already be in place).
    
    Perform the completion check once again after exiting the busy loop in
    order to give the device the last chance to send us some data.
    
    Since now we check for completion in two places, extract this check into
    a separate function.
    
    Signed-off-by: Ivan Orlov <ivan.orlov0322@gmail.com>
    Reviewed-by: Jonathan McDowell <noodles@meta.com>
    Signed-off-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tpm: tpm_crb_ffa: try to probe tpm_crb_ffa when it's built-in [+ + +]

Author: Yeoreum Yun <yeoreum.yun@arm.com>
Date:   Sat Jul 19 13:27:46 2025 +0300

    tpm: tpm_crb_ffa: try to probe tpm_crb_ffa when it's built-in
    
    [ Upstream commit 746d9e9f62a6e8ba0eba2b83fc61cfe7fa8797ce ]
    
    To generate the boot_aggregate log in the IMA subsystem using TPM PCR
    values, the TPM driver must be built as built-in and must be probed
    before the IMA subsystem is initialized.
    
    However, when the TPM device operates over the FF-A protocol using the
    CRB interface, probing fails and returns -EPROBE_DEFER if the
    tpm_crb_ffa device — an FF-A device that provides the communication
    interface to the tpm_crb driver — has not yet been probed.
    
    This issue occurs because both crb_acpi_driver_init() and
    tpm_crb_ffa_driver_init() are registered with device_initcall.  As a
    result, crb_acpi_driver_init() may be invoked before
    tpm_crb_ffa_driver_init(), which is responsible for probing the
    tpm_crb_ffa device.
    
    When this happens, IMA fails to detect the TPM device and logs the
    following message:
    
      | ima: No TPM chip found, activating TPM-bypass!
    
    Consequently, it cannot generate the boot_aggregate log with the PCR
    values provided by the TPM.
    
    To resolve this issue, the tpm_crb_ffa_init() function explicitly
    attempts to probe the tpm_crb_ffa by register tpm_crb_ffa driver so that
    when tpm_crb_ffa device is created before tpm_crb_ffa_init(), probe the
    tpm_crb_ffa device in tpm_crb_ffa_init() to finish probe the TPM device
    completely.
    
    This ensures that the TPM device using CRB over FF-A can be successfully
    probed, even if crb_acpi_driver_init() is called first.
    
    [ jarkko: reformatted some of the paragraphs because they were going past
      the 75 character boundary. ]
    
    Signed-off-by: Yeoreum Yun <yeoreum.yun@arm.com>
    Reviewed-by: Mimi Zohar <zohar@linux.ibm.com>
    Reviewed-by: Sudeep Holla <sudeep.holla@arm.com>
    Reviewed-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Jarkko Sakkinen <jarkko@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tracefs: Add d_delete to remove negative dentries [+ + +]

Author: Steven Rostedt <rostedt@goodmis.org>
Date:   Wed Jun 11 12:18:15 2025 -0400

    tracefs: Add d_delete to remove negative dentries
    
    [ Upstream commit d9b13cdad80dc11d74408cf201939a946e9303a6 ]
    
    If a lookup in tracefs is done on a file that does not exist, it leaves a
    dentry hanging around until memory pressure removes it. But eventfs
    dentries should hang around as when their ref count goes to zero, it
    requires more work to recreate it. For the rest of the tracefs dentries,
    they hang around as their dentry is used as a descriptor for the tracing
    system. But if a file lookup happens for a file in tracefs that does not
    exist, it should be deleted.
    
    Add a .d_delete callback that checks if dentry->fsdata is set or not. Only
    eventfs dentries set fsdata so if it has content it should not be deleted
    and should hang around in the cache.
    
    Reported-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tracing: fprobe: Fix infinite recursion using preempt_*_notrace() [+ + +]

Author: Masami Hiramatsu (Google) <mhiramat@kernel.org>
Date:   Tue Jul 29 08:47:03 2025 +0900

    tracing: fprobe: Fix infinite recursion using preempt_*_notrace()
    
    commit a3e892ab0fc287389176eabdcd74234508f6e52d upstream.
    
    Since preempt_count_add/del() are tracable functions, it is not allowed
    to use preempt_disable/enable() in ftrace handlers. Without this fix,
    probing on `preempt_count_add%return` will cause an infinite recursion
    of fprobes.
    
    To fix this problem, use preempt_disable/enable_notrace() in
    fprobe_return().
    
    Link: https://lore.kernel.org/all/175374642359.1471729.1054175011228386560.stgit@mhiramat.tok.corp.google.com/
    
    Fixes: 4346ba160409 ("fprobe: Rewrite fprobe on function-graph tracer")
    Cc: stable@vger.kernel.org
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

tty: serial: fix print format specifiers [+ + +]

Author: Joseph Tilahun <jtilahun@astranis.com>
Date:   Mon Jun 9 23:56:53 2025 -0700

    tty: serial: fix print format specifiers
    
    [ Upstream commit 33a2515abd45c64911955ff1da179589db54f99f ]
    
    The serial info sometimes produces negative TX/RX counts. E.g.:
    
    3: uart:FSL_LPUART mmio:0x02970000 irq:46 tx:-1595870545 rx:339619
    RTS|CTS|DTR|DSR|CD
    
    It appears that the print format specifiers don't match with the types of
    the respective variables. E.g.: All of the fields in struct uart_icount
    are u32, but the format specifier used is %d, even though u32 is unsigned
    and %d is for signed integers. Update drivers/tty/serial/serial_core.c
    to use the proper format specifiers. Reference
    https://docs.kernel.org/core-api/printk-formats.html as the documentation
    for what format specifiers are the proper ones to use for a given C type.
    
    Signed-off-by: Joseph Tilahun <jtilahun@astranis.com>
    Link: https://lore.kernel.org/r/20250610065653.3750067-1-jtilahun@astranis.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

uapi: in6: restore visibility of most IPv6 socket options [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Mon Jun 9 07:39:33 2025 -0700

    uapi: in6: restore visibility of most IPv6 socket options
    
    [ Upstream commit 31557b3487b349464daf42bc4366153743c1e727 ]
    
    A decade ago commit 6d08acd2d32e ("in6: fix conflict with glibc")
    hid the definitions of IPV6 options, because GCC was complaining
    about duplicates. The commit did not list the warnings seen, but
    trying to recreate them now I think they are (building iproute2):
    
    In file included from ./include/uapi/rdma/rdma_user_cm.h:39,
                     from rdma.h:16,
                     from res.h:9,
                     from res-ctx.c:7:
    ../include/uapi/linux/in6.h:171:9: warning: ‘IPV6_ADD_MEMBERSHIP’ redefined
      171 | #define IPV6_ADD_MEMBERSHIP     20
          |         ^~~~~~~~~~~~~~~~~~~
    In file included from /usr/include/netinet/in.h:37,
                     from rdma.h:13:
    /usr/include/bits/in.h:233:10: note: this is the location of the previous definition
      233 | # define IPV6_ADD_MEMBERSHIP    IPV6_JOIN_GROUP
          |          ^~~~~~~~~~~~~~~~~~~
    ../include/uapi/linux/in6.h:172:9: warning: ‘IPV6_DROP_MEMBERSHIP’ redefined
      172 | #define IPV6_DROP_MEMBERSHIP    21
          |         ^~~~~~~~~~~~~~~~~~~~
    /usr/include/bits/in.h:234:10: note: this is the location of the previous definition
      234 | # define IPV6_DROP_MEMBERSHIP   IPV6_LEAVE_GROUP
          |          ^~~~~~~~~~~~~~~~~~~~
    
    Compilers don't complain about redefinition if the defines
    are identical, but here we have the kernel using the literal
    value, and glibc using an indirection (defining to a name
    of another define, with the same numerical value).
    
    Problem is, the commit in question hid all the IPV6 socket
    options, and glibc has a pretty sparse list. For instance
    it lacks Flow Label related options. Willem called this out
    in commit 3fb321fde22d ("selftests/net: ipv6 flowlabel"):
    
      /* uapi/glibc weirdness may leave this undefined */
      #ifndef IPV6_FLOWINFO
      #define IPV6_FLOWINFO 11
      #endif
    
    More interestingly some applications (socat) use
    a #ifdef IPV6_FLOWINFO to gate compilation of thier
    rudimentary flow label support. (For added confusion
    socat misspells it as IPV4_FLOWINFO in some places.)
    
    Hide only the two defines we know glibc has a problem
    with. If we discover more warnings we can hide more
    but we should avoid covering the entire block of
    defines for "IPV6 socket options".
    
    Link: https://patch.msgid.link/20250609143933.1654417-1-kuba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ublk: check for unprivileged daemon on each I/O fetch [+ + +]

Author: Caleb Sander Mateos <csander@purestorage.com>
Date:   Fri Aug 8 09:52:15 2025 -0600

    ublk: check for unprivileged daemon on each I/O fetch
    
    [ Upstream commit 5058a62875e1916e5133a1639f0207ea2148c0bc ]
    
    Commit ab03a61c6614 ("ublk: have a per-io daemon instead of a per-queue
    daemon") allowed each ublk I/O to have an independent daemon task.
    However, nr_privileged_daemon is only computed based on whether the last
    I/O fetched in each ublk queue has an unprivileged daemon task.
    Fix this by checking whether every fetched I/O's daemon is privileged.
    Change nr_privileged_daemon from a count of queues to a boolean
    indicating whether any I/Os have an unprivileged daemon.
    
    Signed-off-by: Caleb Sander Mateos <csander@purestorage.com>
    Fixes: ab03a61c6614 ("ublk: have a per-io daemon instead of a per-queue daemon")
    Reviewed-by: Ming Lei <ming.lei@redhat.com>
    Link: https://lore.kernel.org/r/20250808155216.296170-1-csander@purestorage.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

udf: Verify partition map count [+ + +]

Author: Jan Kara <jack@suse.cz>
Date:   Fri Jul 11 19:01:20 2025 +0200

    udf: Verify partition map count
    
    [ Upstream commit 1a11201668e8635602577dcf06f2e96c591d8819 ]
    
    Verify that number of partition maps isn't insanely high which can lead
    to large allocation in udf_sb_alloc_partition_maps(). All partition maps
    have to fit in the LVD which is in a single block.
    
    Reported-by: syzbot+478f2c1a6f0f447a46bb@syzkaller.appspotmail.com
    Signed-off-by: Jan Kara <jack@suse.cz>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

udp: also consider secpath when evaluating ipsec use for checksumming [+ + +]

Author: Sabrina Dubroca <sd@queasysnail.net>
Date:   Mon Aug 4 11:26:27 2025 +0200

    udp: also consider secpath when evaluating ipsec use for checksumming
    
    [ Upstream commit 1118aaa3b35157777890fffab91d8c1da841b20b ]
    
    Commit b40c5f4fde22 ("udp: disable inner UDP checksum offloads in
    IPsec case") tried to fix checksumming in UFO when the packets are
    going through IPsec, so that we can't rely on offloads because the UDP
    header and payload will be encrypted.
    
    But when doing a TCP test over VXLAN going through IPsec transport
    mode with GSO enabled (esp4_offload module loaded), I'm seeing broken
    UDP checksums on the encap after successful decryption.
    
    The skbs get to udp4_ufo_fragment/__skb_udp_tunnel_segment via
    __dev_queue_xmit -> validate_xmit_skb -> skb_gso_segment and at this
    point we've already dropped the dst (unless the device sets
    IFF_XMIT_DST_RELEASE, which is not common), so need_ipsec is false and
    we proceed with checksum offload.
    
    Make need_ipsec also check the secpath, which is not dropped on this
    callpath.
    
    Fixes: b40c5f4fde22 ("udp: disable inner UDP checksum offloads in IPsec case")
    Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>
    Signed-off-by: Steffen Klassert <steffen.klassert@secunet.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

um: Re-evaluate thread flags repeatedly [+ + +]

Author: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
Date:   Fri Jul 4 14:34:47 2025 +0200

    um: Re-evaluate thread flags repeatedly
    
    [ Upstream commit b9e2f2246eb2b5617d53af7b5e4e1b8c916f26a8 ]
    
    The thread flags may change during their processing.
    For example a task_work can queue a new signal to be sent.
    This signal should be delivered before returning to usespace again.
    
    Evaluate the flags repeatedly similar to other architectures.
    
    Signed-off-by: Thomas Weißschuh <thomas.weissschuh@linutronix.de>
    Reviewed-by: Nam Cao <namcao@linutronix.de>
    Link: https://patch.msgid.link/20250704-uml-thread_flags-v1-1-0e293fd8d627@linutronix.de
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: core: config: Prevent OOB read in SS endpoint companion parsing [+ + +]

Author: Xinyu Liu <katieeliu@tencent.com>
Date:   Mon Jun 30 10:02:56 2025 +0800

    usb: core: config: Prevent OOB read in SS endpoint companion parsing
    
    commit cf16f408364efd8a68f39011a3b073c83a03612d upstream.
    
    usb_parse_ss_endpoint_companion() checks descriptor type before length,
    enabling a potentially odd read outside of the buffer size.
    
    Fix this up by checking the size first before looking at any of the
    fields in the descriptor.
    
    Signed-off-by: Xinyu Liu <katieeliu@tencent.com>
    Cc: stable <stable@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: core: usb_submit_urb: downgrade type check [+ + +]

Author: Oliver Neukum <oneukum@suse.com>
Date:   Thu Jun 12 14:20:25 2025 +0200

    usb: core: usb_submit_urb: downgrade type check
    
    [ Upstream commit 503bbde34cc3dd2acd231f277ba70c3f9ed22e59 ]
    
    Checking for the endpoint type is no reason for a WARN, as that can
    cause a reboot. A driver not checking the endpoint type must not cause a
    reboot, as there is just no point in this.  We cannot prevent a device
    from doing something incorrect as a reaction to a transfer. Hence
    warning for a mere assumption being wrong is not sensible.
    
    Signed-off-by: Oliver Neukum <oneukum@suse.com>
    Acked-by: Alan Stern <stern@rowland.harvard.edu>
    Link: https://lore.kernel.org/r/20250612122149.2559724-1-oneukum@suse.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: dwc3: xilinx: add shutdown callback [+ + +]

Author: Radhey Shyam Pandey <radhey.shyam.pandey@amd.com>
Date:   Wed Jun 4 00:39:31 2025 +0530

    usb: dwc3: xilinx: add shutdown callback
    
    [ Upstream commit 70627bf82e36e61c40c3315e1206e4ea4c02e668 ]
    
    Adds a shutdown callback to ensure that the XHCI stack is properly
    shutdown in reboot/shutdown path.
    
    In kexec flow, kernel_restart_prepare() performs actions necessary
    to prepare the system for a restart and invokes device_shutdown. To
    ensure proper shutdown attach the dwc3 shutdown implementation which
    mirrors the remove method.
    
    $ kexec -e
    
    <snip>
    xhci-hcd xhci-hcd.0.auto: remove, state 1
    usb usb1: USB disconnect, device number 1
    usb 1-1: USB disconnect, device number 6
    xhci-hcd xhci-hcd.0.auto: USB bus 1 deregistered
    kexec_core: Starting new kernel
    
    Signed-off-by: Radhey Shyam Pandey <radhey.shyam.pandey@amd.com>
    Acked-by: Thinh Nguyen <Thinh.Nguyen@synopsys.com>
    Link: https://lore.kernel.org/r/1748977771-714153-1-git-send-email-radhey.shyam.pandey@amd.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: typec: fusb302: cache PD RX state [+ + +]

Author: Sebastian Reichel <sebastian.reichel@collabora.com>
Date:   Fri Jul 4 19:55:06 2025 +0200

    usb: typec: fusb302: cache PD RX state
    
    commit 1e61f6ab08786d66a11cfc51e13d6f08a6b06c56 upstream.
    
    This patch fixes a race condition communication error, which ends up in
    PD hard resets when losing the race. Some systems, like the Radxa ROCK
    5B are powered through USB-C without any backup power source and use a
    FUSB302 chip to do the PD negotiation. This means it is quite important
    to avoid hard resets, since that effectively kills the system's
    power-supply.
    
    I've found the following race condition while debugging unplanned power
    loss during booting the board every now and then:
    
    1. lots of TCPM/FUSB302/PD initialization stuff
    2. TCPM ends up in SNK_WAIT_CAPABILITIES (tcpm_set_pd_rx is enabled here)
    3. the remote PD source does not send anything, so TCPM does a SOFT RESET
    4. TCPM ends up in SNK_WAIT_CAPABILITIES for the second time
       (tcpm_set_pd_rx is enabled again, even though it is still on)
    
    At this point I've seen broken CRC good messages being send by the
    FUSB302 with a logic analyzer sniffing the CC lines. Also it looks like
    messages are being lost and things generally going haywire with one of
    the two sides doing a hard reset once a broken CRC good message was send
    to the bus.
    
    I think the system is running into a race condition, that the FIFOs are
    being cleared and/or the automatic good CRC message generation flag is
    being updated while a message is already arriving.
    
    Let's avoid this by caching the PD RX enabled state, as we have already
    processed anything in the FIFOs and are in a good state. As a side
    effect that this also optimizes I2C bus usage :)
    
    As far as I can tell the problem theoretically also exists when TCPM
    enters SNK_WAIT_CAPABILITIES the first time, but I believe this is less
    critical for the following reason:
    
    On devices like the ROCK 5B, which are powered through a TCPM backed
    USB-C port, the bootloader must have done some prior PD communication
    (initial communication must happen within 5 seconds after plugging the
    USB-C plug). This means the first time the kernel TCPM state machine
    reaches SNK_WAIT_CAPABILITIES, the remote side is not sending messages
    actively. On other devices a hard reset simply adds some extra delay and
    things should be good afterwards.
    
    Fixes: c034a43e72dda ("staging: typec: Fairchild FUSB302 Type-c chip driver")
    Cc: stable <stable@kernel.org>
    Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Link: https://lore.kernel.org/r/20250704-fusb302-race-condition-fix-v1-1-239012c0e27a@kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: typec: intel_pmc_mux: Defer probe if SCU IPC isn't present [+ + +]

Author: Tomasz Michalec <tmichalec@google.com>
Date:   Tue Jun 10 17:40:58 2025 +0200

    usb: typec: intel_pmc_mux: Defer probe if SCU IPC isn't present
    
    [ Upstream commit df9a825f330e76c72d1985bc9bdc4b8981e3d15f ]
    
    If pmc_usb_probe is called before SCU IPC is registered, pmc_usb_probe
    will fail.
    
    Return -EPROBE_DEFER when pmc_usb_probe doesn't get SCU IPC device, so
    the probe function can be called again after SCU IPC is initialized.
    
    Signed-off-by: Tomasz Michalec <tmichalec@google.com>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Link: https://lore.kernel.org/r/20250610154058.1859812-1-tmichalec@google.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: typec: tcpm/tcpci_maxim: fix irq wake usage [+ + +]

Author: André Draszik <andre.draszik@linaro.org>
Date:   Mon Jul 7 11:50:27 2025 +0100

    usb: typec: tcpm/tcpci_maxim: fix irq wake usage
    
    [ Upstream commit 31611223fb34a3e9320cdfc4f4395072a13ea78e ]
    
    This driver calls enable_irq_wake() during probe() unconditionally, and
    never issues the required corresponding disable_irq_wake() to disable
    hardware interrupt wakeup signals.
    
    Additionally, whether or not a device should wake-up the system is
    meant to be a policy decision based on sysfs (.../power/wakeup) in the
    first place.
    
    Update the driver to use the standard approach to enable/disable IRQ
    wake during the suspend/resume callbacks. This solves both issues
    described above.
    
    Signed-off-by: André Draszik <andre.draszik@linaro.org>
    Reviewed-by: Badhri Jagan Sridharan <badhri@google.com>
    Link: https://lore.kernel.org/r/20250707-max77759-irq-wake-v1-1-d367f633e4bc@linaro.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: typec: ucsi: Add poll_cci operation to cros_ec_ucsi [+ + +]

Author: Jameson Thies <jthies@google.com>
Date:   Fri Jul 11 20:20:33 2025 +0000

    usb: typec: ucsi: Add poll_cci operation to cros_ec_ucsi
    
    [ Upstream commit 300386d117a98961fc1d612d1f1a61997d731b8a ]
    
    cros_ec_ucsi fails to allocate a UCSI instance in it's probe function
    because it does not define all operations checked by ucsi_create.
    Update cros_ec_ucsi operations to use the same function for read_cci
    and poll_cci.
    
    Signed-off-by: Jameson Thies <jthies@google.com>
    Reviewed-by: Benson Leung <bleung@chromium.org>
    Link: https://lore.kernel.org/r/20250711202033.2201305-1-jthies@google.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: typec: ucsi: psy: Set current max to 100mA for BC 1.2 and Default [+ + +]

Author: Benson Leung <bleung@chromium.org>
Date:   Thu Jul 17 20:08:05 2025 +0000

    usb: typec: ucsi: psy: Set current max to 100mA for BC 1.2 and Default
    
    [ Upstream commit af833e7f7db3cf4c82f063668e1b52297a30ec18 ]
    
    ucsi_psy_get_current_max would return 0mA as the maximum current if
    UCSI detected a BC or a Default USB Power sporce.
    
    The comment in this function is true that we can't tell the difference
    between DCP/CDP or SDP chargers, but we can guarantee that at least 1-unit
    of USB 1.1/2.0 power is available, which is 100mA, which is a better
    fallback value than 0, which causes some userspaces, including the ChromeOS
    power manager, to regard this as a power source that is not providing
    any power.
    
    In reality, 100mA is guaranteed from all sources in these classes.
    
    Signed-off-by: Benson Leung <bleung@chromium.org>
    Reviewed-by: Jameson Thies <jthies@google.com>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Reviewed-by: Sebastian Reichel <sebastian.reichel@collabora.com>
    Link: https://lore.kernel.org/r/20250717200805.3710473-1-bleung@chromium.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: typec: ucsi: Update power_supply on power role change [+ + +]

Author: Myrrh Periwinkle <myrrhperiwinkle@qtmlabs.xyz>
Date:   Mon Jul 21 13:32:51 2025 +0700

    usb: typec: ucsi: Update power_supply on power role change
    
    commit 7616f006db07017ef5d4ae410fca99279aaca7aa upstream.
    
    The current power direction of an USB-C port also influences the
    power_supply's online status, so a power role change should also update
    the power_supply.
    
    Fixes an issue on some systems where plugging in a normal USB device in
    for the first time after a reboot will cause upower to erroneously
    consider the system to be connected to AC power.
    
    Cc: stable <stable@kernel.org>
    Fixes: 0e6371fbfba3 ("usb: typec: ucsi: Report power supply changes")
    Signed-off-by: Myrrh Periwinkle <myrrhperiwinkle@qtmlabs.xyz>
    Reviewed-by: Heikki Krogerus <heikki.krogerus@linux.intel.com>
    Link: https://lore.kernel.org/r/20250721-fix-ucsi-pwr-dir-notify-v1-1-e53d5340cb38@qtmlabs.xyz
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

usb: xhci: Avoid showing errors during surprise removal [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Thu Jul 17 10:31:05 2025 +0300

    usb: xhci: Avoid showing errors during surprise removal
    
    [ Upstream commit 4b9c60e440525b729ac5f071e00bcee12e0a7e84 ]
    
    When a USB4 dock is unplugged from a system it won't respond to ring
    events. The PCI core handles the surprise removal event and notifies
    all PCI drivers. The XHCI PCI driver sets a flag that the device is
    being removed as well.
    
    When that flag is set don't show messages in the cleanup path for
    marking the controller dead.
    
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com>
    Acked-by: Mathias Nyman <mathias.nyman@linux.intel.com>
    Link: https://lore.kernel.org/r/20250717073107.488599-2-mathias.nyman@linux.intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: xhci: Avoid showing warnings for dying controller [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Thu Jul 17 10:31:06 2025 +0300

    usb: xhci: Avoid showing warnings for dying controller
    
    [ Upstream commit 65fc0fc137b5da3ee1f4ca4f61050fcb203d7582 ]
    
    When a USB4 dock is unplugged from a system it won't respond to ring
    events. The PCI core handles the surprise removal event and notifies
    all PCI drivers. The XHCI PCI driver sets a flag that the device is
    being removed, and when the device stops responding a flag is also
    added to indicate it's dying.
    
    When that flag is set don't bother to show warnings about a missing
    controller.
    
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com>
    Acked-by: Mathias Nyman <mathias.nyman@linux.intel.com>
    Link: https://lore.kernel.org/r/20250717073107.488599-3-mathias.nyman@linux.intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: xhci: print xhci->xhc_state when queue_command failed [+ + +]

Author: Su Hui <suhui@nfschina.com>
Date:   Fri Jul 25 14:01:18 2025 +0800

    usb: xhci: print xhci->xhc_state when queue_command failed
    
    [ Upstream commit 7919407eca2ef562fa6c98c41cfdf6f6cdd69d92 ]
    
    When encounters some errors like these:
    xhci_hcd 0000:4a:00.2: xHCI dying or halted, can't queue_command
    xhci_hcd 0000:4a:00.2: FIXME: allocate a command ring segment
    usb usb5-port6: couldn't allocate usb_device
    
    It's hard to know whether xhc_state is dying or halted. So it's better
    to print xhc_state's value which can help locate the resaon of the bug.
    
    Signed-off-by: Su Hui <suhui@nfschina.com>
    Link: https://lore.kernel.org/r/20250725060117.1773770-1-suhui@nfschina.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

usb: xhci: Set avg_trb_len = 8 for EP0 during Address Device Command [+ + +]

Author: Jay Chen <shawn2000100@gmail.com>
Date:   Thu Jul 17 10:31:07 2025 +0300

    usb: xhci: Set avg_trb_len = 8 for EP0 during Address Device Command
    
    [ Upstream commit f72b9aa821a2bfe4b6dfec4be19f264d0673b008 ]
    
    There is a subtle contradiction between sections of the xHCI 1.2 spec
    regarding the initialization of Input Endpoint Context fields. Section
    4.8.2 ("Endpoint Context Initialization") states that all fields should
    be initialized to 0. However, Section 6.2.3 ("Endpoint Context", p.453)
    specifies that the Average TRB Length (avg_trb_len) field shall be
    greater than 0, and explicitly notes (p.454): "Software shall set
    Average TRB Length to '8' for control endpoints."
    
    Strictly setting all fields to 0 during initialization conflicts with
    the specific recommendation for control endpoints. In practice, setting
    avg_trb_len = 0 is not meaningful for the hardware/firmware, as the
    value is used for bandwidth calculation.
    
    Motivation: Our company is developing a custom Virtual xHC hardware
    platform that strictly follows the xHCI spec and its recommendations.
    During validation, we observed that enumeration fails and a parameter
    error (TRB Completion Code = 5) is reported if avg_trb_len for EP0 is
    not set to 8 as recommended by Section 6.2.3. This demonstrates the
    importance of assigning a meaningful, non-zero value to avg_trb_len,
    even in virtualized or emulated environments.
    
    This patch explicitly sets avg_trb_len to 8 for EP0 in
    xhci_setup_addressable_virt_dev(), as recommended in Section 6.2.3, to
    prevent potential issues with xHCI host controllers that enforce the
    spec strictly.
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=220033
    Signed-off-by: Jay Chen <shawn2000100@gmail.com>
    Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com>
    Link: https://lore.kernel.org/r/20250717073107.488599-4-mathias.nyman@linux.intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

userfaultfd: fix a crash in UFFDIO_MOVE when PMD is a migration entry [+ + +]

Author: Suren Baghdasaryan <surenb@google.com>
Date:   Wed Aug 6 15:00:22 2025 -0700

    userfaultfd: fix a crash in UFFDIO_MOVE when PMD is a migration entry
    
    commit aba6faec0103ed8f169be8dce2ead41fcb689446 upstream.
    
    When UFFDIO_MOVE encounters a migration PMD entry, it proceeds with
    obtaining a folio and accessing it even though the entry is swp_entry_t.
    Add the missing check and let split_huge_pmd() handle migration entries.
    While at it also remove unnecessary folio check.
    
    [surenb@google.com: remove extra folio check, per David]
      Link: https://lkml.kernel.org/r/20250807200418.1963585-1-surenb@google.com
    Link: https://lkml.kernel.org/r/20250806220022.926763-1-surenb@google.com
    Fixes: adef440691ba ("userfaultfd: UFFDIO_MOVE uABI")
    Signed-off-by: Suren Baghdasaryan <surenb@google.com>
    Reported-by: syzbot+b446dbe27035ef6bd6c2@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/all/68794b5c.a70a0220.693ce.0050.GAE@google.com/
    Reviewed-by: Peter Xu <peterx@redhat.com>
    Acked-by: David Hildenbrand <david@redhat.com>
    Cc: Andrea Arcangeli <aarcange@redhat.com>
    Cc: Lokesh Gidra <lokeshgidra@google.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

verification/dot2k: Make a separate dot2k_templates/Kconfig_container [+ + +]

Author: Nam Cao <namcao@linutronix.de>
Date:   Fri Jul 4 15:19:57 2025 +0200

    verification/dot2k: Make a separate dot2k_templates/Kconfig_container
    
    [ Upstream commit 214459699fd202c28b7b9f787e674acbd3af724a ]
    
    A generated container's Kconfig has an incorrect line:
    
        select DA_MON_EVENTS_IMPLICIT
    
    This is due to container generation uses the same template Kconfig file as
    deterministic automaton monitor.
    
    Therefore, make a separate Kconfig template for container which has only
    the necessaries for container.
    
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Link: https://lore.kernel.org/d54fd7ee120785bec5695220e837dbbd6efb30e5.1751634289.git.namcao@linutronix.de
    Reviewed-by: Gabriele Monaco <gmonaco@redhat.com>
    Signed-off-by: Nam Cao <namcao@linutronix.de>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

vfio/mlx5: fix possible overflow in tracking max message size [+ + +]

Author: Artem Sadovnikov <a.sadovnikov@ispras.ru>
Date:   Tue Jul 1 14:40:17 2025 +0000

    vfio/mlx5: fix possible overflow in tracking max message size
    
    [ Upstream commit b3060198483bac43ec113c62ae3837076f61f5de ]
    
    MLX cap pg_track_log_max_msg_size consists of 5 bits, value of which is
    used as power of 2 for max_msg_size. This can lead to multiplication
    overflow between max_msg_size (u32) and integer constant, and afterwards
    incorrect value is being written to rq_size.
    
    Fix this issue by extending integer constant to u64 type.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Suggested-by: Alex Williamson <alex.williamson@redhat.com>
    Signed-off-by: Artem Sadovnikov <a.sadovnikov@ispras.ru>
    Reviewed-by: Yishai Hadas <yishaih@nvidia.com>
    Link: https://lore.kernel.org/r/20250701144017.2410-2-a.sadovnikov@ispras.ru
    Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

vfio/type1: conditional rescheduling while pinning [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Tue Jul 15 11:46:22 2025 -0700

    vfio/type1: conditional rescheduling while pinning
    
    [ Upstream commit b1779e4f209c7ff7e32f3c79d69bca4e3a3a68b6 ]
    
    A large DMA mapping request can loop through dma address pinning for
    many pages. In cases where THP can not be used, the repeated vmf_insert_pfn can
    be costly, so let the task reschedule as need to prevent CPU stalls. Failure to
    do so has potential harmful side effects, like increased memory pressure
    as unrelated rcu tasks are unable to make their reclaim callbacks and
    result in OOM conditions.
    
     rcu: INFO: rcu_sched self-detected stall on CPU
     rcu:   36-....: (20999 ticks this GP) idle=b01c/1/0x4000000000000000 softirq=35839/35839 fqs=3538
     rcu:            hardirqs   softirqs   csw/system
     rcu:    number:        0        107            0
     rcu:   cputime:       50          0        10446   ==> 10556(ms)
     rcu:   (t=21075 jiffies g=377761 q=204059 ncpus=384)
    ...
      <TASK>
      ? asm_sysvec_apic_timer_interrupt+0x16/0x20
      ? walk_system_ram_range+0x63/0x120
      ? walk_system_ram_range+0x46/0x120
      ? pgprot_writethrough+0x20/0x20
      lookup_memtype+0x67/0xf0
      track_pfn_insert+0x20/0x40
      vmf_insert_pfn_prot+0x88/0x140
      vfio_pci_mmap_huge_fault+0xf9/0x1b0 [vfio_pci_core]
      __do_fault+0x28/0x1b0
      handle_mm_fault+0xef1/0x2560
      fixup_user_fault+0xf5/0x270
      vaddr_get_pfns+0x169/0x2f0 [vfio_iommu_type1]
      vfio_pin_pages_remote+0x162/0x8e0 [vfio_iommu_type1]
      vfio_iommu_type1_ioctl+0x1121/0x1810 [vfio_iommu_type1]
      ? futex_wake+0x1c1/0x260
      x64_sys_call+0x234/0x17a0
      do_syscall_64+0x63/0x130
      ? exc_page_fault+0x63/0x130
      entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Reviewed-by: Paul E. McKenney <paulmck@kernel.org>
    Link: https://lore.kernel.org/r/20250715184622.3561598-1-kbusch@meta.com
    Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

vhost: fail early when __vhost_add_used() fails [+ + +]

Author: Jason Wang <jasowang@redhat.com>
Date:   Mon Jul 14 16:47:53 2025 +0800

    vhost: fail early when __vhost_add_used() fails
    
    [ Upstream commit b4ba1207d45adaafa2982c035898b36af2d3e518 ]
    
    This patch fails vhost_add_used_n() early when __vhost_add_used()
    fails to make sure used idx is not updated with stale used ring
    information.
    
    Reported-by: Eugenio Pérez <eperezma@redhat.com>
    Signed-off-by: Jason Wang <jasowang@redhat.com>
    Message-Id: <20250714084755.11921-2-jasowang@redhat.com>
    Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
    Tested-by: Lei Yang <leiyang@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

vsock/virtio: Resize receive buffers so that each SKB fits in a 4K page [+ + +]

Author: Will Deacon <will@kernel.org>
Date:   Thu Jul 17 10:01:11 2025 +0100

    vsock/virtio: Resize receive buffers so that each SKB fits in a 4K page
    
    [ Upstream commit 03a92f036a04fed2b00d69f5f46f1a486e70dc5c ]
    
    When allocating receive buffers for the vsock virtio RX virtqueue, an
    SKB is allocated with a 4140 data payload (the 44-byte packet header +
    VIRTIO_VSOCK_DEFAULT_RX_BUF_SIZE). Even when factoring in the SKB
    overhead, the resulting 8KiB allocation thanks to the rounding in
    kmalloc_reserve() is wasteful (~3700 unusable bytes) and results in a
    higher-order page allocation on systems with 4KiB pages just for the
    sake of a few hundred bytes of packet data.
    
    Limit the vsock virtio RX buffers to 4KiB per SKB, resulting in much
    better memory utilisation and removing the need to allocate higher-order
    pages entirely.
    
    Reviewed-by: Stefano Garzarella <sgarzare@redhat.com>
    Signed-off-by: Will Deacon <will@kernel.org>
    Message-Id: <20250717090116.11987-5-will@kernel.org>
    Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

watchdog: dw_wdt: Fix default timeout [+ + +]

Author: Sebastian Reichel <sebastian.reichel@collabora.com>
Date:   Thu Jul 17 18:55:02 2025 +0200

    watchdog: dw_wdt: Fix default timeout
    
    [ Upstream commit ac3dbb91e0167d017f44701dd51c1efe30d0c256 ]
    
    The Synopsys Watchdog driver sets the default timeout to 30 seconds,
    but on some devices this is not a valid timeout. E.g. on RK3588 the
    actual timeout being used is 44 seconds instead.
    
    Once the watchdog is started the value is updated accordingly, but
    it would be better to expose a sensible timeout to userspace without
    the need to first start the watchdog.
    
    Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>
    Reviewed-by: Guenter Roeck <linux@roeck-us.net>
    Link: https://lore.kernel.org/r/20250717-dw-wdt-fix-initial-timeout-v1-1-86dc864d48dd@kernel.org
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Wim Van Sebroeck <wim@linux-watchdog.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

watchdog: iTCO_wdt: Report error if timeout configuration fails [+ + +]

Author: Ziyan Fu <fuzy5@lenovo.com>
Date:   Fri Jul 4 15:35:18 2025 +0800

    watchdog: iTCO_wdt: Report error if timeout configuration fails
    
    [ Upstream commit 40efc43eb7ffb5a4e2f998c13b8cfb555e671b92 ]
    
    The driver probes with the invalid timeout value when
    'iTCO_wdt_set_timeout()' fails, as its return value is not checked. In
    this case, when executing "wdctl", we may get:
    
    Device:        /dev/watchdog0
    Timeout:       30 seconds
    Timeleft:      613 seconds
    
    The timeout value is the value of "heartbeat" or "WATCHDOG_TIMEOUT", and
    the timeleft value is calculated from the register value we actually read
    (0xffff) by masking with 0x3ff and converting ticks to seconds (* 6 / 10).
    
    Add error handling to return the failure code if 'iTCO_wdt_set_timeout()'
    fails, ensuring the driver probe fails and prevents invalid operation.
    
    Signed-off-by: Ziyan Fu <fuzy5@lenovo.com>
    Reviewed-by: Guenter Roeck <linux@roeck-us.net>
    Link: https://lore.kernel.org/r/20250704073518.7838-1-13281011316@163.com
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Wim Van Sebroeck <wim@linux-watchdog.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

watchdog: sbsa: Adjust keepalive timeout to avoid MediaTek WS0 race condition [+ + +]

Author: Aaron Plattner <aplattner@nvidia.com>
Date:   Mon Jul 21 16:06:39 2025 -0700

    watchdog: sbsa: Adjust keepalive timeout to avoid MediaTek WS0 race condition
    
    [ Upstream commit 48defdf6b083f74a44e1f742db284960d3444aec ]
    
    The MediaTek implementation of the sbsa_gwdt watchdog has a race
    condition where a write to SBSA_GWDT_WRR is ignored if it occurs while
    the hardware is processing a timeout refresh that asserts WS0.
    
    Detect this based on the hardware implementer and adjust
    wdd->min_hw_heartbeat_ms to avoid the race by forcing the keepalive ping
    to be one second later.
    
    Signed-off-by: Aaron Plattner <aplattner@nvidia.com>
    Acked-by: Timur Tabi <ttabi@nvidia.com>
    Reviewed-by: Guenter Roeck <linux@roeck-us.net>
    Link: https://lore.kernel.org/r/20250721230640.2244915-1-aplattner@nvidia.com
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Wim Van Sebroeck <wim@linux-watchdog.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath10k: shutdown driver when hardware is unreliable [+ + +]

Author: Kang Yang <kang.yang@oss.qualcomm.com>
Date:   Mon Jun 23 10:27:31 2025 +0800

    wifi: ath10k: shutdown driver when hardware is unreliable
    
    [ Upstream commit c256a94d1b1b15109740306f7f2a7c2173e12072 ]
    
    In rare cases, ath10k may lose connection with the PCIe bus due to
    some unknown reasons, which could further lead to system crashes during
    resuming due to watchdog timeout:
    
    ath10k_pci 0000:01:00.0: wmi command 20486 timeout, restarting hardware
    ath10k_pci 0000:01:00.0: already restarting
    ath10k_pci 0000:01:00.0: failed to stop WMI vdev 0: -11
    ath10k_pci 0000:01:00.0: failed to stop vdev 0: -11
    ieee80211 phy0: PM: **** DPM device timeout ****
    Call Trace:
     panic+0x125/0x315
     dpm_watchdog_set+0x54/0x54
     dpm_watchdog_handler+0x57/0x57
     call_timer_fn+0x31/0x13c
    
    At this point, all WMI commands will timeout and attempt to restart
    device. So set a threshold for consecutive restart failures. If the
    threshold is exceeded, consider the hardware is unreliable and all
    ath10k operations should be skipped to avoid system crash.
    
    fail_cont_count and pending_recovery are atomic variables, and
    do not involve complex conditional logic. Therefore, even if recovery
    check and reconfig complete are executed concurrently, the recovery
    mechanism will not be broken.
    
    Tested-on: QCA6174 hw3.2 PCI WLAN.RM.4.4.1-00288-QCARMSWPZ-1
    
    Signed-off-by: Kang Yang <kang.yang@oss.qualcomm.com>
    Reviewed-by: Loic Poulain <loic.poulain@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250623022731.509-1-kang.yang@oss.qualcomm.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: Add memset and update default rate value in wmi tx completion [+ + +]

Author: Sarika Sharma <quic_sarishar@quicinc.com>
Date:   Tue Jun 3 12:05:12 2025 +0530

    wifi: ath12k: Add memset and update default rate value in wmi tx completion
    
    [ Upstream commit 9903c0986f782dfc511d7638b6f15fb6e8600cd3 ]
    
    When both AP/STA and monitor interfaces are enabled, ieee80211_tx_status()
    is invoked from two paths: the TX completion handler for data frames
    and the WMI TX completion handler for management frames.
    In the data path, the skb->cb is properly zeroed using memset, but in
    the WMI path, this step is missing.
    
    As a result, mac80211 encountered uninitialized (junk) values in
    skb->cb when generating the radiotap header for monitor mode, leading
    to invalid radiotap lengths.
    
    Hence, explicitly zero the status field in the skb->cb using memset
    in WMI TX completion path to ensure consistent and correct behavior
    during WMI tx completion path.
    
    Additionally, set info->status.rates[0].idx = -1 to indicate that
    no valid rate information is available, avoiding misinterpretation of
    garbage values.
    
    Tested-on: QCN9274 hw2.0 PCI WLAN.WBE.1.4.1-00199-QCAHKSWPL_SILICONZ-1
    
    Signed-off-by: Sarika Sharma <quic_sarishar@quicinc.com>
    Reviewed-by: Vasanthakumar Thiagarajan <vasanthakumar.thiagarajan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250603063512.1887652-1-quic_sarishar@quicinc.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: Correct tid cleanup when tid setup fails [+ + +]

Author: Sarika Sharma <quic_sarishar@quicinc.com>
Date:   Mon Jul 21 11:47:49 2025 +0530

    wifi: ath12k: Correct tid cleanup when tid setup fails
    
    [ Upstream commit 4a2bf707270f897ab8077baee8ed5842a5321686 ]
    
    Currently, if any error occurs during ath12k_dp_rx_peer_tid_setup(),
    the tid value is already incremented, even though the corresponding
    TID is not actually allocated. Proceed to
    ath12k_dp_rx_peer_tid_delete() starting from unallocated tid,
    which might leads to freeing unallocated TID and cause potential
    crash or out-of-bounds access.
    
    Hence, fix by correctly decrementing tid before cleanup to match only
    the successfully allocated TIDs.
    
    Also, remove tid-- from failure case of ath12k_dp_rx_peer_frag_setup(),
    as decrementing the tid before cleanup in loop will take care of this.
    
    Compile tested only.
    
    Signed-off-by: Sarika Sharma <quic_sarishar@quicinc.com>
    Reviewed-by: Vasanthakumar Thiagarajan <vasanthakumar.thiagarajan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250721061749.886732-1-quic_sarishar@quicinc.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: Decrement TID on RX peer frag setup error handling [+ + +]

Author: Karthikeyan Kathirvel <quic_kathirve@quicinc.com>
Date:   Mon May 26 09:17:13 2025 +0530

    wifi: ath12k: Decrement TID on RX peer frag setup error handling
    
    [ Upstream commit 7c0884fcd2ddde0544d2e77f297ae461e1f53f58 ]
    
    Currently, TID is not decremented before peer cleanup, during error
    handling path of ath12k_dp_rx_peer_frag_setup(). This could lead to
    out-of-bounds access in peer->rx_tid[].
    
    Hence, add a decrement operation for TID, before peer cleanup to
    ensures proper cleanup and prevents out-of-bounds access issues when
    the RX peer frag setup fails.
    
    Found during code review. Compile tested only.
    
    Signed-off-by: Karthikeyan Kathirvel <quic_kathirve@quicinc.com>
    Signed-off-by: Sarika Sharma <quic_sarishar@quicinc.com>
    Reviewed-by: Vasanthakumar Thiagarajan <vasanthakumar.thiagarajan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250526034713.712592-1-quic_sarishar@quicinc.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: Enable REO queue lookup table feature on QCN9274 hw2.0 [+ + +]

Author: Raj Kumar Bhagat <quic_rajkbhag@quicinc.com>
Date:   Mon Jun 9 08:48:50 2025 +0530

    wifi: ath12k: Enable REO queue lookup table feature on QCN9274 hw2.0
    
    [ Upstream commit b79742b84e16e41c4a09f3126436f39f36e75c06 ]
    
    The commit 89ac53e96217 ("wifi: ath12k: Enable REO queue lookup table
    feature on QCN9274") originally intended to enable the reoq_lut_support
    hardware parameter flag for both QCN9274 hw1.0 and hw2.0. However,
    it enabled it only for QCN9274 hw1.0.
    
    Hence, enable REO queue lookup table feature on QCN9274 hw2.0.
    
    Tested-on: QCN9274 hw2.0 PCI WLAN.WBE.1.4.1-00199-QCAHKSWPL_SILICONZ-1
    
    Signed-off-by: Raj Kumar Bhagat <quic_rajkbhag@quicinc.com>
    Reviewed-by: Vasanthakumar Thiagarajan <vasanthakumar.thiagarajan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250609-qcn9274-reoq-v1-1-a92c91abc9b9@quicinc.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: Fix beacon reception for sta associated to Non-TX AP [+ + +]

Author: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
Date:   Fri May 30 09:26:15 2025 +0530

    wifi: ath12k: Fix beacon reception for sta associated to Non-TX AP
    
    [ Upstream commit ce7c93d196bfd2190dc4a0b86deee04e82042ed1 ]
    
    Currently, when a station is associated with a Non-Transmitting BSS of an
    MBSSID set, beacons are not frequently received from the firmware. This
    results in missing events via beacons, such as channel switches, leading
    to the station not switching to new channel as the AP does, eventually
    causing a kick out event from the firmware. This issue arises due to
    missing configuration of Non-Transmitting BSS information in station's
    vdev up command.
    
    Fix this by filling information such as the Transmitting BSSID, profile index
    and profile count in vdev up command of station.
    
    Tested-on: QCN9274 hw2.0 PCI WLAN.WBE.1.4.1-00199-QCAHKSWPL_SILICONZ-1
    Tested-on: WCN7850 hw2.0 PCI WLAN.HMT.1.1.c5-00284-QCAHMTSWPL_V1.0_V2.0_SILICONZ-1
    
    Signed-off-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Reviewed-by: Vasanthakumar Thiagarajan <vasanthakumar.thiagarajan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250530035615.3178480-3-rameshkumar.sundaram@oss.qualcomm.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: Fix station association with MBSSID Non-TX BSS [+ + +]

Author: Hari Chandrakanthan <quic_haric@quicinc.com>
Date:   Fri May 30 09:26:14 2025 +0530

    wifi: ath12k: Fix station association with MBSSID Non-TX BSS
    
    [ Upstream commit 70eeacc1a92a444f4b5777ab19e1c378a5edc8dd ]
    
    ath12k station is unable to associate with non-transmitting BSSes
    in a Multiple BSS set because the user-space does not receive
    information about the non-transmitting BSSes from mac80211's
    scan results.
    
    The ath12k driver does not advertise its MBSSID capability to mac80211,
    resulting in wiphy->support_mbssid not being set. Consequently, the
    information about non-transmitting BSS is not parsed from received
    Beacon/Probe response frames and is therefore not included in the
    scan results.
    
    Fix this by advertising the MBSSID capability of ath12k driver to
    mac80211.
    
    Tested-on: QCN9274 hw2.0 PCI WLAN.WBE.1.4.1-00199-QCAHKSWPL_SILICONZ-1
    Tested-on: WCN7850 hw2.0 PCI WLAN.HMT.1.1.c5-00284-QCAHMTSWPL_V1.0_V2.0_SILICONZ-1
    
    Signed-off-by: Hari Chandrakanthan <quic_haric@quicinc.com>
    Signed-off-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Reviewed-by: Vasanthakumar Thiagarajan <vasanthakumar.thiagarajan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250530035615.3178480-2-rameshkumar.sundaram@oss.qualcomm.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: cfg80211: Fix interface type validation [+ + +]

Author: Ilan Peer <ilan.peer@intel.com>
Date:   Wed Jul 9 23:37:55 2025 +0300

    wifi: cfg80211: Fix interface type validation
    
    [ Upstream commit 14450be2332a49445106403492a367412b8c23f4 ]
    
    Fix a condition that verified valid values of interface types.
    
    Signed-off-by: Ilan Peer <ilan.peer@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250709233537.7ad199ca5939.I0ac1ff74798bf59a87a57f2e18f2153c308b119b@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: cfg80211: reject HTC bit for management frames [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Fri Jul 18 20:23:06 2025 +0200

    wifi: cfg80211: reject HTC bit for management frames
    
    [ Upstream commit be06a8c7313943109fa870715356503c4c709cbc ]
    
    Management frames sent by userspace should never have the
    order/HTC bit set, reject that. It could also cause some
    confusion with the length of the buffer and the header so
    the validation might end up wrong.
    
    Link: https://patch.msgid.link/20250718202307.97a0455f0f35.I1805355c7e331352df16611839bc8198c855a33f@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlegacy: Check rate_idx range after addition [+ + +]

Author: Stanislaw Gruszka <stf_xl@wp.pl>
Date:   Sun May 25 16:45:24 2025 +0200

    wifi: iwlegacy: Check rate_idx range after addition
    
    [ Upstream commit 0de19d5ae0b2c5b18b88c5c7f0442f707a207409 ]
    
    Limit rate_idx to IL_LAST_OFDM_RATE for 5GHz band for thinkable case
    the index is incorrect.
    
    Reported-by: Fedor Pchelkin <pchelkin@ispras.ru>
    Reported-by: Alexei Safin <a.safin@rosa.ru>
    Signed-off-by: Stanislaw Gruszka <stf_xl@wp.pl>
    Reviewed-by: Fedor Pchelkin <pchelkin@ispras.ru>
    Link: https://patch.msgid.link/20250525144524.GA172583@wp.pl
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: dvm: fix potential overflow in rs_fill_link_cmd() [+ + +]

Author: Rand Deeb <rand.sec96@gmail.com>
Date:   Wed Mar 13 13:17:55 2024 +0300

    wifi: iwlwifi: dvm: fix potential overflow in rs_fill_link_cmd()
    
    [ Upstream commit e3ad987e9dc7d1e12e3f2f1e623f0e174cd0ca78 ]
    
    The 'index' variable in the rs_fill_link_cmd() function can reach
    LINK_QUAL_MAX_RETRY_NUM during the execution of the inner loop. This
    variable is used as an index for the lq_cmd->rs_table array, which has a
    size of LINK_QUAL_MAX_RETRY_NUM, without proper validation.
    
    Modify the condition of the inner loop to ensure that the 'index' variable
    does not exceed LINK_QUAL_MAX_RETRY_NUM - 1, thereby preventing any
    potential overflow issues.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Signed-off-by: Rand Deeb <rand.sec96@gmail.com>
    Link: https://patch.msgid.link/20240313101755.269209-1-rand.sec96@gmail.com
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: fw: Fix possible memory leak in iwl_fw_dbg_collect [+ + +]

Author: Pagadala Yesu Anjaneyulu <pagadala.yesu.anjaneyulu@intel.com>
Date:   Wed Jun 11 22:26:23 2025 +0300

    wifi: iwlwifi: fw: Fix possible memory leak in iwl_fw_dbg_collect
    
    [ Upstream commit cc8d9cbf269dab363c768bfa9312265bc807fca5 ]
    
    Ensure descriptor is freed on error to avoid memory leak.
    
    Signed-off-by: Pagadala Yesu Anjaneyulu <pagadala.yesu.anjaneyulu@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250611222325.8158d15ec866.Ifa3e422c302397111f20a16da7509e6574bc19e3@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: handle non-overlapping API ranges [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Wed Jul 9 23:05:29 2025 +0300

    wifi: iwlwifi: handle non-overlapping API ranges
    
    [ Upstream commit 6fdd41b25fb4154bcde7a38ca1c98e072fa1177c ]
    
    The option to set an api_version_min/max also to the RF was added.
    In the case that both the MAC and the RF has a range defined, we take
    the narrower range of both.
    
    This doesn't work for non-overlapping ranges. In this case, we should
    just take the lower range of both.
    
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250709200543.1628666-2-miriam.rachel.korenblit@intel.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: avoid outdated reorder buffer head_sn [+ + +]

Author: Avraham Stern <avraham.stern@intel.com>
Date:   Wed Jul 23 09:45:06 2025 +0300

    wifi: iwlwifi: mld: avoid outdated reorder buffer head_sn
    
    [ Upstream commit 666357bf3e57c6a68be128825775aee14f9a24f7 ]
    
    If no frames are received on a queue for a while, the reorder buffer
    head_sn may be an old one. When the next frame that is received on
    that queue and buffered is a subframe of an AMSDU but not the last
    subframe, it will not update the buffer's head_sn. When the frame
    release notification arrives, it will not release the buffered frame
    because it will look like the notification's NSSN is lower than the
    buffer's head_sn (because of a wraparound).
    Fix it by updating the head_sn when the first frame is buffered.
    
    Signed-off-by: Avraham Stern <avraham.stern@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250723094230.e1f62a9a603c.I7b57a481122074b1f40d39cd31db2e5262668eb2@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: don't exit EMLSR when we shouldn't [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Thu Jun 12 14:48:57 2025 +0300

    wifi: iwlwifi: mld: don't exit EMLSR when we shouldn't
    
    [ Upstream commit 0cdb8ff6ebbac55f38933f4621215784887b400e ]
    
    There is a requirement to exit EMLSR if there wasn't enough throughput
    in the secondary link.
    This is checked in check_tpt_wk, which runs every 5 seconds in a high
    throughput scenario (when the throughput blocker isn't set)
    
    It can happen that this worker is running immediately after we entered
    EMLSR, and in that case the secondary link didn't have a chance to have
    throughput. In that case we will exit EMLSR for no good reason.
    
    Fix this by tracking the time we entered EMLSR, and in the worker make
    sure that 5 seconds passed from when we entered EMLSR. If not, don't
    check the secondary link throughput.
    
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250612144708.c680f8d7dc37.I8a02d1e8d99df3789da8d5714f19b31a865a61ff@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: fix last_mlo_scan_time type [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Wed Jun 11 22:26:27 2025 +0300

    wifi: iwlwifi: mld: fix last_mlo_scan_time type
    
    [ Upstream commit f26281c1b727b90ec18ae90044d5f429d2250e82 ]
    
    This should be u64, otherwise it rolls over quickly on 32-bit
    systems.
    
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250611222325.5381030253cd.I4e3a7bca5b52fc826e26311055286421508c4d1b@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: fix scan request validation [+ + +]

Author: Avraham Stern <avraham.stern@intel.com>
Date:   Thu Jul 10 21:28:18 2025 +0300

    wifi: iwlwifi: mld: fix scan request validation
    
    [ Upstream commit d1f5f881ac2c5dc185a88c7bfe47d2b3ecbbc501 ]
    
    The scan request validation function uses bitwise and instead
    of logical and. Fix it.
    
    Signed-off-by: Avraham Stern <avraham.stern@intel.com>
    Reviewed-by: Daniel Gabay <daniel.gabay@intel.com>
    Reviewed-by: Ilan Peer <ilan.peer@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250710212632.ec7d665f56a4.I416816b491fafa5d3efdf0a4be78356eedf2bd95@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: use spec link id and not FW link id [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Wed Jul 23 09:45:12 2025 +0300

    wifi: iwlwifi: mld: use spec link id and not FW link id
    
    [ Upstream commit 170db5f873850a04a8eafd3401b2ea36adb20cea ]
    
    In missed beacon handling, we compare the FW link id to the
    bss_param_ch_cnt_link_id, which is a spec link id. Fix it.
    
    Reviewed-by: Somashekhar Puttagangaiah <somashekhar.puttagangaiah@intel.com>
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250723094230.2104f8cac836.I25ed77c2b87bde82a9153e2aa26e09b8a42f6ee3@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: use the correct struct size for tracing [+ + +]

Author: Benjamin Berg <benjamin.berg@intel.com>
Date:   Thu Jun 12 14:48:50 2025 +0300

    wifi: iwlwifi: mld: use the correct struct size for tracing
    
    [ Upstream commit b2c1f9b6e3aac9567c84208b73d716cc5d456404 ]
    
    For the iwlmld driver the RX command is using struct iwl_rx_mpdu_desc
    and not the much older struct iwl_rx_mpdu_res_start. Adjust the value of
    rx_mpdu_cmd_hdr_size accordingly so that the trace data is correct.
    
    Signed-off-by: Benjamin Berg <benjamin.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250612144708.688d95d99ff3.Id3055ca6c19cf8c821cbbd80c09ca2a21d9acec7@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mvm: avoid outdated reorder buffer head_sn [+ + +]

Author: Avraham Stern <avraham.stern@intel.com>
Date:   Wed Jul 23 09:45:05 2025 +0300

    wifi: iwlwifi: mvm: avoid outdated reorder buffer head_sn
    
    [ Upstream commit 422850b29e05e67c9145895bfe559940caa0caa8 ]
    
    If no frames are received on a queue for a while, the reorder buffer
    head_sn may be an old one. When the next frame that is received on
    that queue and buffered is a subframe of an AMSDU but not the last
    subframe, it will not update the buffer's head_sn. When the frame
    release notification arrives, it will not release the buffered frame
    because it will look like the notification's NSSN is lower than the
    buffer's head_sn (because of a wraparound).
    Fix it by updating the head_sn when the first frame is buffered.
    
    Signed-off-by: Avraham Stern <avraham.stern@intel.com>
    Reviewed-by: Daniel Gabay <daniel.gabay@intel.com>
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250723094230.795ec0cb8817.I9ec9a3508e7935e8d1833ea3e086066fdefee644@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mvm: fix scan request validation [+ + +]

Author: Avraham Stern <avraham.stern@intel.com>
Date:   Wed Jul 9 23:05:43 2025 +0300

    wifi: iwlwifi: mvm: fix scan request validation
    
    [ Upstream commit 7c2f3ec7707188d8d5269ae2dce97d7be3e9f261 ]
    
    The scan request validation function uses bitwise and instead
    of logical and. Fix it.
    
    Signed-off-by: Avraham Stern <avraham.stern@intel.com>
    Reviewed-by: Ilan Peer <ilan.peer@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250709230308.3fbc1f27871b.I7a8ee91f463c1a2d9d8561c8232e196885d02c43@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mvm: set gtk id also in older FWs [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Thu Jul 10 21:28:27 2025 +0300

    wifi: iwlwifi: mvm: set gtk id also in older FWs
    
    [ Upstream commit 61be9803f322ab46f31ba944c6ef7de195891f64 ]
    
    We use gtk[i].id, but it is not even set in older FW APIs
    (iwl_wowlan_status_v6 and iwl_wowlan_status_v7).
    Set it also in older FWs.
    
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250710212632.e91e49590414.I27d2fdbed1c54aee59929fa11ec169f07e159406@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: pcie: reinit device properly during TOP reset [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Mon Jun 9 21:21:21 2025 +0300

    wifi: iwlwifi: pcie: reinit device properly during TOP reset
    
    [ Upstream commit eda36f5195d6c078e20331f1dfcf688bd6567c2b ]
    
    During TOP reset a full _iwl_trans_pcie_start_hw() is needed so
    the device is properly initialized for operation. Fix that.
    
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250609211928.903444f8e8e8.I7f70600339abb9d658f97924aef22faf1af00a3c@changeid
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: avoid weird state in error path [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Mon Jun 9 21:35:13 2025 +0300

    wifi: mac80211: avoid weird state in error path
    
    [ Upstream commit be1ba9ed221ffb95a8bb15f4c83d0694225ba808 ]
    
    If we get to the error path of ieee80211_prep_connection, for example
    because of a FW issue, then ieee80211_vif_set_links is called
    with 0.
    But the call to drv_change_vif_links from ieee80211_vif_update_links
    will probably fail as well, for the same reason.
    In this case, the valid_links and active_links bitmaps will be reverted
    to the value of the failing connection.
    Then, in the next connection, due to the logic of
    ieee80211_set_vif_links_bitmaps, valid_links will be set to the ID of
    the new connection assoc link, but the active_links will remain with the
    ID of the old connection's assoc link.
    If those IDs are different, we get into a weird state of valid_links and
    active_links being different. One of the consequences of this state is
    to call drv_change_vif_links with new_links as 0, since the & operation
    between the bitmaps will be 0.
    
    Since a removal of a link should always succeed, ignore the return value
    of drv_change_vif_links if it was called to only remove links, which is
    the case for the ieee80211_prep_connection's error path.
    That way, the bitmaps will not be reverted to have the value from the
    failing connection and will have 0, so the next connection will have a
    good state.
    
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Link: https://patch.msgid.link/20250609213231.ba2011fb435f.Id87ff6dab5e1cf757b54094ac2d714c656165059@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: don't complete management TX on SAE commit [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Mon Jun 9 21:35:27 2025 +0300

    wifi: mac80211: don't complete management TX on SAE commit
    
    [ Upstream commit 6b04716cdcac37bdbacde34def08bc6fdb5fc4e2 ]
    
    When SAE commit is sent and received in response, there's no
    ordering for the SAE confirm messages. As such, don't call
    drivers to stop listening on the channel when the confirm
    message is still expected.
    
    This fixes an issue if the local confirm is transmitted later
    than the AP's confirm, for iwlwifi (and possibly mt76) the
    AP's confirm would then get lost since the device isn't on
    the channel at the time the AP transmit the confirm.
    
    For iwlwifi at least, this also improves the overall timing
    of the authentication handshake (by about 15ms according to
    the report), likely since the session protection won't be
    aborted and rescheduled.
    
    Note that even before this, mgd_complete_tx() wasn't always
    called for each call to mgd_prepare_tx() (e.g. in the case
    of WEP key shared authentication), and the current drivers
    that have the complete callback don't seem to mind. Document
    this as well though.
    
    Reported-by: Jan Hendrik Farr <kernel@jfarr.cc>
    Closes: https://lore.kernel.org/all/aB30Ea2kRG24LINR@archlinux/
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250609213232.12691580e140.I3f1d3127acabcd58348a110ab11044213cf147d3@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: don't unreserve never reserved chanctx [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Wed Jul 9 23:37:53 2025 +0300

    wifi: mac80211: don't unreserve never reserved chanctx
    
    [ Upstream commit a6d521bafcb290294128a51b13dbf4baae5748fc ]
    
    If a link has no chanctx, indicating it is an inactive link
    that we tracked CSA for, then attempting to unreserve the
    reserved chanctx will throw a warning and fail, since there
    never was a reserved chanctx. Skip the unreserve.
    
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250709233537.022192f4b1ae.Ib58156ac13e674a9f4d714735be0764a244c0aae@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: don't use TPE data from assoc response [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Wed Jul 9 23:38:03 2025 +0300

    wifi: mac80211: don't use TPE data from assoc response
    
    [ Upstream commit a597432cc9e640439370d9dc95952220cc13fc2b ]
    
    Since there's no TPE element in the (re)assoc response, trying
    to use the data from it just leads to using the defaults, even
    though the real values had been set during authentication from
    the discovered BSS information.
    
    Fix this by simply not handling the TPE data in assoc response
    since it's not intended to be present, if it changes later the
    necessary changes will be made by tracking beacons later.
    
    As a side effect, by passing the real frame subtype, now print
    a correct value for ML reconfiguration responses.
    
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250709233537.caa1ca853f5a.I588271f386731978163aa9d84ae75d6f79633e16@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: fix rx link assignment for non-MLO stations [+ + +]

Author: Hari Chandrakanthan <quic_haric@quicinc.com>
Date:   Mon Jun 30 14:11:19 2025 +0530

    wifi: mac80211: fix rx link assignment for non-MLO stations
    
    [ Upstream commit cc2b722132893164bcb3cee4f08ed056e126eb6c ]
    
    Currently, ieee80211_rx_data_set_sta() does not correctly handle the
    case where the interface supports multiple links (MLO), but the station
    does not (non-MLO). This can lead to incorrect link assignment or
    unexpected warnings when accessing link information.
    
    Hence, add a fix to check if the station lacks valid link support and
    use its default link ID for rx->link assignment. If the station
    unexpectedly has valid links, fall back to the default link.
    
    This ensures correct link association and prevents potential issues
    in mixed MLO/non-MLO environments.
    
    Signed-off-by: Hari Chandrakanthan <quic_haric@quicinc.com>
    Signed-off-by: Sarika Sharma <quic_sarishar@quicinc.com>
    Link: https://patch.msgid.link/20250630084119.3583593-1-quic_sarishar@quicinc.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: handle WLAN_HT_ACTION_NOTIFY_CHANWIDTH async [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Wed Jul 9 23:38:02 2025 +0300

    wifi: mac80211: handle WLAN_HT_ACTION_NOTIFY_CHANWIDTH async
    
    [ Upstream commit 93370f2d37f50757a810da409efc0223c342527e ]
    
    If this action frame, with the value of IEEE80211_HT_CHANWIDTH_ANY,
    arrives right after a beacon that changed the operational bandwidth from
    20 MHz to 40 MHz, then updating the rate control bandwidth to 40 can
    race with updating the chanctx width (that happens in the beacon
    proccesing) back to 40 MHz:
    
    cpu0                                    cpu1
    
    ieee80211_rx_mgmt_beacon
    ieee80211_config_bw
    ieee80211_link_change_chanreq
    (*)ieee80211_link_update_chanreq
                                            ieee80211_rx_h_action
                                            (**)ieee80211_sta_cur_vht_bw
    (***) ieee80211_recalc_chanctx_chantype
    
    in (**), the maximum between the capability width and the bss width is
    returned. But the bss width was just updated to 40 in (*),
    so the action frame handling code will increase the width of the rate
    control before the chanctx was increased (in ***), leading to a FW error
    (at least in iwlwifi driver. But this is wrong regardless).
    
    Fix this by simply handling the action frame async, so it won't race
    with the beacon proccessing.
    
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=218632
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Link: https://patch.msgid.link/20250709233537.bb9dc6f36c35.I39782d6077424e075974c3bee4277761494a1527@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: update radar_required in channel context after channel switch [+ + +]

Author: Ramya Gnanasekar <ramya.gnanasekar@oss.qualcomm.com>
Date:   Sun Jun 8 19:33:24 2025 +0530

    wifi: mac80211: update radar_required in channel context after channel switch
    
    [ Upstream commit 140c6a61d83cbd85adba769b5ef8d61acfa5b392 ]
    
    Currently, when a non-DFS channel is brought up and the bandwidth is
    expanded from 80 MHz to 160 MHz, where the primary 80 MHz is non-DFS
    and the secondary 80 MHz consists of DFS channels, radar detection
    fails if radar occurs in the secondary 80 MHz.
    
    When the channel is switched from 80 MHz to 160 MHz, with the primary
    80 MHz being non-DFS and the secondary 80 MHz consisting of DFS
    channels, the radar required flag in the channel switch parameters
    is set to true. However, when using a reserved channel context,
    it is not updated in sdata, which disables radar detection in the
    secondary 80 MHz DFS channels.
    
    Update the radar required flag in sdata to fix this issue when using
    a reserved channel context.
    
    Signed-off-by: Ramya Gnanasekar <ramya.gnanasekar@oss.qualcomm.com>
    Signed-off-by: Ramasamy Kaliappan <ramasamy.kaliappan@oss.qualcomm.com>
    Link: https://patch.msgid.link/20250608140324.1687117-1-ramasamy.kaliappan@oss.qualcomm.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mt76: fix vif link allocation [+ + +]

Author: Felix Fietkau <nbd@nbd.name>
Date:   Fri Jul 4 15:08:05 2025 +0200

    wifi: mt76: fix vif link allocation
    
    [ Upstream commit 53a5d72bdd70e262623b6009cc4754927b428bad ]
    
    Reuse the vif deflink for link_id = 0 in order to avoid confusion with
    vif->bss_conf, which also gets a link id of 0.
    
    Link: https://patch.msgid.link/20250704-mt7996-mlo-fixes-v1-1-356456c73f43@kernel.org
    Signed-off-by: Felix Fietkau <nbd@nbd.name>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mt76: mt7915: mcu: increase eeprom command timeout [+ + +]

Author: David Bauer <mail@david-bauer.net>
Date:   Wed Apr 2 02:45:26 2025 +0200

    wifi: mt76: mt7915: mcu: increase eeprom command timeout
    
    [ Upstream commit efd31873cdb3e5580fb76eeded6314856f52b06e ]
    
    Increase the timeout for MCU_EXT_CMD_EFUSE_BUFFER_MODE command.
    
    Regular retries upon hardware-recovery have been observed. Increasing
    the timeout slightly remedies this problem.
    
    Signed-off-by: David Bauer <mail@david-bauer.net>
    Link: https://patch.msgid.link/20250402004528.1036715-2-mail@david-bauer.net
    Signed-off-by: Felix Fietkau <nbd@nbd.name>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mt76: mt7915: mcu: re-init MCU before loading FW patch [+ + +]

Author: David Bauer <mail@david-bauer.net>
Date:   Wed Apr 2 02:45:27 2025 +0200

    wifi: mt76: mt7915: mcu: re-init MCU before loading FW patch
    
    [ Upstream commit ac9c50c79eaef5fca0f165e45d0c5880606db53e ]
    
    Restart the MCU and release the patch semaphore before loading the MCU
    patch firmware from the host.
    
    This fixes failures upon error recovery in case the semaphore was
    previously taken and never released by the host.
    
    This happens from time to time upon triggering a full-chip error
    recovery. Under this circumstance, the hardware restart fails and the
    radio is rendered inoperational.
    
    Signed-off-by: David Bauer <mail@david-bauer.net>
    Link: https://patch.msgid.link/20250402004528.1036715-3-mail@david-bauer.net
    Signed-off-by: Felix Fietkau <nbd@nbd.name>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mt76: mt7996: Fix mlink lookup in mt7996_tx_prepare_skb [+ + +]

Author: Lorenzo Bianconi <lorenzo@kernel.org>
Date:   Fri Jul 4 15:08:09 2025 +0200

    wifi: mt76: mt7996: Fix mlink lookup in mt7996_tx_prepare_skb
    
    [ Upstream commit 59ea7af6f9ce218b86f9f46520819247c3a5f97b ]
    
    Use proper link_id in mt7996_tx_prepare_skb routine for mlink lookup.
    
    Signed-off-by: Lorenzo Bianconi <lorenzo@kernel.org>
    Link: https://patch.msgid.link/20250704-mt7996-mlo-fixes-v1-5-356456c73f43@kernel.org
    Signed-off-by: Felix Fietkau <nbd@nbd.name>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtlwifi: fix possible skb memory leak in _rtl_pci_init_one_rxdesc() [+ + +]

Author: Thomas Fourier <fourier.thomas@gmail.com>
Date:   Fri Jun 13 09:38:36 2025 +0200

    wifi: rtlwifi: fix possible skb memory leak in _rtl_pci_init_one_rxdesc()
    
    [ Upstream commit 76b3e5078d76f0eeadb7aacf9845399f8473da0d ]
    
    When `dma_mapping_error()` is true, if a new `skb` has been allocated,
    then it must be de-allocated.
    
    Compile tested only
    
    Signed-off-by: Thomas Fourier <fourier.thomas@gmail.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/20250613074014.69856-2-fourier.thomas@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtlwifi: fix possible skb memory leak in `_rtl_pci_rx_interrupt()`. [+ + +]

Author: Thomas Fourier <fourier.thomas@gmail.com>
Date:   Mon Jun 16 12:56:30 2025 +0200

    wifi: rtlwifi: fix possible skb memory leak in `_rtl_pci_rx_interrupt()`.
    
    [ Upstream commit 44c0e191004f0e3aa1bdee3be248be14dbe5b020 ]
    
    The function `_rtl_pci_init_one_rxdesc()` can fail even when the new
    `skb` is passed because of a DMA mapping error.  If it fails, the `skb`
    is not saved in the rx ringbuffer and thus lost.
    
    Compile tested only
    
    Signed-off-by: Thomas Fourier <fourier.thomas@gmail.com>
    Acked-by: Ping-Ke Shih <pkshih@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/20250616105631.444309-4-fourier.thomas@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: 8852c: increase beacon loss to 6 seconds [+ + +]

Author: Kuan-Chung Chen <damon.chen@realtek.com>
Date:   Fri Jun 6 10:02:57 2025 +0800

    wifi: rtw89: 8852c: increase beacon loss to 6 seconds
    
    [ Upstream commit 4bcef86b13316511bb336a26140fc4130c3a65a2 ]
    
    Intermittent beacon loss from a specific AP causes the connection
    to be lost. Increasing the beacon loss count can make the
    connection more stable.
    
    Signed-off-by: Kuan-Chung Chen <damon.chen@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/20250606020302.16873-2-pkshih@realtek.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: coex: Not to set slot duration to zero to avoid firmware issue [+ + +]

Author: Ching-Te Ku <ku920601@realtek.com>
Date:   Mon Jun 16 17:02:49 2025 +0800

    wifi: rtw89: coex: Not to set slot duration to zero to avoid firmware issue
    
    [ Upstream commit a7feafea4ce80d5fa5284d05d54b4f108d2ab575 ]
    
    If the duration set to zero, Wi-Fi firmware will trigger some unexpected
    issue when firmware try to enable timer.
    
    Signed-off-by: Ching-Te Ku <ku920601@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/20250616090252.51098-9-pkshih@realtek.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: Disable deep power saving for USB/SDIO [+ + +]

Author: Bitterblue Smith <rtl8821cerfe2@gmail.com>
Date:   Mon Jun 30 23:43:25 2025 +0300

    wifi: rtw89: Disable deep power saving for USB/SDIO
    
    [ Upstream commit a3b871a0f7c083c2a632a31da8bc3de554ae8550 ]
    
    Disable deep power saving for USB and SDIO because rtw89_mac_send_rpwm()
    is called in atomic context and accessing hardware registers results in
    "scheduling while atomic" errors.
    
    Signed-off-by: Bitterblue Smith <rtl8821cerfe2@gmail.com>
    Acked-by: Ping-Ke Shih <pkshih@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/0f49eceb-0de0-47e2-ba36-3c6a0dddd17d@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: Fix rtw89_mac_power_switch() for USB [+ + +]

Author: Bitterblue Smith <rtl8821cerfe2@gmail.com>
Date:   Mon Jun 30 23:45:55 2025 +0300

    wifi: rtw89: Fix rtw89_mac_power_switch() for USB
    
    [ Upstream commit e2b71603333a9dd73ee88347d8894fffc3456ac1 ]
    
    Clear some bits in some registers in order to allow RTL8851BU to power
    on. This is done both when powering on and when powering off because
    that's what the vendor driver does.
    
    Also tested with RTL8832BU and RTL8832CU.
    
    Signed-off-by: Bitterblue Smith <rtl8821cerfe2@gmail.com>
    Acked-by: Ping-Ke Shih <pkshih@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/a39da939-d640-4486-ad38-f658f220afc8@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: Lower the timeout in rtw89_fw_read_c2h_reg() for USB [+ + +]

Author: Bitterblue Smith <rtl8821cerfe2@gmail.com>
Date:   Tue Jul 15 22:44:47 2025 +0300

    wifi: rtw89: Lower the timeout in rtw89_fw_read_c2h_reg() for USB
    
    [ Upstream commit 671be46afd1f03de9dc6e4679c88e1a7a81cdff6 ]
    
    This read_poll_timeout_atomic() with a delay of 1 µs and a timeout of
    1000000 µs can take ~250 seconds in the worst case because sending a
    USB control message takes ~250 µs.
    
    Lower the timeout to 4000 for USB in order to reduce the maximum polling
    time to ~1 second.
    
    This problem was observed with RTL8851BU while suspending to RAM with
    WOWLAN enabled. The computer sat for 4 minutes with a black screen
    before suspending.
    
    Signed-off-by: Bitterblue Smith <rtl8821cerfe2@gmail.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/09313da6-c865-4e91-b758-4cb38a878796@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: scan abort when assign/unassign_vif [+ + +]

Author: Chih-Kang Chang <gary.chang@realtek.com>
Date:   Tue Jun 10 21:00:34 2025 +0800

    wifi: rtw89: scan abort when assign/unassign_vif
    
    [ Upstream commit 3db8563bac6c34018cbb96b14549a95c368b0304 ]
    
    If scan happen during start_ap, the register which control TX might be
    turned off during scan. Additionally, if set_channel occurs during scan
    will backup this register and set to firmware after set_channel done.
    When scan complete, firmware will also set TX by this register, causing
    TX to be disabled and beacon can't be TX. Therefore, in assign/unassign_vif
    call scan abort before set_channel to avoid scan racing with set_channel.
    
    Signed-off-by: Chih-Kang Chang <gary.chang@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/20250610130034.14692-13-pkshih@realtek.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: rtw89: wow: Add Basic Rate IE to probe request in scheduled scan mode [+ + +]

Author: Chin-Yen Lee <timlee@realtek.com>
Date:   Wed Jul 16 20:29:26 2025 +0800

    wifi: rtw89: wow: Add Basic Rate IE to probe request in scheduled scan mode
    
    [ Upstream commit 37c23874d13eb369d8b384a1ce5992ff6c23d56f ]
    
    In scheduled scan mode, the current probe request only includes the SSID
    IE, but omits the Basic Rate IE. Some APs do not respond to such
    incomplete probe requests, causing net-detect failures. To improve
    interoperability and ensure APs respond correctly, add the Basic Rate IE
    to the probe request in driver.
    
    Signed-off-by: Chin-Yen Lee <timlee@realtek.com>
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://patch.msgid.link/20250716122926.6709-1-pkshih@realtek.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

x86/bugs: Avoid warning when overriding return thunk [+ + +]

Author: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
Date:   Wed Jun 11 10:29:31 2025 -0700

    x86/bugs: Avoid warning when overriding return thunk
    
    [ Upstream commit 9f85fdb9fc5a1bd308a10a0a7d7e34f2712ba58b ]
    
    The purpose of the warning is to prevent an unexpected change to the return
    thunk mitigation. However, there are legitimate cases where the return
    thunk is intentionally set more than once. For example, ITS and SRSO both
    can set the return thunk after retbleed has set it. In both the cases
    retbleed is still mitigated.
    
    Replace the warning with an info about the active return thunk.
    
    Suggested-by: Borislav Petkov <bp@alien8.de>
    Signed-off-by: Pawan Gupta <pawan.kumar.gupta@linux.intel.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Link: https://lore.kernel.org/20250611-eibrs-fix-v4-3-5ff86cac6c61@linux.intel.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

x86/fpu: Fix NULL dereference in avx512_status() [+ + +]

Author: Fushuai Wang <wangfushuai@baidu.com>
Date:   Mon Aug 11 11:50:44 2025 -0700

    x86/fpu: Fix NULL dereference in avx512_status()
    
    commit 31cd31c9e17ece125aad27259501a2af69ccb020 upstream.
    
    Problem
    -------
    With CONFIG_X86_DEBUG_FPU enabled, reading /proc/[kthread]/arch_status
    causes a warning and a NULL pointer dereference.
    
    This is because the AVX-512 timestamp code uses x86_task_fpu() but
    doesn't check it for NULL. CONFIG_X86_DEBUG_FPU addles that function
    for kernel threads (PF_KTHREAD specifically), making it return NULL.
    
    The point of the warning was to ensure that kernel threads only access
    task->fpu after going through kernel_fpu_begin()/_end(). Note: all
    kernel tasks exposed in /proc have a valid task->fpu.
    
    Solution
    --------
    One option is to silence the warning and check for NULL from
    x86_task_fpu(). However, that warning is fairly fresh and seems like a
    defense against misuse of the FPU state in kernel threads.
    
    Instead, stop outputting AVX-512_elapsed_ms for kernel threads
    altogether. The data was garbage anyway because avx512_timestamp is
    only updated for user threads, not kernel threads.
    
    If anyone ever wants to track kernel thread AVX-512 use, they can come
    back later and do it properly, separate from this bug fix.
    
    [ dhansen: mostly rewrite changelog ]
    
    Fixes: 22aafe3bcb67 ("x86/fpu: Remove init_task FPU state dependencies, add debugging warning for PF_KTHREAD tasks")
    Co-developed-by: Sohil Mehta <sohil.mehta@intel.com>
    Signed-off-by: Sohil Mehta <sohil.mehta@intel.com>
    Signed-off-by: Fushuai Wang <wangfushuai@baidu.com>
    Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com>
    Cc: stable@vger.kernel.org
    Link: https://lore.kernel.org/all/20250811185044.2227268-1-sohil.mehta%40intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/sev/vc: Fix EFI runtime instruction emulation [+ + +]

Author: Gerd Hoffmann <kraxel@redhat.com>
Date:   Thu Jun 26 13:40:11 2025 +0200

    x86/sev/vc: Fix EFI runtime instruction emulation
    
    [ Upstream commit 7b22e0432981c2fa230f1b493082b7e67112c4aa ]
    
    In case efi_mm is active go use the userspace instruction decoder which
    supports fetching instructions from active_mm.  This is needed to make
    instruction emulation work for EFI runtime code, so it can use CPUID and
    RDMSR.
    
    EFI runtime code uses the CPUID instruction to gather information about
    the environment it is running in, such as SEV being enabled or not, and
    choose (if needed) the SEV code path for ioport access.
    
    EFI runtime code uses the RDMSR instruction to get the location of the
    CAA page (see SVSM spec, section 4.2 - "Post Boot").
    
    The big picture behind this is that the kernel needs to be able to
    properly handle #VC exceptions that come from EFI runtime services.
    Since EFI runtime services have a special page table mapping for the EFI
    virtual address space, the efi_mm context must be used when decoding
    instructions during #VC handling.
    
      [ bp: Massage. ]
    
    Signed-off-by: Gerd Hoffmann <kraxel@redhat.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Reviewed-by: Pankaj Gupta <pankaj.gupta@amd.com>
    Link: https://lore.kernel.org/20250626114014.373748-2-kraxel@redhat.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

x86/sev: Ensure SVSM reserved fields in a page validation entry are initialized to zero [+ + +]

Author: Tom Lendacky <thomas.lendacky@amd.com>
Date:   Wed Aug 13 10:26:59 2025 -0500

    x86/sev: Ensure SVSM reserved fields in a page validation entry are initialized to zero
    
    commit 3ee9cebd0a5e7ea47eb35cec95eaa1a866af982d upstream.
    
    In order to support future versions of the SVSM_CORE_PVALIDATE call, all
    reserved fields within a PVALIDATE entry must be set to zero as an SVSM should
    be ensuring all reserved fields are zero in order to support future usage of
    reserved areas based on the protocol version.
    
    Fixes: fcd042e86422 ("x86/sev: Perform PVALIDATE using the SVSM when not at VMPL0")
    Signed-off-by: Tom Lendacky <thomas.lendacky@amd.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Reviewed-by: Joerg Roedel <joerg.roedel@amd.com>
    Cc: <stable@kernel.org>
    Link: https://lore.kernel.org/7cde412f8b057ea13a646fb166b1ca023f6a5031.1755098819.git.thomas.lendacky@amd.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/sev: Improve handling of writes to intercepted TSC MSRs [+ + +]

Author: Nikunj A Dadhania <nikunj@amd.com>
Date:   Tue Jul 22 13:18:53 2025 +0530

    x86/sev: Improve handling of writes to intercepted TSC MSRs
    
    [ Upstream commit 5eb1bcdb6a8c088514019c3a9bda5d565beed1af ]
    
    Currently, when a Secure TSC enabled SNP guest attempts to write to the
    intercepted GUEST_TSC_FREQ MSR (a read-only MSR), the guest kernel response
    incorrectly implies a VMM configuration error, when in fact it is the usual
    VMM configuration to intercept writes to read-only MSRs, unless explicitly
    documented.
    
    Modify the intercepted TSC MSR #VC handling:
    * Write to GUEST_TSC_FREQ will generate a #GP instead of terminating the
      guest
    * Write to MSR_IA32_TSC will generate a #GP instead of silently ignoring it
    
    However, continue to terminate the guest when reading from intercepted
    GUEST_TSC_FREQ MSR with Secure TSC enabled, as intercepted reads indicate an
    improper VMM configuration for Secure TSC enabled SNP guests.
    
      [ bp: simplify comment. ]
    
    Fixes: 38cc6495cdec ("x86/sev: Prevent GUEST_TSC_FREQ MSR interception for Secure TSC enabled guests")
    Suggested-by: Sean Christopherson <seanjc@google.com>
    Signed-off-by: Nikunj A Dadhania <nikunj@amd.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Reviewed-by: Tom Lendacky <thomas.lendacky@amd.com>
    Link: https://lore.kernel.org/20250722074853.22253-1-nikunj@amd.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

xen/netfront: Fix TX response spurious interrupts [+ + +]

Author: Anthoine Bourgeois <anthoine.bourgeois@vates.tech>
Date:   Mon Jul 21 09:34:54 2025 +0000

    xen/netfront: Fix TX response spurious interrupts
    
    [ Upstream commit 114a2de6fa86d99ed9546cc9113a3cad58beef79 ]
    
    We found at Vates that there are lot of spurious interrupts when
    benchmarking the xen-net PV driver frontend. This issue appeared with a
    patch that addresses security issue XSA-391 (b27d47950e48 "xen/netfront:
    harden netfront against event channel storms"). On an iperf benchmark,
    spurious interrupts can represent up to 50% of the interrupts.
    
    Spurious interrupts are interrupts that are rised for nothing, there is
    no work to do. This appends because the function that handles the
    interrupts ("xennet_tx_buf_gc") is also called at the end of the request
    path to garbage collect the responses received during the transmission
    load.
    
    The request path is doing the work that the interrupt handler should
    have done otherwise. This is particurary true when there is more than
    one vcpu and get worse linearly with the number of vcpu/queue.
    
    Moreover, this problem is amplifyed by the penalty imposed by a spurious
    interrupt. When an interrupt is found spurious the interrupt chip will
    delay the EOI to slowdown the backend. This delay will allow more
    responses to be handled by the request path and then there will be more
    chance the next interrupt will not find any work to do, creating a new
    spurious interrupt.
    
    This causes performance issue. The solution here is to remove the calls
    from the request path and let the interrupt handler do the processing of
    the responses. This approch removes most of the spurious interrupts
    (<0.05%) and also has the benefit of freeing up cycles in the request
    path, allowing it to process more work, which improves performance
    compared to masking the spurious interrupt one way or another.
    
    This optimization changes a part of the code that is present since the
    net frontend driver was upstreamed. There is no similar pattern in the
    other xen PV drivers. Since the first commit of xen-netfront is a blob
    that doesn't explain all the design choices I can only guess why this
    specific mecanism was here. This could have been introduce to compensate
    a slow backend at the time (maybe the backend was fixed or optimize
    later) or a small queue. In 18 years, both frontend and backend gain lot
    of features and optimizations that could have obsolete the feature of
    reaping completions from the TX path.
    
    Some vif throughput performance figures from a 8 vCPUs, 4GB of RAM HVM
    guest(s):
    
    Without this patch on the :
    vm -> dom0: 4.5Gb/s
    vm -> vm:   7.0Gb/s
    
    Without XSA-391 patch (revert of b27d47950e48):
    vm -> dom0: 8.3Gb/s
    vm -> vm:   8.7Gb/s
    
    With XSA-391 and this patch:
    vm -> dom0: 11.5Gb/s
    vm -> vm:   12.6Gb/s
    
    v2:
    - add revewed and tested by tags
    - resend with the maintainers in the recipients list
    
    v3:
    - remove Fixes tag but keep the commit ref in the explanation
    - add a paragraph on why this code was here
    
    Signed-off-by: Anthoine Bourgeois <anthoine.bourgeois@vates.tech>
    Reviewed-by: Juergen Gross <jgross@suse.com>
    Tested-by: Elliott Mitchell <ehem+xen@m5p.com>
    Signed-off-by: Juergen Gross <jgross@suse.com>
    Message-ID: <20250721093316.23560-1-anthoine.bourgeois@vates.tech>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

xfrm: bring back device check in validate_xmit_xfrm [+ + +]

Author: Sabrina Dubroca <sd@queasysnail.net>
Date:   Mon Aug 4 11:26:26 2025 +0200

    xfrm: bring back device check in validate_xmit_xfrm
    
    [ Upstream commit 65f079a6c446a939eefe71e6d5957d5d6365fcf9 ]
    
    This is partial revert of commit d53dda291bbd993a29b84d358d282076e3d01506.
    
    This change causes traffic using GSO with SW crypto running through a
    NIC capable of HW offload to no longer get segmented during
    validate_xmit_xfrm, and is unrelated to the bonding use case mentioned
    in the commit.
    
    Fixes: d53dda291bbd ("xfrm: Remove unneeded device check from validate_xmit_xfrm")
    Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>
    Reviewed-by: Cosmin Ratiu <cratiu@nvidia.com>
    Signed-off-by: Steffen Klassert <steffen.klassert@secunet.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

xfrm: Duplicate SPI Handling [+ + +]

Author: Aakash Kumar S <saakashkumar@marvell.com>
Date:   Mon Jun 30 18:08:56 2025 +0530

    xfrm: Duplicate SPI Handling
    
    [ Upstream commit 94f39804d891cffe4ce17737d295f3b195bc7299 ]
    
    The issue originates when Strongswan initiates an XFRM_MSG_ALLOCSPI
    Netlink message, which triggers the kernel function xfrm_alloc_spi().
    This function is expected to ensure uniqueness of the Security Parameter
    Index (SPI) for inbound Security Associations (SAs). However, it can
    return success even when the requested SPI is already in use, leading
    to duplicate SPIs assigned to multiple inbound SAs, differentiated
    only by their destination addresses.
    
    This behavior causes inconsistencies during SPI lookups for inbound packets.
    Since the lookup may return an arbitrary SA among those with the same SPI,
    packet processing can fail, resulting in packet drops.
    
    According to RFC 4301 section 4.4.2 , for inbound processing a unicast SA
    is uniquely identified by the SPI and optionally protocol.
    
    Reproducing the Issue Reliably:
    To consistently reproduce the problem, restrict the available SPI range in
    charon.conf : spi_min = 0x10000000 spi_max = 0x10000002
    This limits the system to only 2 usable SPI values.
    Next, create more than 2 Child SA. each using unique pair of src/dst address.
    As soon as the 3rd Child SA is initiated, it will be assigned a duplicate
    SPI, since the SPI pool is already exhausted.
    With a narrow SPI range, the issue is consistently reproducible.
    With a broader/default range, it becomes rare and unpredictable.
    
    Current implementation:
    xfrm_spi_hash() lookup function computes hash using daddr, proto, and family.
    So if two SAs have the same SPI but different destination addresses, then
    they will:
    a. Hash into different buckets
    b. Be stored in different linked lists (byspi + h)
    c. Not be seen in the same hlist_for_each_entry_rcu() iteration.
    As a result, the lookup will result in NULL and kernel allows that Duplicate SPI
    
    Proposed Change:
    xfrm_state_lookup_spi_proto() does a truly global search - across all states,
    regardless of hash bucket and matches SPI and proto.
    
    Signed-off-by: Aakash Kumar S <saakashkumar@marvell.com>
    Acked-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Steffen Klassert <steffen.klassert@secunet.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

xfrm: flush all states in xfrm_state_fini [+ + +]

Author: Sabrina Dubroca <sd@queasysnail.net>
Date:   Mon Aug 4 11:05:43 2025 +0200

    xfrm: flush all states in xfrm_state_fini
    
    [ Upstream commit 42e42562c9cfcdacf000f1b42284a4fad24f8546 ]
    
    While reverting commit f75a2804da39 ("xfrm: destroy xfrm_state
    synchronously on net exit path"), I incorrectly changed
    xfrm_state_flush's "proto" argument back to IPSEC_PROTO_ANY. This
    reverts some of the changes in commit dbb2483b2a46 ("xfrm: clean up
    xfrm protocol checks"), and leads to some states not being removed
    when we exit the netns.
    
    Pass 0 instead of IPSEC_PROTO_ANY from both xfrm_state_fini
    xfrm6_tunnel_net_exit, so that xfrm_state_flush deletes all states.
    
    Fixes: 2a198bbec691 ("Revert "xfrm: destroy xfrm_state synchronously on net exit path"")
    Reported-by: syzbot+6641a61fe0e2e89ae8c5@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=6641a61fe0e2e89ae8c5
    Tested-by: syzbot+6641a61fe0e2e89ae8c5@syzkaller.appspotmail.com
    Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Signed-off-by: Steffen Klassert <steffen.klassert@secunet.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

xfrm: restore GSO for SW crypto [+ + +]

Author: Sabrina Dubroca <sd@queasysnail.net>
Date:   Mon Aug 4 11:26:25 2025 +0200

    xfrm: restore GSO for SW crypto
    
    [ Upstream commit 234d1eff5d4987024be9d40ac07b918a5ae8db1a ]
    
    Commit 49431af6c4ef incorrectly assumes that the GSO path is only used
    by HW offload, but it's also useful for SW crypto.
    
    This patch re-enables GSO for SW crypto. It's not an exact revert to
    preserve the other changes made to xfrm_dev_offload_ok afterwards, but
    it reverts all of its effects.
    
    Fixes: 49431af6c4ef ("xfrm: rely on XFRM offload")
    Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>
    Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
    Reviewed-by: Zhu Yanjun <yanjun.zhu@linux.dev>
    Signed-off-by: Steffen Klassert <steffen.klassert@secunet.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

xfs: fix scrub trace with null pointer in quotacheck [+ + +]

Author: Andrey Albershteyn <aalbersh@redhat.com>
Date:   Thu Jul 31 19:07:22 2025 +0200

    xfs: fix scrub trace with null pointer in quotacheck
    
    commit 5d94b19f066480addfcdcb5efde66152ad5a7c0e upstream.
    
    The quotacheck doesn't initialize sc->ip.
    
    Cc: stable@vger.kernel.org # v6.8
    Fixes: 21d7500929c8a0 ("xfs: improve dquot iteration for scrub")
    Reviewed-by: Darrick J. Wong <djwong@kernel.org>
    Signed-off-by: Andrey Albershteyn <aalbersh@kernel.org>
    Signed-off-by: Carlos Maiolino <cem@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>