Changelog in Linux kernel 7.0.11

accel/qaic: Add overflow check to remap_pfn_range during mmap [+ + +]

Author: Zack McKevitt <zachary.mckevitt@oss.qualcomm.com>
Date:   Thu Apr 30 12:39:01 2026 -0700

    accel/qaic: Add overflow check to remap_pfn_range during mmap
    
    [ Upstream commit aa16b2bc0f02709919e2435f531406531e5bcc69 ]
    
    The call to remap_pfn_range in qaic_gem_object_mmap is susceptible to
    (re)mapping beyond the VMA if the BO is too large. This can cause use
    after free issues when munmap() unmaps only the VMA region and not the
    additional mappings. To prevent this, check the remaining size of the
    VMA before remapping and truncate the remapped length if sg->length is
    too large.
    
    Reported-by: Lukas Maar <lukas.maar@tugraz.at>
    Fixes: ff13be830333 ("accel/qaic: Add datapath")
    Reviewed-by: Karol Wachowski <karol.wachowski@linux.intel.com>
    Signed-off-by: Zack McKevitt <zachary.mckevitt@oss.qualcomm.com>
    Reviewed-by: Jeff Hugo <jeff.hugo@oss.qualcomm.com>
    [jhugo: fix braces from checkpatch --strict]
    Signed-off-by: Jeff Hugo <jeff.hugo@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260430193858.1178641-1-zachary.mckevitt@oss.qualcomm.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ACPI: battery: Fix system wakeup on critical battery status [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Fri May 15 19:03:59 2026 +0200

    ACPI: battery: Fix system wakeup on critical battery status
    
    commit c35cb4fc7231702d1e9952aec1a442f3e27df6f5 upstream.
    
    Commit 0a869409a981 ("ACPI: battery: Convert the driver to a platform
    one") changed the parent of the battery wakeup source to the platform
    device used for driver binding, but it forgot to update the
    acpi_pm_wakeup_event() call in acpi_battery_update() accordingly.
    
    Do it now to unbreak waking up the system on critical battery status
    during suspend-to-idle and during transitions to ACPI S3/S4.
    
    Fixes: 0a869409a981 ("ACPI: battery: Convert the driver to a platform one")
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Cc: 7.0+ <stable@vger.kernel.org> # 7.0+
    Link: https://patch.msgid.link/12898712.O9o76ZdvQC@rafael.j.wysocki
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ACPI: driver: Check ACPI_COMPANION() against NULL during probe [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Fri May 22 08:36:07 2026 -0400

    ACPI: driver: Check ACPI_COMPANION() against NULL during probe
    
    [ Upstream commit e4865a56d013e86e46ea6acea15bb6eae01898ff ]
    
    Since every platform driver can be forced to match a device that doesn't
    match its list of device IDs because of device_match_driver_override(),
    platform drivers that rely on the existence of a device's ACPI companion
    object should verify its presence.
    
    Accordingly, add requisite ACPI_COMPANION() or ACPI_HANDLE() checks
    against NULL to 13 platform drivers handling core ACPI devices.
    
    Also change the value returned by the ACPI thermal zone driver when
    the device's ACPI companion is not present to -ENODEV for consistency
    with the other drivers.
    
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Hans de Goede <johannes.goede@oss.qualcomm.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/4516068.ejJDZkT8p0@rafael.j.wysocki
    Cc: 7.0+ <stable@vger.kernel.org> # 7.0+
    [ reordered variable declaration to add NULL check before pre-existing stable-only code that dereferences the pointer ]
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

af_unix: Fix UAF read of tail->len in unix_stream_data_wait() [+ + +]

Author: Jann Horn <jannh@google.com>
Date:   Mon May 18 18:51:30 2026 +0200

    af_unix: Fix UAF read of tail->len in unix_stream_data_wait()
    
    commit be309f8eae8b474a4a617eaae01324da996fc719 upstream.
    
    unix_stream_data_wait() does skb_peek_tail(&sk->sk_receive_queue) without
    holding any lock that prevents SKBs on that queue from being dequeued and
    freed.
    This has been the case since commit 79f632c71bea ("unix/stream: fix
    peeking with an offset larger than data in queue").
    The first consequence of this is that the pointer comparison
    `tail != last` can be false even if `last` semantically refers to an
    already-freed SKB while `tail` is a new SKB allocated at the same address;
    which can cause unix_stream_data_wait() to wrongly keep blocking after new
    data has arrived, but only in a weird scenario where a peeking recv() and
    a normal recv() on the same socket are racing, which is probably not a
    real problem.
    
    But since commit 2b514574f7e8 ("net: af_unix: implement splice for stream
    af_unix sockets"), `tail` is actually dereferenced, which can cause UAF in
    the following race scenario (where test_setup() runs single-threaded,
    and afterwards, test_thread1() and test_thread2() run concurrently in
    two threads:
    ```
    static int socks[2];
    void test_setup(void) {
      socketpair(AF_UNIX, SOCK_STREAM, 0, socks);
      send(socks[1], "A", 1, 0);
      int peekoff = 1;
      setsockopt(socks[0], SOL_SOCKET, SO_PEEK_OFF, &peekoff, sizeof(peekoff));
    }
    void test_thread1(void) {
      char dummy;
      recv(socks[0], &dummy, 1, MSG_PEEK);
    }
    void test_thread2(void) {
      char dummy;
      recv(socks[0], &dummy, 1, 0);
      shutdown(socks[1], SHUT_WR);
    }
    ```
    
    when racing like this:
    ```
    thread1                       thread2
    unix_stream_read_generic
      mutex_lock(&u->iolock)
      skb_peek(&sk->sk_receive_queue)
      skb_peek_next(skb, &sk->sk_receive_queue)
      mutex_unlock(&u->iolock)
                                  unix_stream_read_generic
                                    unix_state_lock(sk)
                                    skb_peek(&sk->sk_receive_queue)
                                    unix_state_unlock(sk)
      unix_stream_data_wait
        unix_state_lock(sk)
        tail = skb_peek_tail(&sk->sk_receive_queue)
                                    spin_lock(&sk->sk_receive_queue.lock)
                                    __skb_unlink(skb, &sk->sk_receive_queue)
                                    spin_unlock(&sk->sk_receive_queue.lock)
                                    consume_skb(skb) [frees the SKB]
        `tail != last`: false
        `tail`: true
        `tail->len != last_len` ***UAF***
    ```
    
    Fix the UAF by removing the read of tail->len; checking tail->len would
    only make sense if SKBs in the receive queue of a UNIX socket could grow,
    which can no longer happen.
    
    Kuniyuki explained:
    
    > When commit 869e7c62486e ("net: af_unix: implement stream sendpage
    > support") added sendpage() support, data could be appended to the last
    > skb in the receiver's queue.
    >
    > That's why we needed to check if the length of the last skb was changed
    > while waiting for new data in unix_stream_data_wait().
    >
    > However, commit a0dbf5f818f9 ("af_unix: Support MSG_SPLICE_PAGES") and
    > commit 57d44a354a43 ("unix: Convert unix_stream_sendpage() to use
    > MSG_SPLICE_PAGES") refactored sendmsg(), and now data is always added
    > to a new skb.
    
    That means this fix is not suitable for kernels before 6.5.
    
    Fixes: 2b514574f7e8 ("net: af_unix: implement splice for stream af_unix sockets")
    Cc: stable@vger.kernel.org # 6.5.x
    Signed-off-by: Jann Horn <jannh@google.com>
    Reviewed-by: Kuniyuki Iwashima <kuniyu@google.com>
    Link: https://patch.msgid.link/20260518-b4-unix-recv-wait-hotfix-v2-1-83e29ce8ad31@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

afs: Fix the locking used by afs_get_link() [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:34:01 2026 +0100

    afs: Fix the locking used by afs_get_link()
    
    [ Upstream commit c0410adf3da6db46f3513411fcf95e63c2f1d1ad ]
    
    The afs filesystem in the kernel doesn't do locking correctly for symbolic
    links.  There are a number of problems:
    
     (1) It doesn't do any locking around afs_read_single() to prevent races
         between multiple ->get_link() calls, thereby allowing the possibility
         of leaks.
    
     (2) It doesn't use RCU barriering when accessing the buffer pointers
         during RCU pathwalk.
    
     (3) It can race with another thread updating the contents of the symlink
         if a third party updated it on the server.
    
    Fix this by the following means:
    
     (0) Move symlink handling into its own file as this makes it more
         complicated.
    
     (1) Take the validate_lock around afs_read_single() to prevent races
         between multiple ->get_link() calls.
    
     (2) Keep a separate copy of the symlink contents with an rcu_head.  This
         is always going to be a lot smaller than a page, so it can be
         kmalloc'd and save quite a bit of memory.  It also needs a refcount
         for non-RCU pathwalk.
    
     (3) Split the symlink read and write-to-cache routines in afs from those
         for directories.
    
     (4) Discard the I/O buffer as soon as the write-to-cache completes as this
         is a full page (plus a folio_queue).
    
     (5) If there's no cache, discard the I/O buffer immediately after reading
         and copying if there is no cache.
    
    Fixes: eae9e78951bb ("afs: Use netfslib for symlinks, allowing them to be cached")
    Fixes: 6698c02d64b2 ("afs: Locally initialise the contents of a new symlink on creation")
    Closes: https://sashiko.dev/#/patchset/20260326104544.509518-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-25-dhowells@redhat.com
    cc: Marc Dionne <marc.dionne@auristor.com>
    cc: linux-afs@lists.infradead.org
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: asihpi: Fix potential OOB array access at reading cache [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Fri May 15 10:55:58 2026 +0200

    ALSA: asihpi: Fix potential OOB array access at reading cache
    
    commit 7b7d6572145c1dab2dd9bfb550b188e5f0ff3c3f upstream.
    
    find_control() to retrieve a cached info accesses the array with the
    given index blindly, which may lead to an OOB array access.
    Add a sanity check for avoiding it.
    
    Link: https://sashiko.dev/#/patchset/20260511230121.28606-1-rosenp%40gmail.com
    Cc: <stable@vger.kernel.org>
    Link: https://patch.msgid.link/20260515085606.242284-1-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: hda/ca0132: Disable auto-detect on manual output select [+ + +]

Author: Matt DeVillier <matt.devillier@gmail.com>
Date:   Thu May 7 09:58:41 2026 -0500

    ALSA: hda/ca0132: Disable auto-detect on manual output select
    
    [ Upstream commit 6fd9f6e870ea285f05102e8e00e6a7f4495a9a02 ]
    
    Commit 778031e1658d ("ALSA: hda/ca0132: Set HP/Speaker
    auto-detect default from headphone pin verb") enables HP/Speaker
    auto-detect by default when the headphone pin supports presence detect.
    
    With auto-detect enabled, ca0132_select_out() and ca0132_alt_select_out()
    choose the output from jack presence instead of the manual HP/Speaker
    selection. This means selecting speaker output while headphones are
    plugged in updates the control state, but audio still routes to the
    headphones.
    
    Treat an explicit manual output selection as a request to leave
    auto-detect mode. Clear the HP/Speaker auto-detect switch before applying
    the manual selection, and notify userspace so the auto-detect control
    state is updated in mixers. Do this for both the normal HP/Speaker
    Playback Switch and the alternate Output Select control used by desktop
    cards.
    
    This keeps auto-detect enabled by default for devices with jack presence
    detection, while preserving the expected behavior that a manual output
    choice takes effect immediately.
    
    Fixes: 778031e1658d ("ALSA: hda/ca0132: Set HP/Speaker auto-detect default from headphone pin verb")
    Signed-off-by: Matt DeVillier <matt.devillier@gmail.com>
    Link: https://lore.kernel.org/CAFTm+6AfeXKf=b2frG4xC5yC4jjM9TkD6c8+dOWWFw6BDjDESw@mail.gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda/realtek: Use ALC287_FIXUP_TXNW2781_I2C for ASUS Strix Gxx5 [+ + +]

Author: Eric Naim <dnaim@cachyos.org>
Date:   Sat May 16 19:15:31 2026 +0800

    ALSA: hda/realtek: Use ALC287_FIXUP_TXNW2781_I2C for ASUS Strix Gxx5
    
    [ Upstream commit 4372286ac774536e8e68bc6dfa0f0b0152b31fce ]
    
    These devices were incorrectly using the ALC287_FIXUP_TAS2781_I2C quirk
    leading to errors:
    
    [ 18.765990] Serial bus multi instantiate pseudo device driver TXNW2781:00: error -ENXIO: IRQ index 0 not found
    [ 18.768153] Serial bus multi instantiate pseudo device driver TXNW2781:00: error -ENXIO: IRQ index 0 not found
    [ 18.768476] Serial bus multi instantiate pseudo device driver TXNW2781:00: error -ENXIO: IRQ index 0 not found
    [ 18.768899] Serial bus multi instantiate pseudo device driver TXNW2781:00: Instantiated 3 I2C devices.
    
    Use the ALC287_FIXUP_TXNW2781_I2C quirk instead to fix this and restore
    speaker audio on affected devices.
    
    Fixes: 1e9c708dc3ae ("ALSA: hda/tas2781: Add new quirk for Lenovo, ASUS, Dell projects")
    Link: https://lore.kernel.org/59fd4aa4-76b9-4984-8db9-a60e55ec6e80@losource.net/
    Closes: https://lore.kernel.org/CACB9z7kjs8rhLstEc8fV29BCTb5dd881JwGozoKdO5cwCb=YwQ@mail.gmail.com
    Signed-off-by: Eric Naim <dnaim@cachyos.org>
    Link: https://patch.msgid.link/20260516111532.111463-1-dnaim@cachyos.org
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda: cs35l41: Put ACPI device on missing physical node [+ + +]

Author: Shuhao Fu <sfual@cse.ust.hk>
Date:   Tue Apr 28 16:12:38 2026 +0800

    ALSA: hda: cs35l41: Put ACPI device on missing physical node
    
    [ Upstream commit fca7401fe37f7abc6e54147ea560f37279231137 ]
    
    acpi_dev_get_first_match_dev() returns a refcounted ACPI device and
    callers must balance it with acpi_dev_put().
    
    cs35l41_hda_read_acpi() stores the returned ACPI device in
    cs35l41->dacpi. That reference is normally released by the later
    probe cleanup or the remove path, but the NULL-check on
    physdev exits before either of those paths can run.
    
    Drop the lookup reference before returning -ENODEV.
    
    Fixes: c34b04cc6178 ("ALSA: hda: cs35l41: Fix NULL pointer dereference in cs35l41_hda_read_acpi()")
    Signed-off-by: Shuhao Fu <sfual@cse.ust.hk>
    Tested-by: Simon Trimmer <simont@opensource.cirrus.com>
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Link: https://patch.msgid.link/20260428081238.GA1659932@chcpu16
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: hda: cs35l56: Put ACPI device after setting companion [+ + +]

Author: Shuhao Fu <sfual@cse.ust.hk>
Date:   Tue Apr 28 16:01:39 2026 +0800

    ALSA: hda: cs35l56: Put ACPI device after setting companion
    
    [ Upstream commit aa2fbece1b07954ef26488c800d126a36a8ab93e ]
    
    acpi_dev_get_first_match_dev() returns a refcounted ACPI device and
    callers are expected to balance it with acpi_dev_put().
    
    When no companion is already attached, cs35l56_hda_read_acpi() looks
    up an ACPI device and sets it with ACPI_COMPANION_SET(), but leaves
    the lookup reference held.
    
    ACPI_COMPANION_SET() does not take ownership of that reference, so
    drop it with acpi_dev_put() after attaching the companion.
    
    Fixes: 73cfbfa9caea ("ALSA: hda/cs35l56: Add driver for Cirrus Logic CS35L56 amplifier")
    Signed-off-by: Shuhao Fu <sfual@cse.ust.hk>
    Tested-by: Simon Trimmer <simont@opensource.cirrus.com>
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Link: https://patch.msgid.link/20260428080139.GA1649104@chcpu16
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: pcm: Don't setup bogus iov_iter for silencing [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Sun May 17 18:51:20 2026 +0200

    ALSA: pcm: Don't setup bogus iov_iter for silencing
    
    commit e4d3386b74fba8e01280484b67ee481ece00201e upstream.
    
    At transition to the iov_iter for PCM data transfer, we blindly
    applied the iov_iter setup also for silencing (i.e. data = NULL), and
    it leads to a calculation of bogus iov_iter.  Fortunately this didn't
    cause troubles on most of architectures but it goes wrong on RISC-V
    now, causing a NULL dereference.
    
    Handle the NULL data case to treat the silencing in interleaved_copy()
    for addressing the bug above.  noninterleaved_copy() has already the
    NULL data handling, so it doesn't need changes.
    
    Reported-by: Jiakai Xu <xujiakai24@mails.ucas.ac.cn>
    Closes: https://lore.kernel.org/20260515051516.3103036-1-xujiakai24@mails.ucas.ac.cn
    Fixes: cf393babb37a ("ALSA: pcm: Add copy ops with iov_iter")
    Cc: <stable@vger.kernel.org>
    Link: https://patch.msgid.link/20260517165121.31399-1-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: scarlett2: Add missing error check when initialise Autogain Status [+ + +]

Author: Robertus Diawan Chris <robertusdchris@gmail.com>
Date:   Fri May 8 10:39:14 2026 +0700

    ALSA: scarlett2: Add missing error check when initialise Autogain Status
    
    [ Upstream commit c0e4fffc0f474b7ed10adee4ab2bc1a66d36fc72 ]
    
    When initialise new control with scarlett2_add_new_ctl() function for
    Autogain Status, scarlett2_add_new_ctl() might throw an error. So, add
    error check after initialise new control for Autogain Status.
    
    This is reported by Coverity Scan with CID 1598781 as UNUSED_VALUE.
    
    Fixes: 0a995e38dc44 ("ALSA: scarlett2: Add support for software-controllable input gain")
    Signed-off-by: Robertus Diawan Chris <robertusdchris@gmail.com>
    Link: https://patch.msgid.link/20260508033914.111596-1-robertusdchris@gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: scarlett2: Allow flash writes ending at segment boundary [+ + +]

Author: Cássio Gabriel <cassiogabrielcontato@gmail.com>
Date:   Tue May 19 11:46:19 2026 -0300

    ALSA: scarlett2: Allow flash writes ending at segment boundary
    
    commit a69b677e47a80319ce148d61cc29a2b57006e78d upstream.
    
    scarlett2_hwdep_write() rejects writes when offset + count is greater than
    or equal to the selected flash segment size. That incorrectly treats a
    write ending exactly at the end of the segment as out of space, although
    the last byte written is still within the segment.
    
    Split invalid argument checks from the segment-space check, keep
    zero-length writes as no-ops, and compare count against the remaining
    segment size. This permits exact-end writes and avoids relying on
    offset + count before deciding whether the request is in bounds.
    
    Fixes: 1abfbd3c9527 ("ALSA: scarlett2: Add support for uploading new firmware")
    Cc: stable@vger.kernel.org
    Signed-off-by: Cássio Gabriel <cassiogabrielcontato@gmail.com>
    Link: https://patch.msgid.link/20260519-alsa-scarlett2-flash-write-boundary-v1-1-b550480e92da@gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ALSA: seq: Serialize UMP output teardown with event_input [+ + +]

Author: Zhang Cen <rollkingzzc@gmail.com>
Date:   Wed May 20 18:32:49 2026 +0800

    ALSA: seq: Serialize UMP output teardown with event_input
    
    [ Upstream commit 60a1969fae6209644698fca91c185d153674f631 ]
    
    seq_ump_process_event() borrows client->out_rfile.output without
    synchronizing with the first-open and last-close transition in
    seq_ump_client_open() and seq_ump_client_close().
    
    The last output unuse can therefore drop opened[STR_OUT] to zero and
    release the rawmidi file while an in-flight event_input callback is still
    inside snd_rawmidi_kernel_write(). That leaves the rawmidi substream
    runtime exposed to teardown before the write path has taken its own
    buffer reference.
    
    Add a per-client rwlock for the event_input-visible output file. Publish
    a newly opened output file under the write side, and hold the read side
    from the output lookup through snd_rawmidi_kernel_write(). The last
    output close copies and clears the visible output file under the write
    side, then drops the lock and releases the saved rawmidi file. Use
    IRQ-safe rwlock guards because event_input can also be reached from
    atomic sequencer delivery.
    
    The buggy scenario involves two paths, with each column showing the
    order within that path:
    
    path A label: event_input path         path B label: last unuse path
    1. seq_ump_process_event() reads       1. seq_ump_client_close()
       client->out_rfile.output.              drops opened[STR_OUT] to zero.
    2. snd_rawmidi_kernel_write1()         2. snd_rawmidi_kernel_release()
       has not yet pinned runtime.            closes the output file.
    3. The writer continues using          3. close_substream() frees
       the borrowed substream.                substream->runtime.
    
    This keeps the output substream and runtime alive for the full
    event_input write while keeping rawmidi release outside the rwlock.
    
    KASAN reproduced this as a slab-use-after-free in
    snd_rawmidi_kernel_write1(), with allocation through
    seq_ump_use()/snd_seq_port_connect() and free through
    seq_ump_unuse()/snd_seq_port_disconnect().
    
    Suggested-by: Takashi Iwai <tiwai@suse.de>
    
    Validation reproduced this kernel report:
    KASAN slab-use-after-free in snd_rawmidi_kernel_write1+0x9d/0x400
    RIP: 0033:0x7f5528af837f
    Read of size 8
    Call trace:
      dump_stack_lvl+0x73/0xb0 (?:?)
      print_report+0xd1/0x650 (?:?)
      srso_alias_return_thunk+0x5/0xfbef5 (?:?)
      __virt_addr_valid+0x1a7/0x340 (?:?)
      kasan_complete_mode_report_info+0x64/0x200 (?:?)
      kasan_report+0xf7/0x130 (?:?)
      snd_rawmidi_kernel_write1+0x9d/0x400 (?:?)
      __asan_load8+0x82/0xb0 (?:?)
      update_stack_state+0x1ef/0x2d0 (?:?)
      snd_rawmidi_kernel_write+0x1a/0x20 (?:?)
      seq_ump_process_event+0xd4/0x120 (sound/core/seq/seq_ump_client.c:82)
      __snd_seq_deliver_single_event+0x8a/0xe0 (?:?)
      snd_seq_deliver_from_ump+0x2b2/0xd60 (?:?)
      lock_acquire+0x14e/0x2e0 (?:?)
      find_held_lock+0x31/0x90 (?:?)
      snd_seq_port_use_ptr+0xa6/0xe0 (?:?)
      __kasan_check_write+0x18/0x20 (?:?)
      do_raw_read_unlock+0x32/0xa0 (?:?)
      _raw_read_unlock+0x26/0x50 (?:?)
      snd_seq_deliver_single_event+0x45c/0x4b0 (?:?)
      snd_seq_deliver_event+0x10d/0x1b0 (?:?)
      snd_seq_client_enqueue_event+0x192/0x240 (?:?)
      snd_seq_write+0x2cd/0x450 (?:?)
      apparmor_file_permission+0x20/0x30 (?:?)
      security_file_permission+0x51/0x60 (?:?)
      vfs_write+0x1ce/0x850 (?:?)
      __fget_files+0x12b/0x220 (?:?)
      lock_release+0xc8/0x2a0 (?:?)
      __rcu_read_unlock+0x74/0x2d0 (?:?)
      __fget_files+0x135/0x220 (?:?)
      ksys_write+0x15a/0x180 (?:?)
      rcu_is_watching+0x24/0x60 (?:?)
      __x64_sys_write+0x46/0x60 (?:?)
      x64_sys_call+0x7d/0x20d0 (?:?)
      do_syscall_64+0xc1/0x360 (arch/x86/entry/syscall_64.c:87)
      entry_SYSCALL_64_after_hwframe+0x77/0x7f (?:?)
    
    Fixes: 81fd444aa371 ("ALSA: seq: Bind UMP device")
    Signed-off-by: Zhang Cen <rollkingzzc@gmail.com>
    Link: https://patch.msgid.link/20260520103249.3048345-1-rollkingzzc@gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ALSA: ua101: Reject too-short USB descriptors [+ + +]

Author: Cássio Gabriel <cassiogabrielcontato@gmail.com>
Date:   Tue May 19 00:32:15 2026 -0300

    ALSA: ua101: Reject too-short USB descriptors
    
    commit b59d5c51bb328a60749b4dd5fe7e649bfb4089b4 upstream.
    
    find_format_descriptor() walks the class-specific interface extras by
    advancing with bLength. It rejects descriptors that extend past the
    remaining buffer, but it does not reject descriptor lengths smaller than
    a USB descriptor header.
    
    Reject too-short descriptors before using bLength to advance the local
    scan. This keeps the UA-101 parser robust against malformed descriptor
    data and matches the usual USB descriptor walking rules.
    
    Fixes: 63978ab3e3e9 ("sound: add Edirol UA-101 support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Cássio Gabriel <cassiogabrielcontato@gmail.com>
    Link: https://patch.msgid.link/20260519-alsa-ua101-desc-len-v1-1-4307d1a5e054@gmail.com
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

arm64: dts: renesas: r8a78000: Fix SCIF brg_int clocks [+ + +]

Author: Geert Uytterhoeven <geert+renesas@glider.be>
Date:   Tue Jan 6 18:09:51 2026 +0100

    arm64: dts: renesas: r8a78000: Fix SCIF brg_int clocks
    
    [ Upstream commit 86637727c11a105499e9faa38f3422dfcf4d211d ]
    
    According to the documentation, the internal clock input for the BRG is
    SGASYNCD4_PERW_BUSφ.
    
    Fixes: c13a643e2c491f5b ("arm64: dts: renesas: Add R8A78000 SoC support")
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/459d360a8332f92b3766b30814e7e1c76169aaf7.1767719254.git.geert+renesas@glider.be
    Signed-off-by: Sasha Levin <sashal@kernel.org>

arm64: probes: Handle probes on hinted conditional branch instructions [+ + +]

Author: Vladimir Murzin <vladimir.murzin@arm.com>
Date:   Fri May 15 14:37:29 2026 +0100

    arm64: probes: Handle probes on hinted conditional branch instructions
    
    commit 2ccd8ff980b50e842481bae71102fa3883fc4377 upstream.
    
    BC.cond instructions introduced by FEAT_HBC cannot be executed
    out-of-line, like other branch instructions. However, they can be
    simulated in the same way as B.cond instructions.
    
    Extend the B.cond decoder mask to match BC.cond instructions as well,
    and handle them using the existing B.cond simulation path.
    
    Fixes: 7f86d128e437 ("arm64: add HWCAP for FEAT_HBC (hinted conditional branches)")
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Vladimir Murzin <vladimir.murzin@arm.com>
    Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ARM: dts: renesas: genmai: Drop superfluous cells [+ + +]

Author: Marek Vasut <marek.vasut+renesas@mailbox.org>
Date:   Sat Mar 28 00:42:10 2026 +0100

    ARM: dts: renesas: genmai: Drop superfluous cells
    
    [ Upstream commit 714e1d6bba0e0abe5c87c8e189a35fa690540df4 ]
    
    Drop superfluous address-cells and size-cells to fix DTC W=1 warning:
    
        arch/arm/boot/dts/renesas/r7s72100-genmai.dts:28.17-55.4: Warning (avoid_unnecessary_addr_size): /flash@18000000: unnecessary #address-cells/#size-cells without "ranges", "dma-ranges" or child "reg" or "ranges" property
    
    Signed-off-by: Marek Vasut <marek.vasut+renesas@mailbox.org>
    Fixes: 30e0a8cf886cb459 ("ARM: dts: renesas: genmai: Add FLASH nodes")
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/20260327234244.91707-6-marek.vasut+renesas@mailbox.org
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ARM: dts: renesas: rskrza1: Drop superfluous cells [+ + +]

Author: Marek Vasut <marek.vasut+renesas@mailbox.org>
Date:   Sat Mar 28 00:42:11 2026 +0100

    ARM: dts: renesas: rskrza1: Drop superfluous cells
    
    [ Upstream commit ab83176d3cf1cf1c1f6e604432905bda4515d17f ]
    
    Drop superfluous address-cells and size-cells to fix DTC W=1 warning:
    
        arch/arm/boot/dts/renesas/r7s72100-rskrza1.dts:32.17-72.4: Warning (avoid_unnecessary_addr_size): /flash@18000000: unnecessary #address-cells/#size-cells without "ranges", "dma-ranges" or child "reg" or "ranges" property
    
    Signed-off-by: Marek Vasut <marek.vasut+renesas@mailbox.org>
    Fixes: 98537eb77d3ef185 ("ARM: dts: renesas: rskrza1: Add FLASH nodes")
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/20260327234244.91707-7-marek.vasut+renesas@mailbox.org
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ARM: integrator: Fix early initialization [+ + +]

Author: Guenter Roeck <linux@roeck-us.net>
Date:   Tue May 5 21:15:37 2026 +0200

    ARM: integrator: Fix early initialization
    
    [ Upstream commit 90d77b30a666049ad24df463f52e5d529c44e8cd ]
    
    Starting with commit bdb249fce9ad4 ("ARM: integrator: read counter using
    syscon/regmap"), intcp_init_early calls syscon_regmap_lookup_by_compatible
    which in turn calls of_syscon_register. This function allocates memory.
    Since the memory management code has not been initialized at that time,
    the call always fails. It either returns -ENOMEM or crashes as follows.
    
    Unable to handle kernel NULL pointer dereference at virtual address 0000000c when read
    [0000000c] *pgd=00000000
    Internal error: Oops: 5 [#1] ARM
    Modules linked in:
    CPU: 0 UID: 0 PID: 0 Comm: swapper Not tainted 6.15.0-rc5-00026-g5fcc9bf84ee5 #1 PREEMPT
    Hardware name: ARM Integrator/CP (Device Tree)
    PC is at __kmalloc_cache_noprof+0xec/0x39c
    LR is at __kmalloc_cache_noprof+0x34/0x39c
    ...
    Call trace:
     __kmalloc_cache_noprof from of_syscon_register+0x7c/0x310
     of_syscon_register from device_node_get_regmap+0xa4/0xb0
     device_node_get_regmap from intcp_init_early+0xc/0x40
     intcp_init_early from start_kernel+0x60/0x688
     start_kernel from 0x0
    
    The crash is seen due to a dereferenced pointer which is not supposed to be
    NULL but is NULL if the memory management subsystem has not been
    initialized. The crash is not seen with all versions of gcc. Some versions
    such as gcc 9.x apparently do not dereference the pointer, presumably if
    tracing is disabled. The problem has been reproduced with gcc 10.x, 11.x,
    and 13.x. Either case, if the crash is not seen, the call to
    syscon_regmap_lookup_by_compatible returns -ENOMEM, and
    sched_clock_register is never called.
    
    Fix the problem by moving the early initialization code into the standard
    machine initialization code.
    
    Fixes: bdb249fce9ad4 ("ARM: integrator: read counter using syscon/regmap")
    Cc: Linus Walleij <linus.walleij@linaro.org>
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Link: https://lore.kernel.org/20250518164118.3859567-1-linux@roeck-us.net
    Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
    Link: https://lore.kernel.org/r/20260505-integrator-fixes-v1-1-56ab9aac59db@kernel.org
    Signed-off-by: Arnd Bergmann <arnd@arndb.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: amd: acp-sdw-legacy: check CPU DAI name before logging [+ + +]

Author: Cássio Gabriel <cassiogabrielcontato@gmail.com>
Date:   Mon May 11 13:42:02 2026 -0300

    ASoC: amd: acp-sdw-legacy: check CPU DAI name before logging
    
    [ Upstream commit 1afd8f06dcb1d561af3b239c5b14a88b87c13454 ]
    
    devm_kasprintf() can fail and return NULL. The legacy AMD SoundWire
    machine driver logs cpus->dai_name before checking the allocation result.
    
    Move the debug print after the NULL check, matching the ordering used by
    the SOF AMD SoundWire path after commit 5726b68473f7 ("ASoC: amd/sdw_utils:
    avoid NULL deref when devm_kasprintf() fails").
    
    Fixes: 2981d9b0789c ("ASoC: amd: acp: add soundwire machine driver for legacy stack")
    Signed-off-by: Cássio Gabriel <cassiogabrielcontato@gmail.com>
    Link: https://patch.msgid.link/20260511-asoc-amd-acp-sdw-legacy-dai-name-null-v1-1-dc6151b6da8a@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: codecs: fs210x: fix possible buffer overflow [+ + +]

Author: Alexander A. Klimov <grandmaster@al2klimov.de>
Date:   Wed May 13 21:08:52 2026 +0200

    ASoC: codecs: fs210x: fix possible buffer overflow
    
    [ Upstream commit 0d435a7ebcd4e97e47673c1ab6fb27f973a053ec ]
    
    In fs210x_effect_scene_info(), a string was copied like this:
    
        strscpy(DST, SRC, strlen(SRC) + 1);
    
    A buffer overflow would happen if strlen(SRC) >= sizeof(DST).
    Actually, strscpy() must be used this way:
    
        strscpy(DST, SRC, sizeof(DST));
        strscpy(DST, SRC); // defaults to sizeof(DST)
    
    Fixes: 756117701779 ("ASoC: codecs: Add FourSemi FS2104/5S audio amplifier driver")
    Signed-off-by: Alexander A. Klimov <grandmaster@al2klimov.de>
    Link: https://patch.msgid.link/20260513190852.196723-2-grandmaster@al2klimov.de
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: codecs: pcm512x: fix null-ptr dereference in pcm512x_overclock_xxx_put() [+ + +]

Author: Jeongjun Park <aha310510@gmail.com>
Date:   Thu May 21 20:37:12 2026 +0900

    ASoC: codecs: pcm512x: fix null-ptr dereference in pcm512x_overclock_xxx_put()
    
    commit 09e8f9a9aa19aa8c1b0cc7a0ebc68f6ecf86a660 upstream.
    
    In the pcm512x chipset driver, pcm512x_overclock_xxx_put() is defined as
    a general mixer kcontrol instead of a DAPM kcontrol, so struct
    snd_soc_dapm_context must not be accessed via
    snd_soc_dapm_kcontrol_to_dapm().
    
    This causes a NULL pointer dereference, so it must be modified to use
    snd_soc_component_to_dapm().
    
    Cc: stable@kernel.org
    Closes: https://github.com/raspberrypi/linux/issues/7242
    Fixes: 02dbbb7e982a ("ASoC: codecs: pcm512x: convert to snd_soc_dapm_xxx()")
    Signed-off-by: Jeongjun Park <aha310510@gmail.com>
    Link: https://patch.msgid.link/20260521113712.227438-1-aha310510@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ASoC: cs-amp-lib: Fix missing dput() after debugfs_lookup() [+ + +]

Author: Richard Fitzgerald <rf@opensource.cirrus.com>
Date:   Thu May 21 13:25:10 2026 +0100

    ASoC: cs-amp-lib: Fix missing dput() after debugfs_lookup()
    
    [ Upstream commit ba28a07a9a0b53a538c809e04e517e1ce1f1bee3 ]
    
    Rewrite cs_amp_create_debugfs() so that dput() will be called on
    a valid dentry returned from debugfs_lookup().
    
    The pointer returned from debugfs_lookup() must be released by dput().
    The pointer returned from debugfs_create_dir() does not need to be
    passed to dput().
    
    Signed-off-by: Richard Fitzgerald <rf@opensource.cirrus.com>
    Fixes: cdd27fa3298a ("ASoC: cs-amp-lib: Add helpers for factory calibration")
    Link: https://patch.msgid.link/20260521122511.987322-3-rf@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: cs-amp-lib: Fix wrong sizeof() in _cs_amp_set_efi_calibration_data() [+ + +]

Author: Richard Fitzgerald <rf@opensource.cirrus.com>
Date:   Thu May 21 13:25:09 2026 +0100

    ASoC: cs-amp-lib: Fix wrong sizeof() in _cs_amp_set_efi_calibration_data()
    
    [ Upstream commit 67a52d3ebb5a0ae0c0e23ffa99470d9463179c9f ]
    
    When calculating data->count replace the incorrect sizeof(data) with use
    of struct_offset().
    
    The faulty sizeof(data) was incorrectly calculating the size of the
    pointer instead of the size of the struct pointed to. As it happens, both
    values are 8 on a 64-bit CPU. In the unlikely event of using this code on
    a 32-bit CPU the number of available bytes would be calculated 4 larger
    than is actually available.
    
    Instead of changing to sizeof(*data) it has been replaced by
    struct_offset() because it has better chance of detecting these sorts of
    typos. Also the offset of the data[] array is actually what we want to know
    here anyway.
    
    Signed-off-by: Richard Fitzgerald <rf@opensource.cirrus.com>
    Fixes: 2b62e66626f0 ("ASoC: cs-amp-lib: Add function to write calibration to UEFI")
    Link: https://patch.msgid.link/20260521122511.987322-2-rf@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: cs35l56: Fix flushing of IRQ work in cs35l56_sdw_remove() [+ + +]

Author: Richard Fitzgerald <rf@opensource.cirrus.com>
Date:   Thu May 21 13:30:57 2026 +0100

    ASoC: cs35l56: Fix flushing of IRQ work in cs35l56_sdw_remove()
    
    [ Upstream commit 18e7bd9f2446664053f8c34b72abd4606d22d858 ]
    
    Use flush_work() instead of cancel_work_sync() to terminate pending IRQ
    work in cs35l56_sdw_remove(). And flush_work() again after masking the
    interrupts to flush any queueing that was racing with the masking. This is
    the same sequence as cs35l56_sdw_system_suspend().
    
    cs35l56_sdw_interrupt() takes the pm_runtime to prevent the bus powering-
    down before the interrupt status can be read and handled. The work releases
    this pm_runtime. So cancelling it, instead of flushing, could leave an
    unbalanced pm_runtime.
    
    Signed-off-by: Richard Fitzgerald <rf@opensource.cirrus.com>
    Fixes: e49611252900 ("ASoC: cs35l56: Add driver for Cirrus Logic CS35L56")
    Link: https://patch.msgid.link/20260521123057.988732-1-rf@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: intel: sof_sdw: Prepare for configuration without a jack [+ + +]

Author: Maciej Strozek <mstrozek@opensource.cirrus.com>
Date:   Fri Apr 3 09:23:35 2026 +0100

    ASoC: intel: sof_sdw: Prepare for configuration without a jack
    
    [ Upstream commit d733fb463834cf97a0c667681e236fea0e833a05 ]
    
    In certain setups of cs42l43 UAJ function may be removed from ACPI and
    physically unconnected. Prepare a driver for that configuration by
    setting a system clock in the speaker path too.
    
    Signed-off-by: Maciej Strozek <mstrozek@opensource.cirrus.com>
    Link: https://patch.msgid.link/20260403082335.40798-1-mstrozek@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Stable-dep-of: 5a30862dec5a ("ASoC: sdw_utils: Check speaker component string allocation")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: sdw_utils: Add quirk to ignore RT712 CODEC_MIC [+ + +]

Author: Mac Chiang <mac.chiang@intel.com>
Date:   Fri May 8 17:32:23 2026 +0800

    ASoC: sdw_utils: Add quirk to ignore RT712 CODEC_MIC
    
    [ Upstream commit 9c37daee7c17fa17e8d41089ee1f658b06cb672a ]
    
    Some devices do not use CODEC_MIC but use the host PCH_DMIC
    instead. Add a quirk to skip the CODEC_MIC DAI when it is not present
    in disco table, ensuring the correct capture device is used.
    
    If CODEC_MIC is present, it continues to be used as default.
    
    Fixes: 9489db97f6f0 ("ASoC: sdw_utils: add SmartMic DAI for RT712 VB")
    Signed-off-by: Mac Chiang <mac.chiang@intel.com>
    Signed-off-by: Bard Liao <yung-chuan.liao@linux.intel.com>
    Link: https://patch.msgid.link/20260508093224.1246282-2-yung-chuan.liao@linux.intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: sdw_utils: Add quirk to ignore RT721 CODEC_MIC [+ + +]

Author: Mac Chiang <mac.chiang@intel.corp-partner.google.com>
Date:   Fri May 8 17:32:24 2026 +0800

    ASoC: sdw_utils: Add quirk to ignore RT721 CODEC_MIC
    
    [ Upstream commit fa749a77bdc50f0d695aaf81f1bd55967d77d10f ]
    
    Add a quirk to skip the CODEC_MIC DAI when it is not present.
    This ensures PCH_DMIC is used as the fallback; otherwise,
    CODEC_MIC remains the default.
    
    Fixes: 846a8d3cf3ba ("ASoC: Intel: soc-acpi-intel-ptl-match: Add rt721 support")
    Signed-off-by: Mac Chiang <mac.chiang@intel.com>
    Signed-off-by: Bard Liao <yung-chuan.liao@linux.intel.com>
    Link: https://patch.msgid.link/20260508093224.1246282-3-yung-chuan.liao@linux.intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: sdw_utils: Check speaker component string allocation [+ + +]

Author: Cássio Gabriel <cassiogabrielcontato@gmail.com>
Date:   Tue May 12 11:03:53 2026 -0300

    ASoC: sdw_utils: Check speaker component string allocation
    
    [ Upstream commit 5a30862dec5a70da0a9d259de3f87a7542cc95b2 ]
    
    devm_kasprintf() can fail while building the temporary speaker
    component string. If that happens, spk_components is set to NULL, but
    the current code can still pass it to strlen() on a later loop iteration
    or after the loop when appending the speaker component list to
    card->components.
    
    Use NULL to represent the initial "no speaker components" state, and
    return -ENOMEM immediately if building spk_components fails.
    
    Fixes: 0f60ecffbfe3 ("ASoC: sdw_utils: generate combined spk components string")
    Signed-off-by: Cássio Gabriel <cassiogabrielcontato@gmail.com>
    Link: https://patch.msgid.link/20260512-asoc-sdw-utils-spk-components-alloc-v1-1-c9bbd6d2e123@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: sdw_utils: cs42l43: allow spk component names to be combined [+ + +]

Author: Maciej Strozek <mstrozek@opensource.cirrus.com>
Date:   Mon Apr 20 12:48:17 2026 +0100

    ASoC: sdw_utils: cs42l43: allow spk component names to be combined
    
    [ Upstream commit 87a3f5c8ac2096e9406ce2ed3bf5b9bc1589a92d ]
    
    Move handling of cs42l43-spk component string into SOF mechanism [1]
    which will allow it to be aggregated with other speakers.
    Likewise handle the cs35l56-bridge special case which should not be
    combined to keep compatibility with UCM.
    
    Link: https://github.com/thesofproject/linux/pull/5445 [1]
    Link: https://github.com/alsa-project/alsa-ucm-conf/pull/747
    Reviewed-by: Bard Liao <yung-chuan.liao@linux.intel.com>
    Signed-off-by: Maciej Strozek <mstrozek@opensource.cirrus.com>
    Suggested-by: Aaron Ma <aaron.ma@canonical.com>
    Tested-by: Aaron Ma <aaron.ma@canonical.com>
    Link: https://patch.msgid.link/20260420114823.194226-1-mstrozek@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Stable-dep-of: 5a30862dec5a ("ASoC: sdw_utils: Check speaker component string allocation")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: soc-utils: Add missing va_end in snd_soc_ret() [+ + +]

Author: Robertus Diawan Chris <robertusdchris@gmail.com>
Date:   Tue May 19 12:40:24 2026 +0700

    ASoC: soc-utils: Add missing va_end in snd_soc_ret()
    
    [ Upstream commit 298a43b54432fbc3a32949a94c72544ee18c8c00 ]
    
    The default case in snd_soc_ret() use va_start without va_end to
    cleanup "args" object which can cause undefined behavior. So, add
    missing va_end to cleanup "args" object.
    
    This is reported by Coverity Scan as "Missing varargs init or cleanup".
    
    Fixes: 943116ba2a6a ("ASoC: add common snd_soc_ret() and use it")
    Signed-off-by: Robertus Diawan Chris <robertusdchris@gmail.com>
    Link: https://patch.msgid.link/20260519054024.274741-1-robertusdchris@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ASoC: SOF: amd: Fix error code handling in psp_send_cmd() [+ + +]

Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Mon May 11 10:36:36 2026 -0500

    ASoC: SOF: amd: Fix error code handling in psp_send_cmd()
    
    [ Upstream commit 2c7b1227e582e88db7917412dca4e752c1aff691 ]
    
    The smn_read_register() helper returns negative error codes on failure
    or the register value on success. When used with read_poll_timeout(),
    the return value is stored in the 'data' variable.
    
    Currently 'data' is declared as u32, which causes negative error codes
    to be cast to large positive values. This makes the condition 'data > 0'
    incorrectly treat errors as success.
    
    Fix by changing 'data' from u32 to int, matching the pattern used in
    psp_mbox_ready() which correctly handles the same helper function.
    
    Reported-by: Dan Carpenter <error27@gmail.com>
    Closes: https://lore.kernel.org/linux-sound/agGES8vWrLOrBu28@stanley.mountain/
    Fixes: f120cf33d232 ("ASoC: SOF: amd: Use AMD_NODE")
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Link: https://patch.msgid.link/20260511153638.724810-1-mario.limonciello@amd.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ata: libata-scsi: do not needlessly defer commands when using PMP with FBS [+ + +]

Author: Niklas Cassel <cassel@kernel.org>
Date:   Thu May 14 09:39:02 2026 +0200

    ata: libata-scsi: do not needlessly defer commands when using PMP with FBS
    
    commit 759e8756da00aa115d504a18155b1d1ee1cc12e8 upstream.
    
    The ACS specification does not allow a non-NCQ command to be issued while
    an NCQ command is outstanding.
    
    Commit 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation")
    introduced a feature where a deferred non-NCQ command gets issued from a
    workqueue. The design stores a single non-NCQ command per port.
    
    However, when using Port Multipliers (PMPs), specifically PMPs that
    support FIS-Based Switching (FBS), non-NCQ and NCQ commands can be mixed
    on the same port, just not for the same link, see e.g. ata_std_qc_defer()
    which is, and always has operated on a per-link basis.
    
    Therefore, move the deferred_qc from struct ata_port to struct ata_link.
    This way, when using a PMP with FBS, we will not needlessly defer commands
    to all other links, just because one link issued a non-NCQ command while
    having an NCQ command outstanding. Only commands for that specific link
    will be deferred. This is in line with how PMPs with FBS worked before
    commit 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation").
    
    Fixes: 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation")
    Tested-by: Tommy Kelly <linux@tkel.ly>
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ata: libata-scsi: do not use the deferred QC feature for ATA_DEFER_PORT [+ + +]

Author: Niklas Cassel <cassel@kernel.org>
Date:   Thu May 14 09:39:00 2026 +0200

    ata: libata-scsi: do not use the deferred QC feature for ATA_DEFER_PORT
    
    commit ce4548807d2e4ae48fd0dbe38865467369877913 upstream.
    
    The deferred QC feature was meant to handle mixed NCQ and non-NCQ commands,
    i.e. for return value ATA_DEFER_LINK.
    
    ATA_DEFER_PORT is returned by PATA drivers, but also certain SATA drivers
    like sata_mv and sata_sil24 that uses ap->excl_link to workaround hardware
    bugs in these HBAs. Regardless of the reason, using the deferred QC feature
    for ATA_DEFER_PORT is always wrong, and will break the ap->excl_link usage
    of the SATA drivers that rely on that feature.
    
    Modify ata_scsi_qc_issue() to only use the deferred QC feature when mixing
    NCQ and non-NCQ commands, i.e. ATA_DEFER_LINK.
    
    Fixes: 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation")
    Tested-by: Tommy Kelly <linux@tkel.ly>
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ata: libata-scsi: do not use the deferred QC feature on PMPs with CBS [+ + +]

Author: Niklas Cassel <cassel@kernel.org>
Date:   Thu May 14 09:39:01 2026 +0200

    ata: libata-scsi: do not use the deferred QC feature on PMPs with CBS
    
    commit f233124fb36cd57ef09f96d517a38ab4b902e15e upstream.
    
    When using Port Multipliers (PMPs) with Command-Based Switching (CBS), you
    can only issue commands to one link at a time. For PMPs with CBS, there is
    already code to handle commands being sent to different links in
    sata_pmp_qc_defer_cmd_switch() using ap->excl_link. sata_sil24 also makes
    use of ap->excl_link.
    
    A user on the list reported that commit 0ea84089dbf6 ("ata: libata-scsi:
    avoid Non-NCQ command starvation") broke PMPs with CBS. The commit
    introduced code that stores a deferred qc in ap->deferred_qc, to later be
    issued via a workqueue. It turns out that this change is incompatible with
    the existing ap->excl_link handling used by PMPs with CBS.
    
    Thus, modify sata_pmp_qc_defer_cmd_switch() and sil24_qc_defer() to return
    ATA_DEFER_LINK_EXCL, and make sure that the deferred QC handling via
    workqueue is not used for this return value.
    
    This way, PMPs with CBS will work once again. Note that the starvation
    referenced in commit 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ
    command starvation") can only happen on libsas ports, and libsas does not
    support Port Multipliers, thus there is no harm of reverting back to the
    previous way of deferring commands for PMPs with CBS.
    
    Non-libsas ports connected to anything but a PMP with CBS (e.g. a normal
    drive or a PMP with FBS) will continue using the deferred workqueue, since
    it does result in lower completion latencies for non-NCQ commands, even
    though the workqueue is not strictly needed to avoid starvation for
    non-libsas ports.
    
    If we want to modify the scope of the workqueue issuing to also handle
    PMPs with CBS, then we should ensure that we can save both NCQ and non-NCQ
    commands in ap->deferred_qc, while also removing the existing PMP CBS
    handling using ap->excl_link, such that we don't duplicate features.
    
    While at it, also add a comment explaining how the ap->excl_link mechanism
    works.
    
    Fixes: 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation")
    Tested-by: Tommy Kelly <linux@tkel.ly>
    Reported-by: Tommy Kelly <linux@tkel.ly>
    Closes: https://lore.kernel.org/linux-ide/ce09cc21-a8e9-4845-b205-35411e22fba9@tkel.ly/
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ata: libata-scsi: improve readability of ata_scsi_qc_issue() [+ + +]

Author: Niklas Cassel <cassel@kernel.org>
Date:   Thu May 14 09:38:59 2026 +0200

    ata: libata-scsi: improve readability of ata_scsi_qc_issue()
    
    commit 360190bd965f93794d5f5685a6de22ce6da2b672 upstream.
    
    Improve readability of ata_scsi_qc_issue().
    
    No functional changes.
    
    Tested-by: Tommy Kelly <linux@tkel.ly>
    Reviewed-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Niklas Cassel <cassel@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: bla: avoid double decrement of bla.num_requests [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Tue May 12 09:13:31 2026 +0200

    batman-adv: bla: avoid double decrement of bla.num_requests
    
    commit 83ab69bd12b80f6ea169c8bea6977701b53a043d upstream.
    
    The bla.num_requests is increased when no request_sent was in progress. And
    it is decremented in various places (announcement was received, backbone is
    purged, periodic work). But the check if the request_sent is actually set
    to a specific state and the atomic_dec/_inc are not safe because they are
    not atomic (TOCTOU) and multiple such code portions can run concurrently.
    
    At the same time, it is necessary to modify request_sent (state) and
    bla.num_requests atomically. Otherwise batadv_bla_send_request() might set
    request_sent to 1 and is interrupted.  batadv_handle_announce() can then
    set request_sent back to 0 and decrement num_requests before
    batadv_bla_send_request() incremented it.
    
    The two operations must therefore be locked. And since state (request_sent)
    and wait_periods are only accessed inside this lock, they can be converted
    to simpler datatypes. And to avoid that the bla.num_requests is touched by
    a parallel running context with a valid backbone_gw reference after
    batadv_bla_purge_backbone_gw() ran, a third state "stopped" is required to
    correctly signal that a backbone_gw is in the state of being cleaned up.
    
    Cc: stable@kernel.org
    Fixes: 23721387c409 ("batman-adv: add basic bridge loop avoidance code")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: bla: avoid NULL-ptr deref for claim via dropped interface [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Tue May 19 09:23:49 2026 +0200

    batman-adv: bla: avoid NULL-ptr deref for claim via dropped interface
    
    commit f80d3d98d2ff78d9e2fe5d68b1f45948c4f7bd24 upstream.
    
    Without rtnl_lock held, a hardif might be retrieved as primary interface of
    a meshif, but then (while operating on this interface) getting decoupled
    from the mesh interface. In this case, the meshif still exists but the
    pointer from the primary hardif to the meshif is set to NULL.
    
    The mesh_iface must be checked first to be non-NULL before continuing to
    send an ARP request using meshif.
    
    Cc: stable@kernel.org
    Fixes: 23721387c409 ("batman-adv: add basic bridge loop avoidance code")
    Reported-by: Ido Schimmel <idosch@nvidia.com>
    Reported-by: syzbot+9fdcc9f05a98a540b816@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=9fdcc9f05a98a540b816
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: bla: fix report_work leak on backbone_gw purge [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sun May 10 11:43:20 2026 +0200

    batman-adv: bla: fix report_work leak on backbone_gw purge
    
    commit 0459430add32ea41f3e2ef9351610e6d33627a6b upstream.
    
    batadv_bla_purge_backbone_gw() removes stale backbone gateway entries,
    but fails to properly handle their associated report_work:
    
    - If report_work is running, the purge must wait for it to finish before
      freeing the backbone_gw, otherwise the worker may access freed memory
      (e.g. bat_priv).
    - If report_work is pending, the purge must cancel it and release the
      reference held for that pending work item.
    
    The previous implementation called hlist_for_each_entry_safe() inside a
    spin_lock_bh() section, but cancel_work_sync() may sleep and therefore
    cannot be called from within a spinlock-protected region.
    
    Restructure the loop to handle one entry per spinlock critical section:
    acquire the lock, find the next entry to purge, remove it from the hash
    list, then release the lock before calling cancel_work_sync() and
    dropping the hash_entry reference. Repeat until no more entries require
    purging.
    
    Cc: stable@kernel.org
    Fixes: 23721387c409 ("batman-adv: add basic bridge loop avoidance code")
    Reviewed-by: Simon Wunderlich <sw@simonwunderlich.de>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: clear current gateway during teardown [+ + +]

Author: Ruijie Li <ruijieli51@gmail.com>
Date:   Thu May 14 16:13:25 2026 +0800

    batman-adv: clear current gateway during teardown
    
    commit a340a51ed801eab7bb454150c226323b865263cc upstream.
    
    batadv_gw_node_free() removes the gateway list entries during mesh teardown,
    but it does not clear the currently selected gateway. This leaves stale
    gateway state behind across cleanup and can break a later mesh recreation.
    
    Clear bat_priv->gw.curr_gw before walking the gateway list so the selected
    gateway reference is dropped as part of teardown.
    
    Fixes: 2265c1410864 ("batman-adv: gateway election code refactoring")
    Cc: stable@kernel.org
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Ruijie Li <ruijieli51@gmail.com>
    Signed-off-by: Zhanpeng Li <lzhanpeng2025@lzu.edu.cn>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: dat: handle forward allocation error [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed May 13 09:01:34 2026 +0200

    batman-adv: dat: handle forward allocation error
    
    commit 2d8826a2d3657cea66fb0370f9e521575a673871 upstream.
    
    batadv_dat_forward_data() calls pskb_copy_for_clone() to duplicate an skb
    for each DHT candidate, but does not check the return value before passing
    it to batadv_send_skb_prepare_unicast_4addr(). That function dereferences
    the skb unconditionally, so a failed allocation triggers a NULL pointer
    dereference.
    
    Skip forwarding to the current DHT candidate on allocation failure.
    
    Cc: stable@kernel.org
    Fixes: 785ea1144182 ("batman-adv: Distributed ARP Table - create DHT helper functions")
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Reviewed-by: Yuan Tan <yuantan098@gmail.com>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: fix fragment reassembly length accounting [+ + +]

Author: Ruide Cao <caoruide123@gmail.com>
Date:   Wed May 13 11:58:15 2026 +0800

    batman-adv: fix fragment reassembly length accounting
    
    commit 9cd3f16c320bfdadd4509358122368deb56a5741 upstream.
    
    batman-adv keeps a running payload length for queued fragments and uses it
    to validate a fragment chain before reassembly.
    
    That accounting currently allows the accumulated fragment length to be
    truncated during updates. As a result, malformed fragment chains can
    bypass the intended validation and drive reassembly with inconsistent
    length state, leading to a local denial of service.
    
    Fix the accounting by storing the accumulated length in a length-typed
    field and rejecting update overflows before the existing validation logic
    runs.
    
    The fix was verified against the original reproducer and against valid
    fragment reassembly paths.
    
    Fixes: 610bfc6bc99b ("batman-adv: Receive fragmented packets and merge")
    Cc: stable@kernel.org
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Ruide Cao <caoruide123@gmail.com>
    Tested-by: Ren Wei <enjou1224z@gmail.com>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: fix tp_meter counter underflow during shutdown [+ + +]

Author: Luxiao Xu <rakukuip@gmail.com>
Date:   Mon May 11 18:52:09 2026 +0200

    batman-adv: fix tp_meter counter underflow during shutdown
    
    commit 94f3b133168d1c49895e7cc6afbcf1cc0b354602 upstream.
    
    batadv_tp_sender_shutdown() unconditionally decrements the "sending"
    atomic counter. If multiple paths (e.g. timeout, user cancel, and
    normal finish) call this function, the counter can underflow to -1.
    
    Since the sender logic treats any non-zero value as "still sending",
    a negative value causes the sender kthread to loop indefinitely.
    This leads to a use-after-free when the interface is removed while
    the zombie thread is still active.
    
    Fix this by using atomic_xchg() to ensure the counter only transitions
    from 1 to 0 once.
    
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Cc: stable@kernel.org
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Luxiao Xu <rakukuip@gmail.com>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    [sven: added missing change in batadv_tp_send]
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: frag: disallow unicast fragment in fragment [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed May 13 09:01:36 2026 +0200

    batman-adv: frag: disallow unicast fragment in fragment
    
    commit bc62216dc8e221e3781afa14430f45208bfa9af9 upstream.
    
    batadv_frag_skb_buffer() is called by batadv_batman_skb_recv() when a
    BATADV_UNICAST_FRAG packet is received. Once all fragments are collected
    and the packet is reassembled, batadv_recv_frag_packet() calls
    batadv_batman_skb_recv() again to process the defragmented payload.
    
    A malicious sender can craft a BATADV_UNICAST_FRAG packet whose reassembled
    payload is itself a BATADV_UNICAST_FRAG packet (matryoshka-style nesting).
    Each nesting level recurses through batadv_batman_skb_recv() without bound,
    growing the kernel stack until it is exhausted.
    
    Since refragmentation or fragments in fragments are not actually allowed,
    discard all packets which are still BATADV_UNICAST_FRAG packets after the
    defragmentation process.
    
    Cc: stable@kernel.org
    Fixes: 610bfc6bc99b ("batman-adv: Receive fragmented packets and merge")
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Reviewed-by: Yuan Tan <yuantan098@gmail.com>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: iv: recover OGM scheduling after forward packet error [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Fri May 15 22:00:40 2026 +0200

    batman-adv: iv: recover OGM scheduling after forward packet error
    
    commit aa3153bd139a6c48667dcd02608d3b2c80bff02c upstream.
    
    When batadv_iv_ogm_schedule_buff() fails to allocate and queue a forward
    packet for OGM transmission, the work item that drives periodic OGM
    scheduling is never re-armed. This silently halts transmission of the
    node's own OGMs on the affected interface — only OGMs from other peers
    continue to be aggregated and forwarded.
    
    Fix this by tracking whether batadv_iv_ogm_queue_add() (and transitively
    batadv_iv_ogm_aggregate_new()) successfully scheduled a forward packet.
    When scheduling fails, batadv_iv_ogm_schedule_buff() falls back to queuing
    a dedicated recovery work item (reschedule_work) that fires after one
    originator interval and calls batadv_iv_ogm_schedule() again.
    
    Cc: stable@kernel.org
    Fixes: c6c8fea29769 ("net: Add batman-adv meshing protocol")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: mcast: fix use-after-free in orig_node RCU release [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Thu May 14 19:22:02 2026 +0200

    batman-adv: mcast: fix use-after-free in orig_node RCU release
    
    commit 20c2d6a20ca936f5aaa6dd40f73f262ac45c87cc upstream.
    
    batadv_mcast_purge_orig() removes entries from RCU-protected hlists but
    does not wait for an RCU grace period before returning. Concurrent RCU
    readers may still accesses references to those entries at the point of
    removal. RCU-protected readers trying to operate on entries like
    orig->mcast_want_all_ipv6_node will then access already freed memory.
    
    Fix this by moving batadv_mcast_purge_orig() to batadv_orig_node_release(),
    just before the call_rcu() invocation. This ensures RCU readers that were
    active at purge time have drained before the orig_node memory is reclaimed.
    
    Cc: stable@kernel.org
    Fixes: ab49886e3da7 ("batman-adv: Add IPv4 link-local/IPv6-ll-all-nodes multicast support")
    Acked-by: Linus Lüssing <linus.luessing@c0d3.blue>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tp_meter: avoid role confusion in tp_list [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 16 12:33:41 2026 +0200

    batman-adv: tp_meter: avoid role confusion in tp_list
    
    commit ff24f2ecfd94c07a2b89bac497433e3b23271cac upstream.
    
    Session lookups in tp_list matched only on destination address (and
    optionally session ID), leaving role validation to the caller. If two
    sessions with the same other_end coexisted (one as sender, one as receiver)
    a lookup could silently return the wrong one, causing the caller's role to
    bail out early, potentially skipping necessary cleanup.
    
    Move the role check into the lookup functions themselves so the correct
    entry is always returned, or none at all. Since batadv_tp_start()
    legitimately needs to detect any active session to a destination regardless
    of role, introduce a dedicated helper for that case rather than bending the
    existing lookup semantics.
    
    Cc: stable@kernel.org
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tp_meter: avoid use of uninit sender vars [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed May 13 09:01:35 2026 +0200

    batman-adv: tp_meter: avoid use of uninit sender vars
    
    commit 6c65cf23d4c6170fcf5714c32aa64689718cb142 upstream.
    
    batadv_tp_recv_ack() and batadv_tp_stop() are only valid for tp_vars in the
    BATADV_TP_SENDER role. When called with a BATADV_TP_RECEIVER role, it
    proceeds to read sender-only members that were never initialized, leading
    to undefined behavior.
    
    This can be triggered when a node that is currently acting as a receiver in
    an ongoing tp_meter session receives a malicious ACK packet.
    
    Guard against this by checking tp_vars->role immediately after the
    lookup and bailing out if it is not BATADV_TP_SENDER, before any of
    those members are accessed.
    
    Cc: stable@kernel.org
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Reviewed-by: Yuan Tan <yuantan098@gmail.com>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tp_meter: directly shut down timer on cleanup [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed May 13 10:43:54 2026 +0200

    batman-adv: tp_meter: directly shut down timer on cleanup
    
    commit d5487249a81ea658717614009c8f46acc5b7101a upstream.
    
    batadv_tp_sender_cleanup() was calling timer_delete_sync() followed by
    timer_delete() to guard against the timer handler re-arming itself between
    the two calls. This double-deletion hack relied on the sending status being
    set to 0 to suppress re-arming.
    
    Replace both calls with a single timer_shutdown_sync(). This function both
    waits for any running timer callback to complete (like timer_delete_sync())
    and permanently disarms the timer so it cannot be re-armed afterwards,
    making re-arming prevention unconditional and self-documenting.
    
    The re-arming property is also required because otherwise:
    
    1. context 0 (batadv_tp_recv_ack()) checks in
       batadv_tp_reset_sender_timer() if sending is still 1 -> it is
    2. context 1 changes in batadv_tp_sender_shutdown() sending to 0 and in
       this process forces the kthread to stop timer in
       batadv_tp_sender_cleanup()
    3. context 0 continues in batadv_tp_reset_sender_timer() and rearms the
       timer -> but the reference for it is already gone
    
    Cc: stable@kernel.org
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tp_meter: fix race condition in send error reporting [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed May 13 23:38:54 2026 +0200

    batman-adv: tp_meter: fix race condition in send error reporting
    
    commit 71dce47f0758537fff78fddb5fb0d4632d29b29f upstream.
    
    batadv_tp_sender_shutdown() previously used two separate variables to track
    session state: sending (an atomic flag indicating whether the session was
    active) and reason (a plain enum storing the stop reason). This introduced
    a race window between the two writes: after sending was cleared to 0,
    batadv_tp_send() could observe the stopped state and call
    batadv_tp_sender_end() before reason was written, causing the wrong stop
    reason to be reported to the caller.
    
    Fix this by consolidating both variables into a single atomic send_result,
    which holds 0 while the session is running and the stop reason once it
    ends.
    
    Cc: stable@kernel.org
    Fixes: 33a3bb4a3345 ("batman-adv: throughput meter implementation")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tp_meter: fix tp_vars reference leak in receiver shutdown [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sun May 10 11:31:03 2026 +0200

    batman-adv: tp_meter: fix tp_vars reference leak in receiver shutdown
    
    commit 77098e4bea37af51d3962efa88a5af2ea5e1ac57 upstream.
    
    The receiver shutdown timer handler, batadv_tp_receiver_shutdown(), is
    responsible for releasing the tp_vars reference it holds. However, the
    existing logic for coordinating this release with batadv_tp_stop_all() was
    flawed.
    
    timer_shutdown_sync() guarantees the timer will not fire again after it
    returns, but it returns non-zero only when the timer was pending at the
    time of the call. If the timer had already expired (and
    batadv_tp_stop_all() would unsucessfully try to  rearm itself),
    batadv_tp_stop_all() skips its batadv_tp_vars_put(), and
    batadv_tp_receiver_shutdown() fails to put its own reference as well.
    
    Fix this by introducing a new atomic variable receiving that is set to 1
    when the receiver is initialized and cleared atomically with atomic_xchg()
    by whichever side claims it first. Only the side that observes the
    transition from 1 to 0 is responsible for releasing the tp_vars timer
    reference, eliminating the uncertainty.
    
    Cc: stable@kernel.org
    Fixes: 3d3cf6a7314a ("batman-adv: stop tp_meter sessions during mesh teardown")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tt: avoid empty VLAN responses [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 2 20:47:34 2026 +0200

    batman-adv: tt: avoid empty VLAN responses
    
    commit fa1bd704940b5bcbc32c0b28db9167405c8ee5e0 upstream.
    
    The commit 16116dac2339 ("batman-adv: prevent TT request storms by not
    sending inconsistent TT TLVLs") added checks to the local (direct) TT
    response code. But the response can also be done indirectly by another node
    using the global TT state. To avoid such inconsistency states reported in
    the original fix, also avoid sending empty VLANs for replies from the
    global TT state.
    
    Cc: stable@kernel.org
    Fixes: 7ea7b4a14275 ("batman-adv: make the TT CRC logic VLAN specific")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tt: fix negative last_changeset_len [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 2 19:53:21 2026 +0200

    batman-adv: tt: fix negative last_changeset_len
    
    commit fc92cdfcb295cefa4344d71a527d61b638b7bfc4 upstream.
    
    batadv_piv_tt::last_changeset_len len was declared as s16, but the field is
    never intended to hold a negative value. When a value greater than 32767 is
    assigned, it wraps to a negative signed integer.
    
    In batadv_send_my_tt_response(), last_changeset_len is temporarily widened
    to s32. The incorrectly negative s16 value propagates into the s32, causing
    batadv_tt_prepare_tvlv_local_data() to allocate a full sized buffer but
    populates only a small portion of it with the collected changeset. All
    remaining bits are kept uninitialized.
    
    Using an u16 avoids this type confusion and ensures that no (negative) sign
    extension is performed in batadv_send_my_tt_response().
    
    Cc: stable@kernel.org
    Fixes: a73105b8d4c7 ("batman-adv: improved client announcement mechanism")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tt: fix negative tt_buff_len [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 2 19:53:21 2026 +0200

    batman-adv: tt: fix negative tt_buff_len
    
    commit b64963a2ceeb7529310b6cf253a1e540784422f4 upstream.
    
    batadv_orig_node::tt_buff_len was declared as s16, but the field is never
    intended to hold a negative value. When a value greater than 32767 is
    assigned, it wraps to a negative signed integer.
    
    In batadv_send_other_tt_response(), tt_buff_len is temporarily widened to
    s32. The incorrectly negative s16 value propagates into the s32, causing
    batadv_tt_prepare_tvlv_global_data() to allocate a full sized buffer but
    populates only a small portion of it with the collected changeset. All
    remaining bits are kept uninitialized.
    
    Using an u16 avoids this type confusion and ensures that no (negative) sign
    extension is performed in batadv_send_other_tt_response().
    
    Cc: stable@kernel.org
    Fixes: a73105b8d4c7 ("batman-adv: improved client announcement mechanism")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tt: fix TOCTOU race for reported vlans [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 2 19:47:11 2026 +0200

    batman-adv: tt: fix TOCTOU race for reported vlans
    
    commit 94d27005016be15ffc638b2ecbc4d58805ad7b48 upstream.
    
    The local TT based TVLV is generated by first checking the number of VLANs
    which have at least one TT entry. A new buffer with the correct size for
    the VLANs is then allocated. Only then, the list of VLANs s used to fill
    the VLAN entries in the buffer. During this time, the meshif_vlan_list_lock
    is held. But the actual number of TT entries of each VLAN can still
    increase during this time - just not the number of VLANs in the list.
    
    But the prefilter used in the buffer size calculation might still cause an
    increase of the number of VLANs which need to be stored. Simply because a
    VLAN might now suddenly have at least one entry when it had none in the
    pre-alloc check - and then needs to occupy space which was not allocated.
    
    It is better to overestimate the buffer size at the beginning and then fill
    the buffer only with the VLANs which are not empty.
    
    Cc: stable@kernel.org
    Fixes: 16116dac2339 ("batman-adv: prevent TT request storms by not sending inconsistent TT TLVLs")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tt: prevent TVLV entry number overflow [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 2 21:25:19 2026 +0200

    batman-adv: tt: prevent TVLV entry number overflow
    
    commit 99d9958fa10fb684b2a8e2c48a8d704122721420 upstream.
    
    The helpers to prepare the buffers for the local and global TT based
    replies are trying to sum up all TT entries which can be found for each
    VLAN. In theory, this sum can be too big for an u16 and therefore overflow.
    A too small buffer would then be allocated for the TVLV.
    
    The too small buffer will be handled gracefully by
    batadv_tt_tvlv_generate() and is not causing a buffer overflow - just a
    truncated reply. But this overflow shouldn't have happened in the first and
    the too small buffer should never have been allocated when an overflow was
    detected.
    
    Cc: stable@kernel.org
    Fixes: 7ea7b4a14275 ("batman-adv: make the TT CRC logic VLAN specific")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tt: reject oversized local TVLV buffers [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 2 19:08:37 2026 +0200

    batman-adv: tt: reject oversized local TVLV buffers
    
    commit 1e9fab756f8395096d5bba7be0c373c4c8f5d165 upstream.
    
    The commit 3a359bf5c61d ("batman-adv: reject oversized global TT response
    buffers") added a check to ensure that a global return buffer size can be
    stored in an u16. The same buffer handling also exists for the local data
    buffer but was not touched.
    
    A similar check should be also be in place for the local TVLV buffer. It
    doesn't have the similar attack surface because it is only generated from
    locally discovered MAC addresses but the dynamic nature could still cause
    temporarily to large buffers.
    
    Cc: stable@kernel.org
    Fixes: 7ea7b4a14275 ("batman-adv: make the TT CRC logic VLAN specific")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tvlv: abort OGM send on tvlv append failure [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Thu May 14 16:33:12 2026 +0200

    batman-adv: tvlv: abort OGM send on tvlv append failure
    
    commit 501368506563e151b322c8c3f228b796e615b90d upstream.
    
    batadv_tvlv_container_ogm_append() could fail in two ways: a memory
    allocation failure when resizing the packet buffer, or the tvlv data
    exceeding U16_MAX bytes. In both cases the function previously returned the
    old (now stale) tvlv_value_len rather than signalling an error, causing the
    OGM/OGM2 send path to transmit a packet whose TVLV length field no longer
    matched the actual buffer contents. And because it also didn't fill in the
    new TVLV data, sending either uninitialized or corrupted data on the wire.
    
    All errors in batadv_tvlv_container_ogm_append() must be forwarded to the
    caller. And the caller must abort the send of the OGM2. For B.A.T.M.A.N.
    IV, it is currently not allowed to abort the send. The non-TVLV part of the
    OGM must be queued up instead.
    
    Cc: stable@kernel.org
    Fixes: ef26157747d4 ("batman-adv: tvlv - basic infrastructure")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: tvlv: reject oversized TVLV packets [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 9 21:55:29 2026 +0200

    batman-adv: tvlv: reject oversized TVLV packets
    
    commit f50487e3566358b2b982b7801945e858c78ad9ab upstream.
    
    batadv_tvlv_container_ogm_append() builds a TVLV packet section from
    the tvlv.container_list. The total size of this section is computed by
    batadv_tvlv_container_list_size(), which sums the sizes of all registered
    containers.
    
    The return type and accumulator in batadv_tvlv_container_list_size() were
    u16. If the accumulated size exceeds U16_MAX, the value wraps around,
    causing the subsequent allocation in batadv_tvlv_container_ogm_append()
    to be undersized. The memcpy-style copy that follows would then write
    beyond the end of the allocated buffer, corrupting kernel memory.
    
    Fix this by widening the return type of batadv_tvlv_container_list_size()
    to size_t. In batadv_tvlv_container_ogm_append(), check the computed length
    against U16_MAX before proceeding, and bail out as if the allocation had
    failed when the limit is exceeded.
    
    Cc: stable@kernel.org
    Fixes: ef26157747d4 ("batman-adv: tvlv - basic infrastructure")
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Reviewed-by: Yuan Tan <yuantan098@gmail.com>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

batman-adv: v: stop OGMv2 on disabled interface [+ + +]

Author: Sven Eckelmann <sven@narfation.org>
Date:   Sat May 9 22:44:12 2026 +0200

    batman-adv: v: stop OGMv2 on disabled interface
    
    commit f8ce8b8331a1bc44ad4905886a482214d428b253 upstream.
    
    When a batadv_hard_iface is disabled, its mesh_iface pointer is set to
    NULL. However, batadv_v_ogm_send_meshif() may still dispatch OGMs via
    batadv_v_ogm_queue_on_if() for interfaces that have since lost their
    mesh_iface association. This results in a NULL pointer dereference when
    batadv_v_ogm_queue_on_if() unconditionally calls netdev_priv() on the
    now NULL hard_iface->mesh_iface to retrieve the batadv_priv.
    
    It is necessary to ensure that the batadv_v_ogm_queue_on_if() checks that
    it is using the same mesh_iface for which batadv_v_ogm_send_meshif() was
    called.
    
    Cc: stable@kernel.org
    Fixes: 0da0035942d4 ("batman-adv: OGMv2 - add basic infrastructure")
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Reviewed-by: Yuan Tan <yuantan098@gmail.com>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

blk-mq: pop cached request if it is usable [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Thu May 21 12:02:53 2026 -0700

    blk-mq: pop cached request if it is usable
    
    [ Upstream commit dc278e9bf2b9513a763353e6b9cc21e0f532954e ]
    
    When submitting a bio to blk-mq, if the task should sleep after peeking
    a cached request, but before it pops it, the plug flushes and calls
    blk_mq_free_plug_rqs, freeing the cached_rqs. This creates a
    use-after-free bug. Fix this by popping the cached request before any
    possible blocking calls if it is suitable for use.
    
    Popping this request first holds a queue reference, so avoid any
    serialization races with queue freezes and can safely proceed with
    dispatching that request to the driver. This potentially increases a
    timing window from when a driver wants to freeze its queue to when
    requests stop being dispatched. That scenario is off the fast path
    though, and drivers need to appropriately handle requests during a
    freeze request anyway.
    
    The downside is the popped element needs to be individually freed when
    we performed a bio plug merge. The cached request would have had to be
    freed later anyway, but this patch does it inline with building the plug
    list instead of after flushing it.
    
    Fixes: b0077e269f6c1 ("blk-mq: make sure active queue usage is held for bio_integrity_prep()")
    Fixes: 7b4f36cd22a65 ("block: ensure we hold a queue reference when using queue limits")
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Link: https://patch.msgid.link/20260521190253.242065-1-kbusch@meta.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: allow submitting all zone writes from a single context [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Fri Feb 27 22:19:49 2026 +0900

    block: allow submitting all zone writes from a single context
    
    [ Upstream commit 1365b6904fd050bf22ab9f3df375a396de5837a1 ]
    
    In order to maintain sequential write patterns per zone with zoned block
    devices, zone write plugging issues only a single write BIO per zone at
    any time. This works well but has the side effect that when large
    sequential write streams are issued by the user and these streams cross
    zone boundaries, the device ends up receiving a discontiguous set of
    write commands for different zones. The same also happens when a user
    writes simultaneously at high queue depth multiple zones: the device
    does not see all sequential writes per zone and receives discontiguous
    writes to different zones. While this does not affect the performance of
    solid state zoned block devices, when using an SMR HDD, this pattern
    change from sequential writes to discontiguous writes to different zones
    significantly increases head seek which results in degraded write
    throughput.
    
    In order to reduce this seek overhead for rotational media devices,
    introduce a per disk zone write plugs kernel thread to issue all write
    BIOs to zones. This single zone write issuing context is enabled for
    any zoned block device that has a request queue flagged with the new
    QUEUE_ZONED_QD1_WRITES flag.
    
    The flag QUEUE_ZONED_QD1_WRITES is visible as the sysfs queue attribute
    zoned_qd1_writes for zoned devices. For regular block devices, this
    attribute is not visible. For zoned block devices, a user can override
    the default value set to force the global write maximum queue depth of
    1 for a zoned block device, or clear this attribute to fallback to the
    default behavior of zone write plugging which limits writes to QD=1 per
    sequential zone.
    
    Writing to a zoned block device flagged with QUEUE_ZONED_QD1_WRITES is
    implemented using a list of zone write plugs that have a non-empty BIO
    list. Listed zone write plugs are processed by the disk zone write plugs
    worker kthread in FIFO order, and all BIOs of a zone write plug are all
    processed before switching to the next listed zone write plug. A newly
    submitted BIO for a non-FULL zone write plug that is not yet listed
    causes the addition of the zone write plug at the end of the disk list
    of zone write plugs.
    
    Since the write BIOs queued in a zone write plug BIO list are
    necessarilly sequential, for rotational media, using the single zone
    write plugs kthread to issue all BIOs maintains a sequential write
    pattern and thus reduces seek overhead and improves write throughput.
    This processing essentially result in always writing to HDDs at QD=1,
    which is not an issue for HDDs operating with write caching enabled.
    Performance with write cache disabled is also not degraded thanks to
    the efficient write handling of modern SMR HDDs.
    
    A disk list of zone write plugs is defined using the new struct gendisk
    zone_wplugs_list, and accesses to this list is protected using the
    zone_wplugs_list_lock spinlock.  The per disk kthread
    (zone_wplugs_worker) code is implemented by the function
    disk_zone_wplugs_worker(). A reference on listed zone write plugs is
    always held until all BIOs of the zone write plug are processed by the
    worker kthread. BIO issuing at QD=1 is driven using a completion
    structure (zone_wplugs_worker_bio_done) and calls to blk_io_wait().
    
    With this change, performance when sequentially writing the zones of a
    30 TB SMR SATA HDD connected to an AHCI adapter changes as follows
    (1MiB direct I/Os, results in MB/s unit):
    
                        +--------------------+
                        |   Write BW (MB/s)  |
     +------------------+----------+---------+
     | Sequential write | Baseline | Patched |
     |  Queue Depth     | 6.19-rc8 |         |
     +------------------+----------+---------+
     | 1                | 244      | 245     |
     | 2                | 244      | 245     |
     | 4                | 245      | 245     |
     | 8                | 242      | 245     |
     | 16               | 222      | 246     |
     | 32               | 211      | 245     |
     | 64               | 193      | 244     |
     | 128              | 112      | 246     |
     +------------------+----------+---------+
    
    With the current code (baseline), as the sequential write stream crosses
    a zone boundary, higher queue depth creates a gap between the
    last IO to the previous zone and the first IOs to the following zones,
    causing head seeks and degrading performance. Using the disk zone
    write plugs worker thread, this pattern disappears and the maximum
    throughput of the drive is maintained, leading to over 100%
    improvements in throughput for high queue depth write.
    
    Using 16 fio jobs all writing to randomly chosen zones at QD=32 with 1
    MiB direct IOs, write throughput also increases significantly.
    
                        +--------------------+
                        |   Write BW (MB/s)  |
     +------------------+----------+---------+
     |   Random write   | Baseline | Patched |
     |  Number of zones | 6.19-rc7 |         |
     +------------------+----------+---------+
     | 1                | 191      | 192     |
     | 2                | 101      | 128     |
     | 4                | 115      | 123     |
     | 8                | 90       | 120     |
     | 16               | 64       | 115     |
     | 32               | 58       | 105     |
     | 64               | 56       | 101     |
     | 128              | 55       | 99      |
     +------------------+----------+---------+
    
    Tests using XFS shows that buffered write speed with 8 jobs writing
    files increases by 12% to 35% depending on the workload.
    
                        +--------------------+
                        |   Write BW (MB/s)  |
     +------------------+----------+---------+
     |     Workload     | Baseline | Patched |
     |                  | 6.19-rc7 |         |
     +------------------+----------+---------+
     | 256MiB file size | 212      | 238     |
     +------------------+----------+---------+
     | 4MiB .. 128 MiB  | 213      | 243     |
     | random file size |          |         |
     +------------------+----------+---------+
     | 2MiB .. 8 MiB    | 179      | 242     |
     | random file size |          |         |
     +------------------+----------+---------+
    
    Performance gains are even more significant when using an HBA that
    limits the maximum size of commands to a small value, e.g. HBAs
    controlled with the mpi3mr driver limit commands to a maximum of 1 MiB.
    In such case, the write throughput gains are over 40%.
    
                        +--------------------+
                        |   Write BW (MB/s)  |
     +------------------+----------+---------+
     |     Workload     | Baseline | Patched |
     |                  | 6.19-rc7 |         |
     +------------------+----------+---------+
     | 256MiB file size | 175      | 245     |
     +------------------+----------+---------+
     | 4MiB .. 128 MiB  | 174      | 244     |
     | random file size |          |         |
     +------------------+----------+---------+
     | 2MiB .. 8 MiB    | 171      | 243     |
     | random file size |          |         |
     +------------------+----------+---------+
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Reviewed-by: Bart Van Assche <bvanassche@acm.org>
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Stable-dep-of: 836efd35c472 ("block: fix handling of dead zone write plugs")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: avoid use-after-free in disk_free_zone_resources() [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Fri May 22 20:56:22 2026 +0900

    block: avoid use-after-free in disk_free_zone_resources()
    
    [ Upstream commit f6982769910ecddabdb5b8b9afdab0bb8b6668ac ]
    
    The function disk_update_zone_resources() may call
    disk_free_zone_resources() in case of error, and following this,
    blk_revalidate_disk_zones() will again calls disk_free_zone_resources() if
    disk_update_zone_resources() failed. If a zone worker thread is being used
    (which is the default for a rotational media zoned device),
    disk_free_zone_resources() will try to stop the zone worker thread twice
    because disk->zone_wplugs_worker is not reset to NULL when the worker
    thread is stopped the first time.
    
    In disk_free_zone_resources(), fix this by correctly clearing
    disk->zone_wplugs_worker to NULL when the worker thread is stopped.
    
    And while at it, since disk_free_zone_resources() is always called after a
    failed call to disk_update_zone_resources(), remove the unnecessary call
    to disk_free_zone_resources() in disk_update_zone_resources().
    
    Fixes: 1365b6904fd0 ("block: allow submitting all zone writes from a single context")
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://patch.msgid.link/20260522115622.588535-1-dlemoal@kernel.org
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: bio-integrity: Fix null-ptr-deref in bio_integrity_map_user() [+ + +]

Author: Sungwoo Kim <iam@sung-woo.kim>
Date:   Tue May 12 01:09:29 2026 -0400

    block: bio-integrity: Fix null-ptr-deref in bio_integrity_map_user()
    
    [ Upstream commit 8582792cf23b3d94674d4d838f7cde9a28d0fcaf ]
    
    pin_user_pages_fast() can partially succeed and return the number of
    pages that were actually pinned. However, the bio_integrity_map_user()
    does not handle this partial pinning. This leads to a general protection
    fault since bvec_from_pages() dereferences an unpinned page address,
    which is 0.
    
    To fix this, add a check to verify that all requested memory is pinned.
    If partial pinning occurs, unpin the memory and return -EFAULT.
    
    Kernel Oops:
    
    Oops: general protection fault, probably for non-canonical address 0xdffffc0000000001: 0000 [#1] SMP KASAN NOPTI
    KASAN: null-ptr-deref in range [0x0000000000000008-0x000000000000000f]
    CPU: 0 UID: 0 PID: 1061 Comm: nvme-passthroug Not tainted 7.0.0-11783-g90957f9314e8-dirty #16 PREEMPT(lazy)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.17.0-0-gb52ca86e094d-prebuilt.qemu.org 04/01/2014
    RIP: 0010:bio_integrity_map_user.cold+0x1b0/0x9d6
    
    Fixes: 492c5d455969 ("block: bio-integrity: directly map user buffers")
    Acked-by: Chao Shi <cshi008@fiu.edu>
    Acked-by: Weidong Zhu <weizhu@fiu.edu>
    Acked-by: Dave Tian <daveti@purdue.edu>
    Signed-off-by: Sungwoo Kim <iam@sung-woo.kim>
    Tested-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com>
    Link: https://github.com/linux-blktests/blktests/pull/244
    Link: https://patch.msgid.link/20260512050929.541397-2-iam@sung-woo.kim
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: don't overwrite bip_vcnt in bio_integrity_copy_user() [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Mon May 11 22:51:51 2026 +0100

    block: don't overwrite bip_vcnt in bio_integrity_copy_user()
    
    [ Upstream commit 637ad3a56a3b889527d1dacea6fea2a8bd648140 ]
    
    bio_integrity_add_page() already sets bip_vcnt to 1 for the bounce
    segment. Overwriting it with nr_vecs breaks bip_vcnt <= bip_max_vcnt
    on WRITE (bip_max_vcnt is 1), so the gap-merge checks in block/blk.h
    read past the bip_vec[] flex array. On READ the read is in bounds
    but lands on a saved user bvec instead of the bounce.
    
    The line was added for split propagation, but bio_integrity_clone()
    doesn't copy bip_vcnt and BIP_CLONE_FLAGS excludes BIP_COPY_USER.
    
    Fixes: 3991657ae707 ("block: set bip_vcnt correctly")
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://patch.msgid.link/20260511215151.346228-1-devnexen@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: fix handling of dead zone write plugs [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Wed May 13 20:11:29 2026 +0900

    block: fix handling of dead zone write plugs
    
    [ Upstream commit 836efd35c472d89c838d7b17ef339ddb3286ffc5 ]
    
    Shin'ichiro reported hard to reproduce unaligned write errors with zoned
    block devices. Under normal operation conditions (e.g. running XFS on an
    SMR disk), these errors are nearly impossible to trigger. But using a
    "slow" kernel with many debug options enables and some specific use
    cases (e.g. fio zbd test case 46), the errors can be reproduced fairly
    easily.
    
    The unaligned write errors come from mishandling a valid reference
    counting pattern of zone write plugs. Such pattern triggers for instance
    if a process A writes a zone (not necessarilly to the full state),
    another process B immediately resets the zone and immediately following
    the completion of the zone reset, starts issuing writes to the zone.
    With such pattern, in some cases, the zone write plugs worker thread of
    the device may still be holding a reference to the zone write plug of
    the zone taken when process A was writing to the zone. The following
    zone reset from process B marks the zone as dead but does not remove the
    zone write plug from the device hash table as a reference to the plug
    still exist. Once process B starts issuing new writes, the zone write
    plug is seen as dead and the writes from process B are immediately
    failed, despite this write pattern being perfectly legal.
    
    Fix this by allowing restoring a dead zone write plug to a live state if
    a write is issued to the zone when the zone is: marked as dead, empty
    and the write sector corresponds to the first sector of the zone (that
    is, the write is aligned to the zone write pointer). This is done with
    the new helper function disk_check_zone_wplug_dead(), which restores a
    dead zone write plug to a live state by clearing the BLK_ZONE_WPLUG_DEAD
    flag and restoring the initial reference to the zone write plug taken
    when the plug was added to the device hash table.
    
    Reported-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com>
    Fixes: b7d4ffb51037 ("block: fix zone write plug removal")
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Tested-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com>
    Link: https://patch.msgid.link/20260513111129.108809-1-dlemoal@kernel.org
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: recompute nr_integrity_segments in blk_insert_cloned_request [+ + +]

Author: Casey Chen <cachen@purestorage.com>
Date:   Mon May 11 15:22:30 2026 -0600

    block: recompute nr_integrity_segments in blk_insert_cloned_request
    
    [ Upstream commit 2c6e6a18a37b905cb584eb0dda3ae482162a81ca ]
    
    blk_insert_cloned_request() already recomputes nr_phys_segments
    against the bottom queue, because "the queue settings related to
    segment counting may differ from the original queue." The exact same
    reasoning applies to integrity segments: a stacked driver's underlying
    queue can have tighter virt_boundary_mask, seg_boundary_mask, or
    max_segment_size than the top queue, in which case
    blk_rq_count_integrity_sg() against the bottom queue produces a
    different count than the cached rq->nr_integrity_segments inherited
    from the source request by blk_rq_prep_clone().
    
    When the cached count is lower than the bottom queue's actual count,
    blk_rq_map_integrity_sg() trips
    
            BUG_ON(segments > rq->nr_integrity_segments);
    
    on dispatch. The same families of stacked setups that motivated the
    existing nr_phys_segments recompute -- dm-multipath fanning out to
    nvme-rdma in particular -- can produce this.
    
    Mirror the nr_phys_segments handling: when the request carries
    integrity, recompute nr_integrity_segments against the bottom queue
    and reject the request if it exceeds the bottom queue's
    max_integrity_segments. blk_rq_count_integrity_sg() and
    queue_max_integrity_segments() are both already available via
    <linux/blk-integrity.h>, which blk-mq.c includes.
    
    This closes a latent gap in the stacking contract and brings the
    integrity-segment accounting in line with the existing
    phys-segment accounting.
    
    Fixes: 76c313f658d2 ("blk-integrity: improved sg segment mapping")
    Signed-off-by: Casey Chen <cachen@purestorage.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Link: https://patch.msgid.link/20260511212230.27511-1-cachen@purestorage.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

block: rename struct gendisk zone_wplugs_lock field [+ + +]

Author: Damien Le Moal <dlemoal@kernel.org>
Date:   Fri Feb 27 22:19:48 2026 +0900

    block: rename struct gendisk zone_wplugs_lock field
    
    [ Upstream commit b7cbc30e93e3a64ea058230f6d0c764d6d80276f ]
    
    Rename struct gendisk zone_wplugs_lock field to zone_wplugs_hash_lock to
    clearly indicates that this is the spinlock used for manipulating the
    hash table of zone write plugs.
    
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Reviewed-by: Hannes Reinecke <hare@suse.de>
    Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Reviewed-by: Bart Van Assche <bvanassche@acm.org>
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Stable-dep-of: 836efd35c472 ("block: fix handling of dead zone write plugs")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: bnep: Fix UAF read of dev->name [+ + +]

Author: Jann Horn <jannh@google.com>
Date:   Tue May 12 22:15:39 2026 +0200

    Bluetooth: bnep: Fix UAF read of dev->name
    
    commit 59e932ded949fa6f0340bf7c6d7818f962fa4fd2 upstream.
    
    bnep_add_connection() needs to keep holding the bnep_session_sem while
    reading dev->name (just like bnep_get_connlist() does); otherwise the
    bnep_session() thread can concurrently free the net_device, which can for
    example be triggered by a concurrent bnep_del_connection().
    
    (This UAF is fairly uninteresting from a security perspective;
    calling bnep_add_connection() requires passing a capable(CAP_NET_ADMIN)
    check. It also requires completely tearing down a netdev during a fairly
    tight race window.)
    
    Cc: stable@vger.kernel.org
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Signed-off-by: Jann Horn <jannh@google.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: btintel_pcie: Fix incorrect MAC access programming [+ + +]

Author: Kiran K <kiran.k@intel.com>
Date:   Fri May 15 00:32:48 2026 +0530

    Bluetooth: btintel_pcie: Fix incorrect MAC access programming
    
    [ Upstream commit 88365d04fdc821dc4e9eb0cc00fdf6905430d172 ]
    
    btintel_pcie_get_mac_access() and btintel_pcie_release_mac_access()
    were programming STOP_MAC_ACCESS_DIS and XTAL_CLK_REQ in addition to
    the MAC_ACCESS_REQ handshake. These bits are not part of the host
    MAC-access handshake on the supported parts; the driver was
    programming them incorrectly. Drop the writes so the register update
    contains only the bits the controller actually consumes.
    
    Fixes: b9465e6670a2 ("Bluetooth: btintel_pcie: Read hardware exception data")
    Signed-off-by: Kiran K <kiran.k@intel.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: btmtk: fix urb->setup_packet leak in error paths [+ + +]

Author: Jiajia Liu <liujiajia@kylinos.cn>
Date:   Mon May 18 10:24:02 2026 +0800

    Bluetooth: btmtk: fix urb->setup_packet leak in error paths
    
    [ Upstream commit dd1dda6b8d6e1f4376a5b3055a04f0ecbdb4d6bd ]
    
    The setup_packet of control urb is not freed if usb_submit_urb fails or
    the submitted urb is killed. Add free in these two paths.
    
    Fixes: a1c49c434e150 ("Bluetooth: btusb: Add protocol support for MediaTek MT7668U USB devices")
    Signed-off-by: Jiajia Liu <liujiajia@kylinos.cn>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: fix UAF in l2cap_sock_cleanup_listen() vs l2cap_conn_del() [+ + +]

Author: Safa Karakuş <safa.karakus@secunnix.com>
Date:   Sat May 16 21:15:04 2026 +0300

    Bluetooth: fix UAF in l2cap_sock_cleanup_listen() vs l2cap_conn_del()
    
    commit ab1513597c6cf17cd1ad2a21e3b045421b48e022 upstream.
    
    bt_accept_dequeue() unlinks a not-yet-accepted child from the parent
    accept queue and release_sock()s it before returning, so the returned
    sk has no caller reference and is unlocked.
    
    l2cap_sock_cleanup_listen() walks these children on listening-socket
    close.  A concurrent HCI disconnect drives hci_rx_work ->
    l2cap_conn_del() which runs l2cap_chan_del() + l2cap_sock_kill() and
    frees the child sk and its l2cap_chan; cleanup_listen() then uses both:
    
      BUG: KASAN: slab-use-after-free in l2cap_sock_kill
        l2cap_sock_kill / l2cap_sock_cleanup_listen / __x64_sys_close
      Freed by: l2cap_conn_del -> l2cap_sock_close_cb -> l2cap_sock_kill
    
    This is distinct from the two fixes already in this area: commit
    e83f5e24da741 ("Bluetooth: serialize accept_q access") serialises the
    accept_q list/poll and takes temporary refs inside bt_accept_dequeue(),
    and CVE-2025-39860 serialises the userspace close()/accept() race by
    calling cleanup_listen() under lock_sock() in l2cap_sock_release().
    Neither covers l2cap_conn_del() running from hci_rx_work, so this UAF
    still reproduces on current bluetooth/master.
    
    Take the reference at the source: bt_accept_dequeue() does sock_hold()
    while sk is still locked, before release_sock(); callers sock_put().
    cleanup_listen() pins the chan with l2cap_chan_hold_unless_zero() under
    a brief child sk lock (serialising vs l2cap_sock_teardown_cb()), drops
    it before l2cap_chan_lock(), and skips a duplicate l2cap_sock_kill() on
    SOCK_DEAD.  conn->lock is not taken here: cleanup_listen() runs under
    the parent sk lock and that would invert
    conn->lock -> chan->lock -> sk_lock (lockdep).
    
    KASAN/SMP: an unprivileged listen/close vs HCI-disconnect race produced
    12 use-after-free reports per run before this change; 0, and no lockdep
    report, over 1600+ raced iterations after it on bluetooth/master.
    
    Fixes: 15f02b910562 ("Bluetooth: L2CAP: Add initial code for Enhanced Credit Based Mode")
    Cc: stable@vger.kernel.org
    Reported-by: Siwei Zhang <oss@fourdim.xyz>
    Reviewed-by: Siwei Zhang <oss@fourdim.xyz>
    Signed-off-by: Safa Karakuş <safa.karakus@secunnix.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: hci_qca: Convert timeout from jiffies to ms [+ + +]

Author: Shuai Zhang <shuai.zhang@oss.qualcomm.com>
Date:   Mon May 11 21:58:37 2026 +0800

    Bluetooth: hci_qca: Convert timeout from jiffies to ms
    
    commit 375ba7484132662a4a8c7547d088fb6275c00282 upstream.
    
    Since the timer uses jiffies as its unit rather than ms, the timeout value
    must be converted from ms to jiffies when configuring the timer. Otherwise,
    the intended 8s timeout is incorrectly set to approximately 33s.
    
    To improve readability, embed msecs_to_jiffies() directly in the macro
    definitions and drop the _MS suffix from macros that now yield jiffies
    values: MEMDUMP_TIMEOUT, FW_DOWNLOAD_TIMEOUT, IBS_DISABLE_SSR_TIMEOUT,
    CMD_TRANS_TIMEOUT, and IBS_BTSOC_TX_IDLE_TIMEOUT.
    
    IBS_WAKE_RETRANS_TIMEOUT_MS and IBS_HOST_TX_IDLE_TIMEOUT_MS are
    intentionally left unchanged. Their values are stored in the struct fields
    wake_retrans and tx_idle_delay, which hold ms values at runtime and can be
    modified via debugfs. The msecs_to_jiffies() conversion happens at each
    call site against the field value, so it cannot be embedded in the macro.
    
    Wake timer depends on commit c347ca17d62a
    
    Cc: stable@vger.kernel.org
    Fixes: d841502c79e3 ("Bluetooth: hci_qca: Collect controller memory dump during SSR")
    Reviewed-by: Paul Menzel <pmenzel@molgen.mpg.de>
    Acked-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Signed-off-by: Shuai Zhang <shuai.zhang@oss.qualcomm.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: hci_sync: Fix not setting mask for HCI_EVT_LE_ALL_REMOTE_FEATURES_COMPLETE [+ + +]

Author: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
Date:   Thu May 14 09:42:24 2026 -0400

    Bluetooth: hci_sync: Fix not setting mask for HCI_EVT_LE_ALL_REMOTE_FEATURES_COMPLETE
    
    [ Upstream commit 23d528d817a485fe9800a66c9411bd9e3d8a6f63 ]
    
    This fixes not setting the bit for HCI_EVT_LE_ALL_REMOTE_FEATURES_COMPLETE
    when extended features bit is set otherwise the controller may not
    generate HCI_EVT_LE_ALL_REMOTE_FEATURES_COMPLETE causing
    hci_le_read_all_remote_features_sync to timeout waiting for it.
    
    Also remove dead code.
    
    Fixes: a106e50be74b ("Bluetooth: HCI: Add support for LL Extended Feature Set")
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Bluetooth: hci_uart: fix UAFs and race conditions in close and init paths [+ + +]

Author: Mingyu Wang <25181214217@stu.xidian.edu.cn>
Date:   Mon May 18 10:49:49 2026 +0800

    Bluetooth: hci_uart: fix UAFs and race conditions in close and init paths
    
    commit c1bb9336ae6b54a5f6a353c4bd4ed9a4307e429b upstream.
    
    Vulnerabilities leading to Use-After-Free (UAF) and Null Pointer
    Dereference (NPD) conditions were observed in the lifecycle management
    of hci_uart.
    
    The primary issue arises because the workqueues (init_ready and
    write_work) are only flushed/cancelled if the HCI_UART_PROTO_READY
    flag is set during TTY close. If a hangup occurs before setup completes,
    hci_uart_tty_close() skips the teardown of these workqueues and
    proceeds to free the `hu` struct. When the scheduled work executes
    later, it blindly dereferences the freed `hu` struct.
    
    Furthermore, several data races and UAFs were identified in the teardown
    sequence:
    1. Calling hci_uart_flush() from hci_uart_close() without effectively
       disabling write_work causes a race condition where both can concurrently
       double-free hu->tx_skb. This happens because protocol timers can
       concurrently invoke hci_uart_tx_wakeup() and requeue write_work.
    2. Calling hci_free_dev(hdev) before hu->proto->close(hu) causes a UAF
       when vendor specific protocol close callbacks dereference hu->hdev.
    3. In the initialization error paths, failing to take the proto_lock
       write lock before clearing PROTO_READY leads to races with active
       readers. Additionally, hci_uart_tty_receive() accesses hu->hdev
       outside the read lock, leading to UAFs if the initialization error
       path frees hdev concurrently.
    
    Fix these synchronization and lifecycle issues by:
    1. Re-ordering hci_uart_tty_close() to clear HCI_UART_PROTO_READY first,
       followed immediately by a cancel_work_sync(&hu->write_work). Clearing
       the flag locks out concurrent protocol timers from successfully invoking
       hci_uart_tx_wakeup(), effectively rendering the cancellation permanent
       and preventing the tx_skb double-free.
    2. Note: Clearing PROTO_READY early causes hci_uart_close() to skip
       hu->proto->flush(). This is perfectly safe in the tty_close path
       because hu->proto->close() executes shortly after, which intrinsically
       purges all protocol SKB queues and tears down the state.
    3. Relocating hu->proto->close(hu) strictly prior to hci_free_dev(hdev)
       across all close and error paths to prevent vendor-level UAFs.
    4. Moving the hdev->stat.byte_rx increment in hci_uart_tty_receive()
       inside the proto_lock read-side critical section to safely synchronize
       with device unregistration.
    5. Adding cancel_work_sync(&hu->write_work) to hci_uart_close() to safely
       flush the workqueue before hci_uart_flush() is invoked via the HCI core.
    6. Utilizing cancel_work_sync() instead of disable_work_sync() across
       all paths to prevent permanently breaking user-space retry capabilities.
    
    Fixes: 3b799254cf6f ("Bluetooth: hci_uart: Cancel init work before unregistering")
    Cc: stable@vger.kernel.org
    Signed-off-by: Mingyu Wang <25181214217@stu.xidian.edu.cn>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: ISO: drop ISO_END frames received without prior ISO_START [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Fri May 15 07:25:25 2026 +0100

    Bluetooth: ISO: drop ISO_END frames received without prior ISO_START
    
    commit 84c24fb151fc1179355296d7ff29129ac7c42129 upstream.
    
    ISO data PDUs carry a packet-boundary flag indicating START, CONT, END
    or SINGLE. The ISO_CONT branch of iso_recv() guards against a missing
    ISO_START by checking conn->rx_len before touching conn->rx_skb, but
    ISO_END does not.
    
    If a peer sends an ISO_END as the first packet on a fresh ISO
    connection, conn->rx_skb is still NULL and conn->rx_len is zero, so
    skb_put(conn->rx_skb, ...) dereferences NULL and oopses. For BIS,
    where receivers sync to a broadcaster without pairing, any broadcaster
    on the air can trigger this.
    
    Mirror the ISO_CONT check at the top of ISO_END so a stray end fragment
    is logged and dropped instead of crashing the host.
    
    Fixes: ccf74f2390d6 ("Bluetooth: Add BTPROTO_ISO socket type")
    Cc: stable@vger.kernel.org
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: L2CAP: ecred_reconfigure: send packed pdu, not stack pointer [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Mon May 11 08:26:41 2026 -0400

    Bluetooth: L2CAP: ecred_reconfigure: send packed pdu, not stack pointer
    
    commit 3374ef8cf99368a40f7efd51a2a375a4c5dc6f0d upstream.
    
    Commit 1c08108f3014 ("Bluetooth: L2CAP: Avoid -Wflex-array-member-not-at-end
    warnings") converted the on-stack request PDU in l2cap_ecred_reconfigure()
    from an explicit packed struct to DEFINE_RAW_FLEX(), but did not adjust the
    size and source-pointer arguments to l2cap_send_cmd():
    
      -    struct {
      -            struct l2cap_ecred_reconf_req req;
      -            __le16 scid;
      -    } pdu;
      +    DEFINE_RAW_FLEX(struct l2cap_ecred_reconf_req, pdu, scid, 1);
           ...
           l2cap_send_cmd(conn, chan->ident, L2CAP_ECRED_RECONF_REQ,
                          sizeof(pdu), &pdu);
    
    After the conversion, DEFINE_RAW_FLEX() expands to declare an anonymous
    union pdu_u plus a local pointer "pdu" pointing at it. Therefore:
    
      - sizeof(pdu) is now sizeof(struct l2cap_ecred_reconf_req *) = 8 on
        64-bit (4 on 32-bit), not the 6 bytes of (mtu, mps, scid[1]).
      - &pdu is the address of the local pointer's stack storage, not the
        address of the request payload.
    
    l2cap_send_cmd() forwards (data, count) to l2cap_build_cmd(), which calls
    skb_put_data(skb, data, count). The L2CAP_ECRED_RECONFIGURE_REQ packet
    body therefore contains 8 bytes copied from the kernel stack starting at
    &pdu -- the 8 bytes overlap the pdu pointer's value, leaking a kernel
    stack address to the paired Bluetooth peer. The intended (mtu, mps, scid)
    fields are not transmitted at all, so the peer rejects the request as
    malformed and the L2CAP_ECRED_RECONFIGURE feature itself has been broken
    for the local-side initiator since the introducing commit landed.
    
    The sibling site l2cap_ecred_conn_req() in the same commit was converted
    correctly (sizeof(*pdu) + len, pdu); only this site was missed.
    
    Restore the original semantics: pass the full flex-struct size via
    struct_size(pdu, scid, 1) and the pdu pointer (the struct address) as
    the source.
    
    Validated on a stock 7.0-based host kernel via the real call path:
    setsockopt(SOL_BLUETOOTH, BT_RCVMTU, ...) on a BT_CONNECTED
    L2CAP_MODE_EXT_FLOWCTL socket emits an L2CAP_ECRED_RECONFIGURE_REQ
    whose body is 8 bytes (the on-stack pdu local's value) rather than
    the expected 6. Three captures from fresh socket / fresh hciemu peer
    on the same host -- low bytes vary per call, high 0xffff confirms a
    kernel virtual address (KASLR-randomised stack slot, not a fixed
    string):
    
      RECONF_REQ body (ident=0x02 len=8): 42 fb 54 af 0e ca ff ff
      RECONF_REQ body (ident=0x02 len=8): 52 3d 2e af 0e ca ff ff
      RECONF_REQ body (ident=0x02 len=8): b2 fc 5b af 0e ca ff ff
    
    After this patch the body is 6 bytes carrying the expected
    little-endian (mtu, mps, scid).
    
    Cc: stable@vger.kernel.org
    Fixes: 1c08108f3014 ("Bluetooth: L2CAP: Avoid -Wflex-array-member-not-at-end warnings")
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: MGMT: validate Add Extended Advertising Data length [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Fri May 15 10:38:19 2026 -0400

    Bluetooth: MGMT: validate Add Extended Advertising Data length
    
    commit d3f7d17960ed50df3a6709c5158caff989c8c905 upstream.
    
    MGMT_OP_ADD_EXT_ADV_DATA is registered as a variable-length command,
    with MGMT_ADD_EXT_ADV_DATA_SIZE as the fixed header size.  The handler
    then uses cp->adv_data_len and cp->scan_rsp_len to validate and copy
    cp->data, but it never checks that those bytes are part of the mgmt
    command payload.
    
    A short command can therefore make add_ext_adv_data() pass an
    out-of-bounds pointer into tlv_data_is_valid().  If the bytes beyond
    the command buffer are addressable, they can also be copied into the
    advertising instance as scan response data, where the caller can read
    them back via MGMT_OP_GET_ADV_INSTANCE.  The trigger requires
    CAP_NET_ADMIN in the initial user namespace; KASAN reports an 8-byte
    slab-out-of-bounds read.
    
    Reject commands whose length does not match the fixed header plus both
    advertising data lengths before parsing cp->data.
    
    Fixes: 12410572833a ("Bluetooth: Break add adv into two mgmt commands")
    Cc: stable@vger.kernel.org
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Bluetooth: serialize accept_q access [+ + +]

Author: Jiexun Wang <wangjiexun2025@gmail.com>
Date:   Wed May 6 19:43:30 2026 +0800

    Bluetooth: serialize accept_q access
    
    commit e83f5e24da741fa9405aeeff00b08c5ee7c37b88 upstream.
    
    bt_sock_poll() walks the accept queue without synchronization, while
    child teardown can unlink the same socket and drop its last reference.
    The unsynchronized accept queue walk has existed since the initial
    Bluetooth import.
    
    Protect accept_q with a dedicated lock for queue updates and polling.
    Also rework bt_accept_dequeue() to take temporary child references under
    the queue lock before dropping it and locking the child socket.
    
    Fixes: 1da177e4c3f41524e886b7f1b8a0c1fc7321cac2 ("Linux-2.6.12-rc2")
    Cc: stable@vger.kernel.org
    Reported-by: Jann Horn <jannh@google.com>
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Jiexun Wang <wangjiexun2025@gmail.com>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    Signed-off-by: Jiexun Wang <wangjiexun2025@gmail.com>
    Reviewed-by: Jann Horn <jannh@google.com>
    Signed-off-by: Luiz Augusto von Dentz <luiz.von.dentz@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

bpf, skmsg: fix verdict sk_data_ready racing with ktls rx [+ + +]

Author: Xingwang Xiang <v3rdant.xiang@gmail.com>
Date:   Sun May 17 23:56:26 2026 +0900

    bpf, skmsg: fix verdict sk_data_ready racing with ktls rx
    
    [ Upstream commit ddf8029623a1af20e984c040e89ff918158397ab ]
    
    sk_psock_strp_data_ready() already checks tls_sw_has_ctx_rx() and
    defers to psock->saved_data_ready when a TLS RX context is present,
    avoiding a conflict with the TLS strparser's ownership of the receive
    queue (commit e91de6afa81c, "bpf: Fix running sk_skb program types
    with ktls").
    
    sk_psock_verdict_data_ready() has no equivalent guard.  When a socket
    is inserted into a sockmap (BPF_SK_SKB_VERDICT) before TLS RX is
    configured, tls_sw_strparser_arm() saves sk_psock_verdict_data_ready
    as rx_ctx->saved_data_ready.  On data arrival:
    
      tls_data_ready -> tls_strp_data_ready -> tls_rx_msg_ready
        -> saved_data_ready() = sk_psock_verdict_data_ready()
          -> tcp_read_skb() drains sk_receive_queue via __skb_unlink()
             without calling tcp_eat_skb(), so copied_seq is not advanced.
    
    tls_strp_msg_load() then finds tcp_inq() >= full_len (stale), calls
    tcp_recv_skb() on the now-empty queue, hits WARN_ON_ONCE(!first), and
    returns with rx_ctx->strp.anchor.frag_list pointing at a psock-owned
    (potentially freed) skb.  tls_decrypt_sg() subsequently walks that
    frag_list: use-after-free.
    
    Apply the same fix as sk_psock_strp_data_ready(): if a TLS RX context
    is present, call psock->saved_data_ready (sock_def_readable) to wake
    recv() waiters and return immediately, leaving the receive queue
    untouched.  TLS retains sole ownership of the queue and decrypts the
    record normally through tls_sw_recvmsg().
    
    Fixes: ef5659280eb1 ("bpf, sockmap: Allow skipping sk_skb parser program")
    Signed-off-by: Xingwang Xiang <v3rdant.xiang@gmail.com>
    Link: https://patch.msgid.link/20260517145630.20521-2-v3rdant.xiang@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

bridge: mcast: Fix a possible use-after-free when removing a bridge port [+ + +]

Author: Ido Schimmel <idosch@nvidia.com>
Date:   Sun May 17 15:11:21 2026 +0300

    bridge: mcast: Fix a possible use-after-free when removing a bridge port
    
    [ Upstream commit 4df78ff02629c7729168f0696a7a2123c389818d ]
    
    When per-VLAN multicast snooping is enabled, the bridge iterates over
    all the bridge ports, disables the per-port multicast context on each
    port and enables the per-{port, VLAN} multicast contexts instead. The
    reverse happens when per-VLAN multicast snooping is disabled.
    
    When global multicast snooping is enabled, the bridge iterates over all
    the bridge ports and enables the per-port multicast context on each
    port. The reverse happens when multicast snooping is disabled.
    
    The above scheme can result in a situation where both types of contexts
    (per-port and per-{port, VLAN}) are enabled on a single bridge port:
    
     # ip link add name br1 up type bridge mcast_snooping 1 mcast_querier 1 vlan_filtering 1
     # ip link add name dummy1 up master br1 type dummy
     # ip link set dev br1 type bridge mcast_vlan_snooping 1
     # ip link set dev br1 type bridge mcast_snooping 0
     # ip link set dev br1 type bridge mcast_snooping 1
    
    This is not intended and it is a problem since the commit cited below.
    Prior to this commit, when removing a bridge port,
    br_multicast_disable_port() would disable the per-port multicast context
    and the per-{port, VLAN} multicast contexts would get disabled when
    flushing VLANs.
    
    After this commit, br_multicast_disable_port() only disables the
    per-port multicast context if per-VLAN multicast snooping is disabled.
    If both types of contexts were enabled on the port when it was removed,
    the per-port multicast context would remain enabled when freeing the
    bridge port, leading to a use-after-free [1].
    
    Fix by preventing the bridge from enabling / disabling the per-port
    multicast contexts when toggling global multicast snooping if per-VLAN
    multicast snooping is enabled.
    
    [1]
    ODEBUG: free active (active state 0) object: ffff88810f8bda78 object type: timer_list hint: br_ip6_multicast_port_query_expired (net/bridge/br_multicast.c:1927)
    WARNING: lib/debugobjects.c:629 at debug_print_object+0x1b1/0x3e0, CPU#5: swapper/5/0
    [...]
    Call Trace:
    <IRQ>
    __debug_check_no_obj_freed (lib/debugobjects.c:1116)
    kfree (mm/slub.c:2620 mm/slub.c:6250 mm/slub.c:6565)
    kobject_cleanup (lib/kobject.c:689)
    rcu_do_batch (kernel/rcu/tree.c:2617)
    rcu_core (kernel/rcu/tree.c:2869)
    handle_softirqs (kernel/softirq.c:622)
    __irq_exit_rcu (kernel/softirq.c:656 kernel/softirq.c:496 kernel/softirq.c:735)
    irq_exit_rcu (kernel/softirq.c:752)
    sysvec_apic_timer_interrupt (arch/x86/kernel/apic/apic.c:1061 (discriminator 47) arch/x86/kernel/apic/apic.c:1061 (discriminator 47))
    </IRQ>
    
    Fixes: 4b30ae9adb04 ("net: bridge: mcast: re-implement br_multicast_{enable, disable}_port functions")
    Reported-by: syzbot+ae231e0552fa77b26ea1@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/netdev/87qznowlfs.ffs@tglx/
    Reported-by: Thomas Gleixner <tglx@kernel.org>
    Acked-by: Nikolay Aleksandrov <nikolay@nvidia.com>
    Signed-off-by: Ido Schimmel <idosch@nvidia.com>
    Link: https://patch.msgid.link/20260517121122.188333-2-idosch@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

btrfs: check for subvolume before deleting squota qgroup [+ + +]

Author: Boris Burkov <boris@bur.io>
Date:   Mon May 11 13:07:11 2026 -0700

    btrfs: check for subvolume before deleting squota qgroup
    
    [ Upstream commit 1e92637722ae4bd417f7a37e8d1485dc23b93935 ]
    
    The invariant that we want to maintain with subvolume qgroups is that
    the qgroup can only be deleted if there is no root. With squotas, we
    thought that it was sufficient to just check the usage, because we
    assumed that deleting a subvolume will drive it's qgroups usage to 0,
    and thus 0 usage implies no subvolume.
    
    However, this is false, for two reasons:
    
    - A subvol whose extents are all from before squotas was enabled.
    - A subvol that was created in this transaction and for which we have
      not yet run any delayed refs.
    
    In both cases, deleting the qgroup breaks the desired invariant and we
    are left with a subvolume with no qgroup but squotas are enabled.
    
    Fix this by unifying the deletion check logic between full qgroups and
    squotas. Squotas do all the same checks *and* the additional usage == 0
    check, which is the one extra rule peculiar to squotas.
    
    Link: https://lore.kernel.org/linux-btrfs/adnBhWfJQ1n3hZC8@merlins.org/
    Fixes: a8df35619948 ("btrfs: forbid deleting live subvol qgroup")
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Boris Burkov <boris@bur.io>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

btrfs: fix squota accounting during enable generation [+ + +]

Author: Boris Burkov <boris@bur.io>
Date:   Mon May 11 19:53:46 2026 -0700

    btrfs: fix squota accounting during enable generation
    
    [ Upstream commit d7c600554816b8ef70adffe078a0e360c055d82b ]
    
    The first transaction that enables squotas is special and a bit tricky.
    We have to set BTRFS_FS_QUOTA_ENABLED after the transaction to avoid a
    deadlock, so any delayed refs that run before we set the bit are not
    squota accounted. For data this is fine, we don't get an owner_ref, so
    there is no real harm, it's as if the extent predated squotas. However
    for metadata, the tree block will have gen == enable_gen so when we free
    it later, we will decrement the squota accounting, which can result in
    an underflow. Before it is freed, btrfs check shows errors, as we have
    mismatched usage between the node generations/owners and the squota
    values.
    
    There are two angles to this fix:
    
    1. For extents that come in delayed_refs that run during the
       enable_gen transaction, we must actually set enable_gen to the *next*
       transaction. That is the first transaction that we can really
       properly account in any way.
    2. For extents that come in between the end of our transaction handle
       and the time we set the BTRFS_FS_QUOTA_ENABLED bit, we need an
       additional bit, BTRFS_FS_SQUOTA_ENABLING which only affects recording
       squota deltas, so we do pick up those extents. Otherwise, we would
       miss them, even for enable_gen + 1.
    
    Fixes: bd7c1ea3a302 ("btrfs: qgroup: check generation when recording simple quota delta")
    Reviewed-by: Qu Wenruo <wqu@suse.com>
    Signed-off-by: Boris Burkov <boris@bur.io>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

btrfs: tracepoints: fix sleep while in atomic context in btrfs_sync_file() [+ + +]

Author: Filipe Manana <fdmanana@suse.com>
Date:   Tue Apr 28 16:58:56 2026 +0100

    btrfs: tracepoints: fix sleep while in atomic context in btrfs_sync_file()
    
    [ Upstream commit c73370c677646e86fc4b1780fb07027bdf847375 ]
    
    The trace event btrfs_sync_file() is called in an atomic context (all trace
    events are) and its call to dput(), which is needed due to the call to
    dget_parent(), can sleep, triggering a kernel splat.
    
    This can be reproduced by enabling the trace event and running btrfs/056
    from fstests for example. The splat shown in dmesg is the following:
    
      [53.919] BUG: sleeping function called from invalid context at fs/dcache.c:970
      [53.947] in_atomic(): 1, irqs_disabled(): 0, non_block: 0, pid: 32773, name: xfs_io
      [53.988] preempt_count: 2, expected: 0
      [53.967] RCU nest depth: 0, expected: 0
      [53.943] Preemption disabled at:
      [53.944] [<0000000000000000>] 0x0
      [54.078] CPU: 0 UID: 0 PID: 32773 Comm: xfs_io Tainted: G        W           7.1.0-rc1-btrfs-next-232+ #1 PREEMPT(full)
      [54.070] Tainted: [W]=WARN
      [54.071] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.2-0-gea1b7a073390-prebuilt.qemu.org 04/01/2014
      [54.072] Call Trace:
      [54.074]  <TASK>
      [54.076]  dump_stack_lvl+0x56/0x80
      [54.079]  __might_resched.cold+0xd6/0x10f
      [54.072]  dput.part.0+0x24/0x110
      [54.078]  trace_event_raw_event_btrfs_sync_file+0x75/0x140 [btrfs]
      [54.089]  btrfs_sync_file+0x1ed/0x530 [btrfs]
      [54.087]  ? __handle_mm_fault+0x8ae/0xed0
      [54.089]  btrfs_do_write_iter+0x172/0x210 [btrfs]
      [54.091]  vfs_write+0x21f/0x450
      [54.094]  __x64_sys_pwrite64+0x8d/0xc0
      [54.096]  ? do_user_addr_fault+0x20c/0x670
      [54.099]  do_syscall_64+0x60/0xf20
      [54.092]  ? clear_bhb_loop+0x60/0xb0
      [54.094]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    So stop using dget_parent() and dput() and access the parent dentry
    directly as dentry->d_parent. This is also what ext4 is doing in
    its equivalent trace event ext4_sync_file_enter().
    
    Fixes: a85b46db143f ("btrfs: tracepoints: get correct superblock from dentry in event btrfs_sync_file()")
    Reviewed-by: Boris Burkov <boris@bur.io>
    Signed-off-by: Filipe Manana <fdmanana@suse.com>
    Reviewed-by: David Sterba <dsterba@suse.com>
    Signed-off-by: David Sterba <dsterba@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cachefiles: Fix error return when vfs_mkdir() fails [+ + +]

Author: Hongling Zeng <zenghongling@kylinos.cn>
Date:   Wed May 13 18:34:06 2026 +0800

    cachefiles: Fix error return when vfs_mkdir() fails
    
    [ Upstream commit 8a220d1c312c66194f4a33dd52d1fba42bc2b341 ]
    
    When vfs_mkdir() fails, the error code is not extracted from the
    returned error pointer. This causes mkdir_error to be reached with
    ret=0, which leads to returning ERR_PTR(0) (NULL) instead of a
    proper error pointer.
    
    Fix this by extracting the error code from the error pointer when
    vfs_mkdir() fails.
    
    Fixes: 406fad7698f5 ("cachefiles: Fix oops in vfs_mkdir from cachefiles_get_directory")
    Signed-off-by: Hongling Zeng <zenghongling@kylinos.cn>
    Link: https://patch.msgid.link/20260513103406.202320-1-zenghongling@kylinos.cn
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cgroup/rstat: validate cpu before css_rstat_cpu() access [+ + +]

Author: Qing Ming <a0yami@mailbox.org>
Date:   Sat May 16 15:08:49 2026 +0800

    cgroup/rstat: validate cpu before css_rstat_cpu() access
    
    [ Upstream commit 8817005efbdfdf5d4e4814cb5dc52b53d12917d7 ]
    
    css_rstat_updated() is exposed as a BPF kfunc and accepts a
    caller-provided cpu argument. The function uses cpu for per-cpu rstat
    lookups without checking whether it refers to a valid possible CPU.
    
    A BPF iter/cgroup program with CAP_BPF and CAP_PERFMON can pass an
    invalid cpu value. On an unfixed UBSCAN_BOUNDS test kernel, cpu ==
    0x7fffffff triggers:
    
      UBSAN: array-index-out-of-bounds in kernel/cgroup/rstat.c:31:9
      index 2147483647 is out of range for type 'long unsigned int [64]'
      Call Trace:
        css_rstat_updated
        bpf_iter_run_prog
        cgroup_iter_seq_show
        bpf_seq_read
    
    Add cpu validation to the BPF-facing css_rstat_updated() kfunc and
    move the common implementation to __css_rstat_updated() for in-kernel
    callers.
    
    Fixes: a319185be9f5 ("cgroup: bpf: enable bpf programs to integrate with rstat")
    Signed-off-by: Qing Ming <a0yami@mailbox.org>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cgroup: rstat: relax NMI guard after switch to try_cmpxchg [+ + +]

Author: Cunlong Li <shenxiaogll@gmail.com>
Date:   Wed May 20 11:30:54 2026 +0800

    cgroup: rstat: relax NMI guard after switch to try_cmpxchg
    
    [ Upstream commit 22572dbcd3486e6c4dced877125bbf50e4e24edf ]
    
    Commit 36df6e3dbd7e ("cgroup: make css_rstat_updated nmi safe") used
    this_cpu_cmpxchg() for the lockless insertion, and therefore required
    both ARCH_HAVE_NMI_SAFE_CMPXCHG and ARCH_HAS_NMI_SAFE_THIS_CPU_OPS in
    the NMI guard: on archs without the latter, this_cpu_cmpxchg() falls
    back to "local_irq_save() + plain cmpxchg", and local_irq_save()
    cannot mask NMIs.
    
    Commit 3309b63a2281 ("cgroup: rstat: use LOCK CMPXCHG in
    css_rstat_updated") later replaced this_cpu_cmpxchg() with plain
    try_cmpxchg() to fix cross-CPU lockless-list corruption, but left the
    NMI guard untouched.  After that switch, css_rstat_updated() no longer
    performs any this_cpu_*() RMW operations and only relies on the arch
    having NMI-safe cmpxchg, so ARCH_HAS_NMI_SAFE_THIS_CPU_OPS is no
    longer required in the guard.
    
    Relax the guard accordingly so that archs which have HAVE_NMI and
    ARCH_HAVE_NMI_SAFE_CMPXCHG but not ARCH_HAS_NMI_SAFE_THIS_CPU_OPS
    (e.g. sparc, powerpc on PPC64/BOOK3S) can benefit from the existing
    CONFIG_MEMCG_NMI_SAFETY_REQUIRES_ATOMIC path.  Without this, the css
    is never queued in NMI on those archs, and the atomics staged by
    account_{slab,kmem}_nmi_safe() are not drained by flush_nmi_stats().
    
    Fixes: 3309b63a2281 ("cgroup: rstat: use LOCK CMPXCHG in css_rstat_updated")
    Signed-off-by: Cunlong Li <shenxiaogll@gmail.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cifs: client: stage smb3_reconfigure() updates and restore ctx on failure [+ + +]

Author: DaeMyung Kang <charsyam@gmail.com>
Date:   Wed May 13 22:26:22 2026 +0900

    cifs: client: stage smb3_reconfigure() updates and restore ctx on failure
    
    [ Upstream commit ab26dfeba278b0efbcea012f1698cf524d9b5695 ]
    
    smb3_reconfigure() moves strings out of cifs_sb->ctx before the
    multichannel update, so a later failure can leave the live context
    with NULL strings or options that do not match the session.
    
    Stage the new ctx separately, commit it only on success, and restore
    the snapshot on failure. Also make smb3_sync_session_ctx_passwords()
    all-or-nothing.
    
    Commit session passwords before channel updates so newly added channels
    authenticate with the staged credentials.
    
    Fixes: ef529f655a2c ("cifs: client: allow changing multichannel mount options on remount")
    Reported-by: RAJASI MANDAL <rajasimandalos@gmail.com>
    Closes: https://lore.kernel.org/lkml/CAEY6_V1+dzW3OD5zqXhsWyXwrDTrg5tAMGZ1AJ7_GAuRE+aevA@mail.gmail.com/
    Link: https://lore.kernel.org/lkml/xkr2dlvgibq5j6gkcxd3yhhnj4atgxw2uy4eug2pxm7wy7nbms@iq6cf5taa65v/
    Reviewed-by: Henrique Carvalho <henrique.carvalho@suse.com>
    Signed-off-by: DaeMyung Kang <charsyam@gmail.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cifs: Fix busy dentry used after unmounting [+ + +]

Author: Zhihao Cheng <chengzhihao1@huawei.com>
Date:   Tue May 19 17:18:05 2026 +0800

    cifs: Fix busy dentry used after unmounting
    
    commit c68337442f03953237a94577beb468ab2662a851 upstream.
    
    Since commit 340cea84f691c ("cifs: open files should not hold ref on
    superblock"), cifs file only holds the dentry ref_cnt, the cifs file
    close work(cfile->deferred) could be executed after unmounting, which
    will trigger a warning in generic_shutdown_super:
     BUG: Dentry 00000000a14a6845{i=c,n=file}  still in use (1) [unmount of
     cifs cifs]
    
    The detailed processs is:
       process A           process B           kworker
     fd = open(PATH)
      vfs_open
       file->__f_path = *path // dentry->d_lockref.count = 1
       cifs_open
        cifs_new_fileinfo
         cfile->dentry = dget(dentry) // dentry->d_lockref.count = 2
     close(fd)
      __fput
      cifs_close
       queue_delayed_work(deferredclose_wq, cfile->deferred)
      dput(dentry) // dentry->d_lockref.count = 1
                                             smb2_deferred_work_close
                                              _cifsFileInfo_put
                                               list_del(&cifs_file->flist)
                        umount
                         cleanup_mnt
                          deactivate_super
                           cifs_kill_sb
                            cifs_close_all_deferred_files_sb
                             cifs_close_all_deferred_files
                              // cannot find cfile, skip _cifsFileInfo_put
                            kill_anon_super
                             generic_shutdown_super
                              shrink_dcache_for_umount
                               umount_check
                                WARN ! // dentry->d_lockref.count = 1
                                               cifsFileInfo_put_final
                                                dput(cifs_file->dentry)
                                                // dentry->d_lockref.count = 0
    
    Fix it by flushing 'deferredclose_wq' before calling kill_anon_super.
    
    Fetch a reproducer in https://bugzilla.kernel.org/show_bug.cgi?id=221548.
    
    Fixes: 340cea84f691c ("cifs: open files should not hold ref on superblock")
    Cc: stable@vger.kernel.org
    Reviewed-by: Shyam Prasad N <sprasad@microsoft.com>
    Signed-off-by: Zhihao Cheng <chengzhihao1@huawei.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

cifs: Fix undefined variables [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Mon May 18 22:13:09 2026 +0100

    cifs: Fix undefined variables
    
    [ Upstream commit 8cf8b5ae8e093132b0dce0a932af10c9ef077936 ]
    
    Fix a couple of undefined variables introduced by the patch to fix tearing
    on ->remote_i_size and ->zero_point.  For some reason, make W=1 with gcc
    doesn't give undefined variable warnings (but clang does).
    
    Fixes: 2c8f4742bb76 ("netfs: Fix potential for tearing in ->remote_i_size and ->zero_point")
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202605031459.eX5UbO3K-lkp@intel.com/
    Closes: https://lore.kernel.org/oe-kbuild-all/202605021450.ca5QGqLH-lkp@intel.com/
    cc: Steve French <sfrench@samba.org>
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: Christian Brauner <brauner@kernel.org>
    cc: linux-cifs@vger.kernel.org
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

cpufreq: intel_pstate: Use correct scaling factor on Raptor Lake-E [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue May 12 21:20:30 2026 +0200

    cpufreq: intel_pstate: Use correct scaling factor on Raptor Lake-E
    
    commit 0e7c710478b3089cdfe8669347f77b163e836c4f upstream.
    
    Raptor Lake-E has the same processor ID as Raptor Lake-S, so there is
    an entry in intel_hybrid_scaling_factor[] for it.  It does not contain
    E-cores though and hybrid_get_cpu_type() returns 0 for its P-cores, so
    they get the default "core" scaling factor.  However, the original
    Raptor Lake scaling factor for P-cores still needs to be used for
    mapping the HWP performance levels of the P-cores in Raptor Lake-E to
    frequency, as though they were part of a real hybrid system.
    
    To address this, update hwp_get_cpu_scaling() to return
    hybrid_scaling_factor, which is the P-core scaling factor
    retrieved from intel_hybrid_scaling_factor[], for all CPUs
    that are not enumerated as E-cores.
    
    Fixes: 9b18d536b124 ("cpufreq: intel_pstate: Use CPPC to get scaling factors")
    Link: https://lore.kernel.org/all/20260511235328.2018458-1-srinivas.pandruvada@linux.intel.com/
    Reported-by: Henry Tseng <henrytseng@qnap.com>
    Closes: https://lore.kernel.org/linux-pm/20260508063032.3248602-1-henrytseng@qnap.com/
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Cc: All applicable <stable@vger.kernel.org>
    Link: https://patch.msgid.link/4523296.ejJDZkT8p0@rafael.j.wysocki
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

crypto/krb5, rxrpc: Fix lack of pre-decrypt/pre-verify length checks [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Sat May 16 00:05:13 2026 +0100

    crypto/krb5, rxrpc: Fix lack of pre-decrypt/pre-verify length checks
    
    [ Upstream commit 2b50aceafe6606ea52ed42aadd1b4d44a188aade ]
    
    Change the krb5 crypto library to provide facilities to precheck the length
    of the message about to be decrypted or verified.
    
    Fix AF_RXRPC to make use of this to validate DATA packets secured with
    RxGK.
    
    Fixes: 9d1d2b59341f ("rxrpc: rxgk: Implement the yfs-rxgk security class (GSSAPI)")
    Closes: https://sashiko.dev/#/patchset/20260511160753.607296-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    cc: Herbert Xu <herbert@gondor.apana.org.au>
    cc: Simon Horman <horms@kernel.org>
    cc: Chuck Lever <chuck.lever@oracle.com>
    cc: linux-afs@lists.infradead.org
    Reviewed-by: Jeffrey Altman <jaltman@auristor.com>
    Tested-by: Marc Dionne <marc.dionne@auristor.com>
    Link: https://patch.msgid.link/20260515230516.2718212-2-dhowells@redhat.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

device property: set fwnode->secondary to NULL in fwnode_init() [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Wed May 6 13:57:00 2026 +0200

    device property: set fwnode->secondary to NULL in fwnode_init()
    
    commit 215c90ee656114f5e8c32408228d97082f8e0eef upstream.
    
    If a firmware node is allocated on the stack (for instance: temporary
    software node whose life-time we control) or on the heap - but using a
    non-zeroing allocation function - and initialized using fwnode_init(),
    its secondary pointer will contain uninitalized memory which likely will
    be neither NULL nor IS_ERR() and so may end up being dereferenced (for
    example: in dev_to_swnode()). Set fwnode->secondary to NULL on
    initialization.
    
    Cc: stable <stable@kernel.org>
    Fixes: 01bb86b380a3 ("driver core: Add fwnode_init()")
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Reviewed-by: Rafael J. Wysocki (Intel) <rafael@kernel.org>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Reviewed-by: Sakari Ailus <sakari.ailus@linux.intel.com>
    Link: https://patch.msgid.link/20260506115701.23035-1-bartosz.golaszewski@oss.qualcomm.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

dma-mapping: move dma_map_resource() sanity check into debug code [+ + +]

Author: Jianpeng Chang <jianpeng.chang.cn@windriver.com>
Date:   Wed May 13 15:22:09 2026 +0800

    dma-mapping: move dma_map_resource() sanity check into debug code
    
    [ Upstream commit af0c3f05866237f7592219bfe05387bc3bfc99b5 ]
    
    dma_map_resource() uses pfn_valid() to ensure the range is not RAM.
    However, pfn_valid() only checks for availability of the memory map for
    a PFN but it does not ensure that the PFN is actually backed by RAM. On
    ARM64 with SPARSEMEM (128MB section granularity), MMIO addresses that
    share a section with RAM will falsely trigger the WARN_ON_ONCE and cause
    dma_map_resource() to return DMA_MAPPING_ERROR.
    
    This causes a WARNING on Raspberry Pi 4 during spi_bcm2835 probe because
    the SPI FIFO register (0xfe204004) falls in the same sparsemem section
    as the end of RAM (0xf8000000-0xfbffffff), both in section 31
    (0xf8000000-0xffffffff).
    
    Move the sanity check from dma_map_resource() into debug_dma_map_phys()
    and replace the unreliable pfn_valid() with pfn_valid() &&
    !PageReserved(), which correctly identifies actual usable RAM without
    false positives for MMIO regions that happen to have struct pages.
    
    Since dma_map_resource() is dma_map_phys(DMA_ATTR_MMIO), the check
    applies equally to both APIs. Any non-reserved page represents kernel
    memory to a sufficient degree that using DMA_ATTR_MMIO on it is almost
    certainly wrong and risks breaking coherency on non-coherent platforms.
    ZONE_DEVICE pages used for PCI P2P DMA (MEMORY_DEVICE_PCI_P2PDMA) have
    PageReserved set, so they will not trigger a false positive.
    
    The check no longer blocks the mapping and uses err_printk() to
    integrate with dma-debug filtering.
    
    Fixes: f7326196a781 ("dma-mapping: export new dma_*map_phys() interface")
    Reviewed-by: Robin Murphy <robin.murphy@arm.com>
    Signed-off-by: Jianpeng Chang <jianpeng.chang.cn@windriver.com>
    Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
    Signed-off-by: Marek Szyprowski <m.szyprowski@samsung.com>
    Link: https://lore.kernel.org/r/20260513072209.1486986-1-jianpeng.chang.cn@windriver.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Documentation: intel_pstate: Fix description of asymmetric packing with SMT [+ + +]

Author: Ricardo Neri <ricardo.neri-calderon@linux.intel.com>
Date:   Fri Apr 24 14:41:13 2026 -0700

    Documentation: intel_pstate: Fix description of asymmetric packing with SMT
    
    [ Upstream commit ee047fc7a2da90554410128195058c409a391d43 ]
    
    Patchset [1], including commits
    
     046a5a95c3b0 ("x86/sched/itmt: Give all SMT siblings of a core the same priority")
     995998ebdebd ("x86/sched: Remove SD_ASYM_PACKING from the SMT domain flags")
    
    overhauled asym_packing handling in the scheduler on x86 hybrid
    processors with SMT. It removed SD_ASYM_PACKING from the x86 SMT
    scheduling domain and made all SMT siblings of a core share the same
    priority. As a result, asym_packing operates only across physical
    cores, spreading tasks among them and only using idle SMT siblings
    once all physical cores are busy.
    
    Fix the documentation to reflect this behavior.
    
    Fixes: f20af84c29b2 ("cpufreq: intel_pstate: Document hybrid processor support")
    Link: https://lore.kernel.org/r/20230406203148.19182-1-ricardo.neri-calderon@linux.intel.com [1]
    Signed-off-by: Ricardo Neri <ricardo.neri-calderon@linux.intel.com>
    [ rjw: Changelog edits ]
    Link: https://patch.msgid.link/20260424-rneri-fix-intel-pstate-doc-smt-asym-packing-v1-1-317bf7d5c362@linux.intel.com
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

Documentation: laptops: Update documentation for uniwill laptops [+ + +]

Author: Werner Sembach <wse@tuxedocomputers.com>
Date:   Tue Mar 24 21:32:12 2026 +0100

    Documentation: laptops: Update documentation for uniwill laptops
    
    [ Upstream commit 9ec6bf62cf98e30c7126a0f51ee7cdf2e8d458b6 ]
    
    Adds short description for two new sysfs entries, ctgp_offset and
    usb_c_power_priority, to the documentation of uniwill laptops.
    
    Reviewed-by: Armin Wolf <W_Armin@gmx.de>
    Reviewed-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Werner Sembach <wse@tuxedocomputers.com>
    Link: https://patch.msgid.link/20260324203413.454361-6-wse@tuxedocomputers.com
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Stable-dep-of: 26cbe119f99c ("platform/x86: uniwill-laptop: Do not enable the charging limit even when forced")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drivers/base/memory: fix memory block reference leak in poison accounting [+ + +]

Author: Muchun Song <muchun.song@linux.dev>
Date:   Tue Apr 28 16:52:18 2026 +0800

    drivers/base/memory: fix memory block reference leak in poison accounting
    
    commit 03a2cc1756a0570f887d624cd6c535ea0cbd4951 upstream.
    
    memblk_nr_poison_inc() and memblk_nr_poison_sub() look up a memory block
    via find_memory_block_by_id(), which acquires a reference to the memory
    block device.
    
    Both helpers use the returned memory block without dropping that
    reference, leaking the device reference on each successful lookup.  Drop
    the reference after updating nr_hwpoison.
    
    Link: https://lore.kernel.org/20260428085219.1316047-3-songmuchun@bytedance.com
    Fixes: 5033091de814 ("mm/hwpoison: introduce per-memory_block hwpoison counter")
    Signed-off-by: Muchun Song <songmuchun@bytedance.com>
    Reviewed-by: Miaohe Lin <linmiaohe@huawei.com>
    Acked-by: Oscar Salvador <osalvador@suse.de>
    Acked-by: David Hildenbrand (Arm) <david@kernel.org>
    Cc: Danilo Krummrich <dakr@kernel.org>
    Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Cc: "Huang, Ying" <huang.ying.caritas@gmail.com>
    Cc: Naoya Horiguchi <nao.horiguchi@gmail.com>
    Cc: "Rafael J. Wysocki" <rafael@kernel.org>
    Cc: Vishal Verma <vishal.l.verma@intel.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Fix integer overflow in bios_get_image() [+ + +]

Author: Harry Wentland <harry.wentland@amd.com>
Date:   Mon May 4 11:14:45 2026 -0400

    drm/amd/display: Fix integer overflow in bios_get_image()
    
    commit cd86529ec61474a38c3837fb7823790a7c3f8cce upstream.
    
    [Why&How]
    The bounds check in bios_get_image() computes 'offset + size' using
    unsigned 32-bit arithmetic before comparing against bios_size. If a
    VBIOS image contains a near-UINT32_MAX offset the addition wraps to a
    small value, the comparison passes, and the function returns a wild
    pointer past the VBIOS mapping.
    
    Additionally, the comparison uses '<' (strict), which incorrectly
    rejects the valid exact-fit case where offset + size == bios_size.
    
    Fix both issues by restructuring the check to avoid the addition
    entirely: first reject if offset alone exceeds bios_size, then check
    size against the remaining space (bios_size - offset). This eliminates
    the overflow and correctly permits exact-fit accesses.
    
    Assisted-by: GitHub Copilot:claude-opus-4.6
    Reviewed-by: Alex Hung <alex.hung@amd.com>
    Signed-off-by: Harry Wentland <harry.wentland@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Dan Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit d40fb392af659c4a02b560319f226842f6ec1a95)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Validate GPIO pin LUT table size before iterating [+ + +]

Author: Harry Wentland <harry.wentland@amd.com>
Date:   Mon May 4 16:14:11 2026 -0400

    drm/amd/display: Validate GPIO pin LUT table size before iterating
    
    commit 86d2b20644b11d21fe52c596e6e922b4590a3e3f upstream.
    
    [Why&How]
    The GPIO pin table parsers in get_gpio_i2c_info() and
    bios_parser_get_gpio_pin_info() derive an element count from the VBIOS
    table_header.structuresize field, then iterate over gpio_pin[] entries.
    However, GET_IMAGE() only validates that the table header itself fits
    within the BIOS image. If the VBIOS reports a structuresize larger than
    the actual mapped data, the loop reads past the end of the BIOS image,
    causing an out-of-bounds read.
    
    Fix this by calling bios_get_image() to validate that the full claimed
    structuresize is accessible within the BIOS image before entering the
    loop in both functions.
    
    Assisted-by: GitHub Copilot:claude-opus-4-6
    Reviewed-by: Alex Hung <alex.hung@amd.com>
    Signed-off-by: Harry Wentland <harry.wentland@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Dan Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit ba5e95b43b773ae1bf1f66ee6b31eb774e65afe3)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amd/display: Validate payload length and link_index in dc_process_dmub_aux_transfer_async [+ + +]

Author: Harry Wentland <harry.wentland@amd.com>
Date:   Thu May 7 16:26:31 2026 -0400

    drm/amd/display: Validate payload length and link_index in dc_process_dmub_aux_transfer_async
    
    commit 6c92f6d9600efa3ef0d9e560a2b52776d9803c29 upstream.
    
    [Why&How]
    dc_process_dmub_aux_transfer_async() copies payload->length bytes into a
    16-byte stack buffer (dpaux.data[16]) guarded only by an ASSERT(), which
    is a no-op in release builds. If a caller ever passes length > 16 this
    results in a stack buffer overflow via memcpy.
    
    Additionally, link_index is used to dereference dc->links[] without
    bounds checking against dc->link_count, risking an out-of-bounds access.
    
    Replace the ASSERT with a hard runtime check that returns false when
    payload->length exceeds the destination buffer size, and add a bounds
    check for link_index before it is used.
    
    Assisted-by: GitHub Copilot:Claude claude-4-opus
    Reviewed-by: Alex Hung <alex.hung@amd.com>
    Signed-off-by: Harry Wentland <harry.wentland@amd.com>
    Signed-off-by: Ivan Lipski <ivan.lipski@amd.com>
    Tested-by: Dan Wheeler <daniel.wheeler@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit ba4caa9fecdf7a38f98c878ad05a8a64148b6881)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amdgpu/vce1: Check that the GPU address is < 128 MiB [+ + +]

Author: Timur Kristóf <timur.kristof@gmail.com>
Date:   Wed May 13 22:04:09 2026 +0200

    drm/amdgpu/vce1: Check that the GPU address is < 128 MiB
    
    [ Upstream commit 9f907adb66d8369dd45412794a04845011503fa8 ]
    
    When ensuring the low 32-bit address, make sure it is
    less than 128 MiB, otherwise the VCE seems to fail to initialize.
    This seems to be an undocumented limitation of the firmware
    validation mechanism. Note that in case of VCE1 the BAR
    address is zero and we can't change it also due to the
    firmware validator.
    
    When programming the mmVCE_VCPU_CACHE_OFFSETn registers,
    don't AND them with a mask. This is incorrect because
    the register mask is actually 0x0fffffff and useless because
    we already ensure the addresses are below the limit.
    
    Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit e729ae5f3ac73c861c062080ac8c3d666c972404)
    Stable-dep-of: 3e5a1d5bb2ff ("drm/amdgpu/vce1: Fix VCE 1 firmware size and offsets")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu/vce1: Fix VCE 1 firmware size and offsets [+ + +]

Author: Timur Kristóf <timur.kristof@gmail.com>
Date:   Wed May 13 22:04:13 2026 +0200

    drm/amdgpu/vce1: Fix VCE 1 firmware size and offsets
    
    [ Upstream commit 3e5a1d5bb2ff061e64c7992f8e5404dfd4c2d0f3 ]
    
    The VCPU BO contains the actual FW at an offset, but
    it was not calculated into the VCPU BO size.
    Subtract this from the FW size to make sure there is
    no out of bounds access.
    
    Make sure the stack and data offsets are aligned to
    the 32K TLB size.
    
    Check that the FW microcode actually fits in the
    space that is reserved for it.
    
    Fixes: d4a640d4b9f3 ("drm/amdgpu/vce1: Implement VCE1 IP block (v2)")
    Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit c16fe59f622a080fc457a57b3e8f14c780699449)
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/amdgpu/vpe: Force collaborate sync after TRAP [+ + +]

Author: Alan Liu <haoping.liu@amd.com>
Date:   Fri May 1 12:35:48 2026 +0800

    drm/amdgpu/vpe: Force collaborate sync after TRAP
    
    commit b6074630a461b1322a814988779005cbc43612ea upstream.
    
    VPE1 could possibly hang and fail to power off at the end of commands in
    collaboration mode. This workaround adds a COLLAB_SYNC after TRAP to
    force instances synchronized to avoid VPE1 fail to power off.
    
    Reviewed-by: Mario Limonciello (AMD) <superm1@kernel.org>
    Signed-off-by: Alan liu <haoping.liu@amd.com>
    Closes: https://gitlab.freedesktop.org/drm/amd/-/work_items/5171
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit a8b749c5c5afb7e5daa2bfb95d958fb3c6b8f055)
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/amdgpu: Align amdgpu_gtt_mgr entries to TLB size on Tahiti (v2) [+ + +]

Author: Timur Kristóf <timur.kristof@gmail.com>
Date:   Wed May 13 22:04:08 2026 +0200

    drm/amdgpu: Align amdgpu_gtt_mgr entries to TLB size on Tahiti (v2)
    
    [ Upstream commit 4d798ea0712fddbd35b439cef32b8ac735eb76f9 ]
    
    The TLB is organized in groups of 8 entries, each one is 4K.
    On Tahiti, the HW requires these GART entries to be 32K-aligned.
    
    This fixes a VCE 1 firmware validation failure that can happen
    after suspend/resume since we use amdgpu_gtt_mgr for VCE 1.
    
    v2:
    - Change variable declaration order
    - Add comment about "V bit HW bug"
    
    Fixes: 698fa62f56aa ("drm/amdgpu: Add helper to alloc GART entries")
    Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    (cherry picked from commit 530411b465ef0b2c0cc18c2e3d7e38422b1117d1)
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/bridge: chipone-icn6211: use devm_drm_bridge_add in i2c probe [+ + +]

Author: Osama Abdelkader <osama.abdelkader@gmail.com>
Date:   Thu Apr 30 21:49:42 2026 +0200

    drm/bridge: chipone-icn6211: use devm_drm_bridge_add in i2c probe
    
    commit 73d01051e8040c0b1de7fd26b3b8d0c2ffa6895c upstream.
    
    Use devm_drm_bridge_add() so the bridge is released if probe
    fails after registration, and drop drm_bridge_remove() in chipone_i2c_probe.
    
    Signed-off-by: Osama Abdelkader <osama.abdelkader@gmail.com>
    Fixes: 8dde6f7452a1 ("drm: bridge: icn6211: Add I2C configuration support")
    Cc: stable@vger.kernel.org
    Reviewed-by: Luca Ceresoli <luca.ceresoli@bootlin.com>
    Link: https://patch.msgid.link/20260430194944.78119-1-osama.abdelkader@gmail.com
    Signed-off-by: Luca Ceresoli <luca.ceresoli@bootlin.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/bridge: it66121: acquire reset GPIO in probe [+ + +]

Author: Julien Chauveau <chauveau.julien@gmail.com>
Date:   Tue Mar 24 20:30:11 2026 +0100

    drm/bridge: it66121: acquire reset GPIO in probe
    
    commit e02b5262fd288cc235f14e12233ea54e78c04611 upstream.
    
    The it66121_ctx structure has a gpio_reset field, and it66121_hw_reset()
    calls gpiod_set_value() on it. However, the GPIO descriptor is never
    acquired via devm_gpiod_get(), leaving gpio_reset as NULL throughout
    the driver lifetime.
    
    gpiod_set_value() silently returns when passed a NULL descriptor, so
    the hardware reset sequence in it66121_hw_reset() is a no-op. This
    leaves the chip in an undefined state at probe time, which can prevent
    it from responding on the I2C bus.
    
    The DT binding marks reset-gpios as a required property, so all
    compliant device trees provide this GPIO. Add the missing
    devm_gpiod_get() call after enabling power supplies and before the
    hardware reset, so the chip is properly reset with power applied.
    
    Fixes: 988156dc2fc9 ("drm: bridge: add it66121 driver")
    Cc: stable@vger.kernel.org
    Signed-off-by: Julien Chauveau <chauveau.julien@gmail.com>
    Reviewed-by: Javier Martinez Canillas <javierm@redhat.com>
    Tested-by: Javier Martinez Canillas <javierm@redhat.com>
    Link: https://patch.msgid.link/20260324193011.16583-1-chauveau.julien@gmail.com
    Signed-off-by: Javier Martinez Canillas <javierm@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/bridge: megachips: remove bridge when irq request fails [+ + +]

Author: Osama Abdelkader <osama.abdelkader@gmail.com>
Date:   Thu Apr 30 21:56:59 2026 +0200

    drm/bridge: megachips: remove bridge when irq request fails
    
    commit d45d5c819f2cd0b6b5d76a194a537a5f4aeefecb upstream.
    
    If devm_request_threaded_irq() fails after drm_bridge_add(), remove the
    bridge before returning.
    
    Keep drm_bridge_add() rather than devm_drm_bridge_add(): registration is
    tied to the STDP4028 device while ge_b850v3_register() may complete from
    either I2C probe; devm would not unwind the bridge if the other client's
    probe fails.
    
    Signed-off-by: Osama Abdelkader <osama.abdelkader@gmail.com>
    Fixes: fcfa0ddc18ed ("drm/bridge: Drivers for megachips-stdpxxxx-ge-b850v3-fw (LVDS-DP++)")
    Cc: stable@vger.kernel.org
    Reviewed-by: Luca Ceresoli <luca.ceresoli@bootlin.com>
    Tested-by: Ian Ray <ian.ray@gehealthcare.com>
    Link: https://patch.msgid.link/20260430195700.80317-1-osama.abdelkader@gmail.com
    Signed-off-by: Luca Ceresoli <luca.ceresoli@bootlin.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/gem: Make the GEM LRU lock part of drm_device [+ + +]

Author: Boris Brezillon <boris.brezillon@collabora.com>
Date:   Mon May 18 13:41:45 2026 +0200

    drm/gem: Make the GEM LRU lock part of drm_device
    
    [ Upstream commit 379e8f1ca5e919b130b40d8115d92a536e5f8d7a ]
    
    Recently, a few races have been discovered in the GEM LRU logic, all
    of them caused by the fact the LRU lock is accessed through
    gem->lru->lock, and that very same lock also protects changes to
    gem->lru, leading to situations where gem->lru needs to first be
    accessed without the lock held, to then get the lru to access the lock
    through and finally take the lock and do the expected operation.
    
    Currently, the only driver making use of this API (MSM) declares a
    device-wide lock, and the user we're about to add (panthor) will
    do the same. There's no evidence that we will ever have a driver
    that wants different pools of LRUs protected by different locks under
    the same drm_device. So we're better off moving this lock to drm_device
    and always locking it through obj->dev->gem_lru_mutex, or directly
    through dev->gem_lru_mutex.
    
    If anyone ever needs more fine-grained locking, this can be revisited
    to pass some drm_gem_lru_pool object representing the pool of LRUs
    under a specific lock, but for now, the per-device lock seems to be
    enough.
    
    Fixes: e7c2af13f811 ("drm/gem: Add LRU/shrinker helper")
    Reported-by: Chia-I Wu <olvaffe@gmail.com>
    Closes: https://gitlab.freedesktop.org/panfrost/linux/-/work_items/86
    Reviewed-by: Rob Clark <rob.clark@oss.qualcomm.com>
    Reviewed-by: Liviu Dudau <liviu.dudau@arm.com>
    Reviewed-by: Steven Price <steven.price@arm.com>
    Reviewed-by: Chia-I Wu <olvaffe@gmail.com>
    Link: https://patch.msgid.link/20260518-panthor-shrinker-fixes-v4-1-1920234470d5@collabora.com
    Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/i915/display: Copy color pipeline from plane in the primary joiner pipe [+ + +]

Author: Chaitanya Kumar Borah <chaitanya.kumar.borah@intel.com>
Date:   Mon May 11 11:02:10 2026 +0530

    drm/i915/display: Copy color pipeline from plane in the primary joiner pipe
    
    commit 86ed2d96db1965e9008e919b1936145ae66540e3 upstream.
    
    When copying plane color state in a joiner configuration, use the plane in
    the primary joiner pipe since it carries the pipeline number selected by
    the user-space.
    
    This assumes that all pipes in the joiner are symmetric in their plane
    color capabilities.
    
    Cc: stable@vger.kernel.org # v6.19+
    Fixes: a78f1b6baf4d ("drm/i915/color: Add framework to program CSC")
    Tested-by: Vidya Srinivas <vidya.srinivas@intel.com>
    Signed-off-by: Chaitanya Kumar Borah <chaitanya.kumar.borah@intel.com>
    Reviewed-by: Uma Shankar <uma.shankar@intel.com>
    Signed-off-by: Ankit Nautiyal <ankit.k.nautiyal@intel.com>
    Link: https://patch.msgid.link/20260511053213.3122314-2-chaitanya.kumar.borah@intel.com
    (cherry picked from commit e8308fb5e05ca08ddfb8b46f6d947a6e3fd80cd7)
    Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/i915/dp: Fix readback for target_rr in Adaptive Sync SDP [+ + +]

Author: Ankit Nautiyal <ankit.k.nautiyal@intel.com>
Date:   Mon May 11 18:02:15 2026 +0530

    drm/i915/dp: Fix readback for target_rr in Adaptive Sync SDP
    
    [ Upstream commit f87abd0c6604fb6cc31cc86fc7ccc6a576924352 ]
    
    Correct the bit-shift logic to properly readback the 10 bit target_rr from
    DB3 and DB4.
    
    v2: Align the style with readback for vtotal. (Ville)
    
    Fixes: 12ea89291603 ("drm/i915/dp: Add Read/Write support for Adaptive Sync SDP")
    Cc: Mitul Golani <mitulkumar.ajitkumar.golani@intel.com>
    Cc: Ankit Nautiyal <ankit.k.nautiyal@intel.com>
    Signed-off-by: Ankit Nautiyal <ankit.k.nautiyal@intel.com>
    Reviewed-by: Ville Syrjälä <ville.syrjala@linux.intel.com>
    Link: https://patch.msgid.link/20260511123218.1589830-2-ankit.k.nautiyal@intel.com
    (cherry picked from commit f7abc4af2b19240a145a221461dfe756cc01d74a)
    Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/mediatek: mtk_cec: Fix non-static global variable [+ + +]

Author: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
Date:   Wed Apr 29 11:59:01 2026 +0200

    drm/mediatek: mtk_cec: Fix non-static global variable
    
    [ Upstream commit 571f00a5fb725984049bd532ee8193cc34ff2994 ]
    
    The struct 'mtk_cec_driver' is not used outside of the
    mtk_cec.c file, so make it static to silence sparse warning:
    ```
    drivers/gpu/drm/mediatek/mtk_cec.c:243:24: sparse: warning: symbol
    'mtk_cec_driver' was not declared. Should it be static?
    ```
    
    Fixes: 1e914a89ab7e ("drm/mediatek: mtk_cec: Switch to register as module_platform_driver")
    Signed-off-by: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
    Reviewed-by: CK Hu <ck.hu@mediatek.com>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/20260429-mediatek-drm-fix-sparse-warnings-v1-3-d95c4d118b83@collabora.com/
    Signed-off-by: Chun-Kuang Hu <chunkuang.hu@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/mediatek: mtk_hdmi_ddc: Fix non-static global variable [+ + +]

Author: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
Date:   Wed Apr 29 11:59:02 2026 +0200

    drm/mediatek: mtk_hdmi_ddc: Fix non-static global variable
    
    [ Upstream commit 87ed4e845d5a90bba1a56c0a5c580a13982e8648 ]
    
    The struct 'mtk_hdmi_ddc_driver' is not used outside of the
    mtk_hdmi_ddc.c file, so make it static to silence sparse warning:
    ```
    drivers/gpu/drm/mediatek/mtk_hdmi_ddc.c:331:24: sparse: warning: symbol
      'mtk_hdmi_ddc_driver' was not declared. Should it be static?
    ```
    
    Fixes: c241118b6216 ("drm/mediatek: mtk_hdmi_ddc: Switch to register as module_platform_driver")
    Signed-off-by: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
    Reviewed-by: CK Hu <ck.hu@mediatek.com>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/20260429-mediatek-drm-fix-sparse-warnings-v1-4-d95c4d118b83@collabora.com/
    Signed-off-by: Chun-Kuang Hu <chunkuang.hu@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/mediatek: mtk_hdmi_ddc_v2: Fix non-static global variable [+ + +]

Author: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
Date:   Wed Apr 29 11:58:59 2026 +0200

    drm/mediatek: mtk_hdmi_ddc_v2: Fix non-static global variable
    
    [ Upstream commit e9f5e8da29762df1111a58ae0b4a83091595d834 ]
    
    The struct 'mtk_hdmi_ddc_v2_driver' is not used outside of the
    mtk_hdmi_ddc_v2.c file, so make it static to silence sparse warning:
    ```
    drivers/gpu/drm/mediatek/mtk_hdmi_ddc_v2.c:392:24: sparse: warning:
      symbol 'mtk_hdmi_ddc_v2_driver' was not declared. Should it be
      static?
    ```
    
    Fixes: 8d0f79886273 ("drm/mediatek: Introduce HDMI/DDC v2 for MT8195/MT8188")
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202604132044.fcYjEcU8-lkp@intel.com/
    Signed-off-by: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
    Reviewed-by: CK Hu <ck.hu@mediatek.com>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/20260429-mediatek-drm-fix-sparse-warnings-v1-1-d95c4d118b83@collabora.com/
    Signed-off-by: Chun-Kuang Hu <chunkuang.hu@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/mediatek: mtk_hdmi_v2: Fix non-static global variable [+ + +]

Author: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
Date:   Wed Apr 29 11:59:00 2026 +0200

    drm/mediatek: mtk_hdmi_v2: Fix non-static global variable
    
    [ Upstream commit dc245d9a7f1b06f86271d4e524d6e5634c5ce312 ]
    
    The struct 'mtk_hdmi_v2_clk_names' is not used outside of the
    mtk_hdmi_v2.c file, so make it static to silence sparse warning:
    ```
    drivers/gpu/drm/mediatek/mtk_hdmi_v2.c:53:12: sparse: warning: symbol
    'mtk_hdmi_v2_clk_names' was not declared. Should it be static?
    ```
    
    Fixes: 8d0f79886273 ("drm/mediatek: Introduce HDMI/DDC v2 for MT8195/MT8188")
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/oe-kbuild-all/202604132044.fcYjEcU8-lkp@intel.com/
    Signed-off-by: Louis-Alexis Eyraud <louisalexis.eyraud@collabora.com>
    Reviewed-by: CK Hu <ck.hu@mediatek.com>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/20260429-mediatek-drm-fix-sparse-warnings-v1-2-d95c4d118b83@collabora.com/
    Signed-off-by: Chun-Kuang Hu <chunkuang.hu@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/a6xx: Add soft fuse detection support [+ + +]

Author: Akhil P Oommen <akhilpo@oss.qualcomm.com>
Date:   Fri Mar 27 05:44:01 2026 +0530

    drm/msm/a6xx: Add soft fuse detection support
    
    [ Upstream commit 4ac686bfd1929ef659a99f893ebe8faf7f35c76c ]
    
    Recent chipsets like Glymur supports a new mechanism for SKU detection.
    A new CX_MISC register exposes the combined (or final) speedbin value
    from both HW fuse register and the Soft Fuse register. Implement this new
    SKU detection along with a new quirk to identify the GPUs that has soft
    fuse support.
    
    There is a side effect of this patch on A4x and older series. The
    speedbin field in the MSM_PARAM_CHIPID will be 0 instead of 0xffff. This
    should be okay as Mesa correctly handles it. Speedbin was not even a
    thing when those GPUs' support were added.
    
    Signed-off-by: Akhil P Oommen <akhilpo@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/714676/
    Message-ID: <20260327-a8xx-gpu-batch2-v2-12-2b53c38d2101@oss.qualcomm.com>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Stable-dep-of: e64bca63647d ("drm/msm/adreno: Fix a reference leak in a6xx_gpu_init()")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/a6xx: Check kzalloc return in a8xx_hfi_send_perf_table [+ + +]

Author: Chen Ni <nichen@iscas.ac.cn>
Date:   Tue Apr 28 15:35:58 2026 +0800

    drm/msm/a6xx: Check kzalloc return in a8xx_hfi_send_perf_table
    
    [ Upstream commit b5c7a7f452b885bfbe102bd3a057a5f496802f8b ]
    
    Check the return value of kzalloc() to prevent a NULL pointer
    dereference on allocation failure.
    
    Fixes: 06cfbca0e1c6 ("drm/msm/a6xx: Share dependency vote table with GMU")
    Signed-off-by: Chen Ni <nichen@iscas.ac.cn>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Reviewed-by: Akhil P Oommen <akhilpo@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/721342/
    Message-ID: <20260428073558.1234238-1-nichen@iscas.ac.cn>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/a6xx: Restore sysprof_active [+ + +]

Author: Rob Clark <robin.clark@oss.qualcomm.com>
Date:   Sat Apr 11 08:03:12 2026 -0700

    drm/msm/a6xx: Restore sysprof_active
    
    [ Upstream commit 7a529ff48b99011c946e6d8addd071c06d3ccdae ]
    
    This got lost in the shuffle somehow when moving the vfunc table to
    catalogue.  Fixes inhibiting IFPC when userspace is collecting perfcntr
    data.
    
    Fixes: 491fadb2b818 ("drm/msm/adreno: Move adreno_gpu_func to catalogue")
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Reviewed-by: Akhil P Oommen <akhilpo@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/717780/
    Message-ID: <20260411150312.257937-1-robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/adreno: Fix a reference leak in a6xx_gpu_init() [+ + +]

Author: Felix Gu <ustc.gu@gmail.com>
Date:   Sat Jan 24 00:37:38 2026 +0800

    drm/msm/adreno: Fix a reference leak in a6xx_gpu_init()
    
    [ Upstream commit e64bca63647db1d5518198d6c5ca2dbcc66b182b ]
    
    In a6xx_gpu_init(), node is obtained via of_parse_phandle().
    While there was a manual of_node_put() at the end of the
    common path, several early error returns would bypass this call,
    resulting in a reference leak.
    Fix this by using the __free(device_node) cleanup handler to
    release the reference when the variable goes out of scope.
    
    Fixes: 5a903a44a984 ("drm/msm/a6xx: Introduce GMU wrapper support")
    Signed-off-by: Felix Gu <ustc.gu@gmail.com>
    Patchwork: https://patchwork.freedesktop.org/patch/700661/
    Message-ID: <20260124-a6xx_gpu-v1-1-fa0c8b2dcfb1@gmail.com>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/adreno: fix userspace-triggered crash on a2xx-a4xx [+ + +]

Author: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
Date:   Sat Apr 11 17:59:15 2026 +0300

    drm/msm/adreno: fix userspace-triggered crash on a2xx-a4xx
    
    [ Upstream commit 2b4abf879360ea00a9e2b46d2d15dcdbc0687eed ]
    
    Before a5xx Adreno driver will not try fetching UBWC params (because
    those generations didn't support UBWC anyway), however it's still
    possible to query UBWC-related params from the userspace, triggering
    possible NULL pointer dereference. Check for UBWC config in
    adreno_get_param() and return sane defaults if there is none.
    
    Fixes: a452510aad53 ("drm/msm/adreno: Switch to the common UBWC config struct")
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Reviewed-by: Rob Clark <rob.clark@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/717778/
    Message-ID: <20260411-adreno-fix-ubwc-v3-1-4983156f3f80@oss.qualcomm.com>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/dpu: don't mix devm and drmm functions [+ + +]

Author: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
Date:   Tue May 5 03:24:58 2026 +0300

    drm/msm/dpu: don't mix devm and drmm functions
    
    [ Upstream commit c0c70a11365cba7fba25a77463582bcec0f7846e ]
    
    Mixing devm and drmm functions will result in a use-after-free on msm
    driver teardown if userspace keeps a reference on the drm device:
    The WB connector data will be destroyed because of the use of
    devm_kzalloc()), while the usersoace still can try interacting with the
    WB connector (which uses drmm_ functions).
    
    Change dpu_writeback_init() to use drmm_.
    
    Fixes: 0b37ac63fc9d ("drm/msm/dpu: use drmm_writeback_connector_init()")
    Reported-by: Christophe JAILLET <christophe.jaillet@wanadoo.fr>
    Closes: https://lore.kernel.org/r/78c764b8-44cf-4db5-88e7-807a85954518@wanadoo.fr
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Reviewed-by: John.Harrison@Igalia.com
    Patchwork: https://patchwork.freedesktop.org/patch/722656/
    Link: https://lore.kernel.org/r/20260505-wb-drop-encoder-v5-1-42567b7c7af2@oss.qualcomm.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/dpu: Fix Kaanapali CWB register configuration [+ + +]

Author: Mahadevan P <mahadevan.p@oss.qualcomm.com>
Date:   Tue Apr 28 17:14:25 2026 +0530

    drm/msm/dpu: Fix Kaanapali CWB register configuration
    
    [ Upstream commit d03279f0d9fdbe6f6761f191a76093c395930018 ]
    
    The Kaanapali DPU catalog defines kaanapali_cwb[] with the correct
    CWB base addresses for this platform (0x169200, 0x169600, 0x16a200,
    0x16a600), but the dpu_kaanapali_cfg struct was mistakenly pointing
    to sm8650_cwb instead. The SM8650 CWB blocks sit at completely
    different offsets (0x66200, 0x66600, 0x7E200, 0x7E600), so using
    them on Kaanapali would program CWB registers at wrong addresses,
    corrupting unrelated hardware blocks and breaking writeback capture.
    
    Fix this by pointing .cwb to the correct kaanapali_cwb array.
    
    Fixes: 83fe2cd56b1d ("drm/msm/dpu: Add support for Kaanapali DPU")
    Signed-off-by: Mahadevan P <mahadevan.p@oss.qualcomm.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/721444/
    Link: https://lore.kernel.org/r/20260428-kaanapali_cwb-v1-1-51fdb2c65498@oss.qualcomm.com
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/dpu: fix UV scanlines calculation for YUV UBWC formats [+ + +]

Author: Neil Armstrong <neil.armstrong@linaro.org>
Date:   Tue Apr 14 17:14:30 2026 +0200

    drm/msm/dpu: fix UV scanlines calculation for YUV UBWC formats
    
    [ Upstream commit 933430f1709b089a0bf0b23ef0f047014ef899e7 ]
    
    The UV scanlines is calculated with (height + 1) / 2 unlike
    the Y scanlines, add back the correct scanlines calculation
    for UBWC YUV formats.
    
    Fixes: 2f3ff6ab8f5c ("drm/msm/dpu: use standard functions in _dpu_format_populate_plane_sizes_ubwc()")
    Fixes: ada4a19ed21c ("drm/msm/dpu: rewrite _dpu_format_populate_plane_sizes_ubwc()")
    Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/718309/
    Link: https://lore.kernel.org/r/20260414-topic-sm8x50-msm-dpu1-formats-qc10c-v1-1-0b62325b9030@linaro.org
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/dsi: don't dump registers past the mapped region [+ + +]

Author: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
Date:   Tue Apr 28 20:21:38 2026 +0300

    drm/msm/dsi: don't dump registers past the mapped region
    
    [ Upstream commit 5b49a46baa853b26dbefa65c6c75dd9ff69f63d4 ]
    
    On DSI 6G platforms the IO address space is internally adjusted by
    io_offset. Later this adjusted address might be used for memory dumping.
    However the size that is used for memory dumping isn't adjusted to
    account for the io_offset, leading to the potential access to the
    unmapped region. Lower ctrl_size by the io_offset value to prevent
    access past the mapped area.
    
     msm_disp_snapshot_add_block+0x1d4/0x3c8 [msm] (P)
     msm_dsi_host_snapshot+0x4c/0x78 [msm]
     msm_dsi_snapshot+0x28/0x50 [msm]
     msm_disp_snapshot_capture_state+0x74/0x140 [msm]
     msm_disp_snapshot_state_sync+0x60/0x90 [msm]
     _msm_disp_snapshot_work+0x30/0x90 [msm]
     kthread_worker_fn+0xdc/0x460
     kthread+0x120/0x140
    
    Fixes: bac2c6a62ed9 ("drm/msm: get rid of msm_iomap_size")
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/721747/
    Link: https://lore.kernel.org/r/20260428-msm-fix-dsi-dump-v1-1-5d4cb5ccfac7@oss.qualcomm.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm/snapshot: fix dumping of the unaligned regions [+ + +]

Author: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
Date:   Sat May 16 14:53:45 2026 +0300

    drm/msm/snapshot: fix dumping of the unaligned regions
    
    [ Upstream commit 76824d2467feb1828b745d6add2541918d7be3da ]
    
    The snapshotting code internally aligns data segment to 16 bytes. This
    works fine for DPU code (where most of the regions are aligned), but
    fails for snapshotting of the DSI data (because DSI data region is
    shifted by 4 bytes). Fix the code by removing length alignment and by
    accurately printing last registers in the region. While reworking the
    code also fix the 16x memory overallocation in
    msm_disp_state_dump_regs().
    
    Fixes: 98659487b845 ("drm/msm: add support to take dpu snapshot")
    Reported-by: Salendarsingh Gaud <sgaud@qti.qualcomm.com>
    Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/725449/
    Message-ID: <20260516-msm-fix-dsi-dump-2-v2-1-9e49fb2d240e@oss.qualcomm.com>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm: Fix GMEM_BASE for A650 [+ + +]

Author: Alexander Koskovich <akoskovich@pm.me>
Date:   Sat Mar 14 04:14:50 2026 +0000

    drm/msm: Fix GMEM_BASE for A650
    
    [ Upstream commit 46e351e84853dda726072bb3d38ba7bd63e7532b ]
    
    Commit dc220915ddb2 ("drm/msm: Fix GMEM_BASE for gen8") changed the
    GMEM_BASE check from adreno_is_a650_family() & adreno_is_a740_family()
    to family >= ADRENO_6XX_GEN4.
    
    This inadvertently excluded A650 (ADRENO_6XX_GEN3), causing it to report
    an incorrect GMEM_BASE which results in severe rendering corruption.
    
    Update check to also include ADRENO_6XX_GEN3 to fix A650.
    
    Fixes: dc220915ddb2 ("drm/msm: Fix GMEM_BASE for gen8")
    Signed-off-by: Alexander Koskovich <akoskovich@pm.me>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Reviewed-by: Akhil P Oommen <akhilpo@oss.qualcomm.com>
    Patchwork: https://patchwork.freedesktop.org/patch/711880/
    Message-ID: <20260314-fix-gmem-base-a650-v1-1-3308f60cf74c@pm.me>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm: Fix iommu_map_sgtable() return value check and avoid WARN [+ + +]

Author: Mikko Perttunen <mperttunen@nvidia.com>
Date:   Tue Apr 21 13:02:38 2026 +0900

    drm/msm: Fix iommu_map_sgtable() return value check and avoid WARN
    
    [ Upstream commit 55e0f0d1c1a4ee1e46da7da4d443eb3044fb3851 ]
    
    Commit "iommu: return full error code from iommu_map_sg[_atomic]()"
    changed iommu_map_sgtable() to return an ssize_t and negative values
    in error cases, rather than a size_t and a zero.
    
    Store the return value in the appropriate type and in case of error,
    return it rather than WARNing.
    
    Fixes: ad8f36e4b6b1 ("iommu: return full error code from iommu_map_sg[_atomic]()")
    Signed-off-by: Mikko Perttunen <mperttunen@nvidia.com>
    Patchwork: https://patchwork.freedesktop.org/patch/719685/
    Message-ID: <20260421-iommu_map_sgtable-return-v1-3-fb484c07d2a1@nvidia.com>
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/msm: Fix shrinker deadlock [+ + +]

Author: Daniel J Blueman <daniel@quora.org>
Date:   Fri May 8 14:57:21 2026 +0800

    drm/msm: Fix shrinker deadlock
    
    commit 3392291fc509d8ad6e4ad90f15b0a193f721cbc9 upstream.
    
    With PROVE_LOCKING on an Snapdragon X1 and VM reclaim pressure, we see:
    
       ======================================================
       WARNING: possible circular locking dependency detected
       7.0.0-debug+ #43 Tainted: G        W
       ------------------------------------------------------
       kswapd0/82 is trying to acquire lock:
       ffff800080ec3870 (reservation_ww_class_acquire){+.+.}-{0:0}, at: msm_gem_shrinker_scan+0x17c/0x400 [msm]
    
       but task is already holding lock:
       ffffc31709b263b8 (fs_reclaim){+.+.}-{0:0}, at: balance_pgdat+0x88/0x988
    
       which lock already depends on the new lock.
    
       the existing dependency chain (in reverse order) is:
    
       -> #2 (fs_reclaim){+.+.}-{0:0}:
              __lock_acquire+0x4d0/0xad0
              lock_acquire.part.0+0xc4/0x248
              lock_acquire+0x8c/0x248
              fs_reclaim_acquire+0xd0/0xf0
              dma_resv_lockdep+0x224/0x348
              do_one_initcall+0x84/0x5d0
              do_initcalls+0x194/0x1d8
              kernel_init_freeable+0x128/0x180
              kernel_init+0x2c/0x160
              ret_from_fork+0x10/0x20
    
       -> #1 (reservation_ww_class_mutex){+.+.}-{4:4}:
              __lock_acquire+0x4d0/0xad0
              lock_acquire.part.0+0xc4/0x248
              lock_acquire+0x8c/0x248
              dma_resv_lockdep+0x1a8/0x348
              do_one_initcall+0x84/0x5d0
              do_initcalls+0x194/0x1d8
              kernel_init_freeable+0x128/0x180
              kernel_init+0x2c/0x160
              ret_from_fork+0x10/0x20
    
       -> #0 (reservation_ww_class_acquire){+.+.}-{0:0}:
              check_prev_add+0x114/0x790
              validate_chain+0x594/0x6f0
              __lock_acquire+0x4d0/0xad0
              lock_acquire.part.0+0xc4/0x248
              lock_acquire+0x8c/0x248
              drm_gem_lru_scan+0x1ac/0x440
              msm_gem_shrinker_scan+0x17c/0x400 [msm]
              do_shrink_slab+0x150/0x4a0
              shrink_slab+0x144/0x460
              shrink_one+0x9c/0x1b0
              shrink_many+0x27c/0x5c0
              shrink_node+0x344/0x550
              balance_pgdat+0x2c0/0x988
              kswapd+0x11c/0x318
              kthread+0x10c/0x128
              ret_from_fork+0x10/0x20
    
       other info that might help us debug this:
       Chain exists of:
         reservation_ww_class_acquire --> reservation_ww_class_mutex --> fs_reclaim
        Possible unsafe locking scenario:
              CPU0                    CPU1
              ----                    ----
         lock(fs_reclaim);
                                      lock(reservation_ww_class_mutex);
                                      lock(fs_reclaim);
         lock(reservation_ww_class_acquire);
    
        *** DEADLOCK ***
       1 lock held by kswapd0/82:
        #0: ffffc31709b263b8 (fs_reclaim){+.+.}-{0:0}, at: balance_pgdat+0x88/0x988
    
       stack backtrace:
       CPU: 4 UID: 0 PID: 82 Comm: kswapd0 Tainted: G        W           7.0.0-debug+ #43 PREEMPT(full)
       Tainted: [W]=WARN
       Hardware name: LENOVO 21BX0016US/21BX0016US, BIOS N3HET94W (1.66 ) 09/15/2025
       Call trace:
        show_stack+0x20/0x40 (C)
        dump_stack_lvl+0x9c/0xd0
        dump_stack+0x18/0x30
        print_circular_bug+0x114/0x120
        check_noncircular+0x178/0x198
        check_prev_add+0x114/0x790
        validate_chain+0x594/0x6f0
        __lock_acquire+0x4d0/0xad0
        lock_acquire.part.0+0xc4/0x248
        lock_acquire+0x8c/0x248
        drm_gem_lru_scan+0x1ac/0x440
        msm_gem_shrinker_scan+0x17c/0x400 [msm]
        do_shrink_slab+0x150/0x4a0
        shrink_slab+0x144/0x460
        shrink_one+0x9c/0x1b0
        shrink_many+0x27c/0x5c0
        shrink_node+0x344/0x550
        balance_pgdat+0x2c0/0x988
        kswapd+0x11c/0x318
        kthread+0x10c/0x128
        ret_from_fork+0x10/0x20
    
    kswapd0 holding fs_reclaim calls the MSM shrinker, which calls
    dma_resv_lock. This in turn acquires fs_reclaim.
    
    Fix this deadlock by using dma_resv_trylock() instead, dropping the
    subsequently unused passed wait-wound lock 'ticket'.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Daniel J Blueman <daniel@quora.org>
    Fixes: fe4952b5f27c ("drm/msm: Convert vm locking")
    Patchwork: https://patchwork.freedesktop.org/patch/723564/
    Message-ID: <20260508065722.18785-1-daniel@quora.org>
    [rob: fixup compile errors, replace lockdep splat with something legible]
    Signed-off-by: Rob Clark <robin.clark@oss.qualcomm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/msm: Restore second parameter name in purge() and evict() [+ + +]

Author: Nathan Chancellor <nathan@kernel.org>
Date:   Mon May 18 15:17:14 2026 -0700

    drm/msm: Restore second parameter name in purge() and evict()
    
    [ Upstream commit 53676e4d44d6b38c8a0d9bff331f170ae2e41bbe ]
    
    After commit 3392291fc509 ("drm/msm: Fix shrinker deadlock"), all
    supported versions of clang warn (or error with CONFIG_WERROR=y):
    
      drivers/gpu/drm/msm/msm_gem_shrinker.c:105:58: error: omitting the parameter name in a function definition is a C23 extension [-Werror,-Wc23-extensions]
        105 | purge(struct drm_gem_object *obj, struct ww_acquire_ctx *)
            |                                                          ^
      drivers/gpu/drm/msm/msm_gem_shrinker.c:117:58: error: omitting the parameter name in a function definition is a C23 extension [-Werror,-Wc23-extensions]
        117 | evict(struct drm_gem_object *obj, struct ww_acquire_ctx *)
            |                                                          ^
      2 errors generated.
    
    With older but supported versions of GCC, this is an unconditional hard error:
    
      drivers/gpu/drm/msm/msm_gem_shrinker.c: In function 'purge':
      drivers/gpu/drm/msm/msm_gem_shrinker.c:105:35: error: parameter name omitted
       purge(struct drm_gem_object *obj, struct ww_acquire_ctx *)
                                         ^~~~~~~~~~~~~~~~~~~~~~~
      drivers/gpu/drm/msm/msm_gem_shrinker.c: In function 'evict':
      drivers/gpu/drm/msm/msm_gem_shrinker.c:117:35: error: parameter name omitted
       evict(struct drm_gem_object *obj, struct ww_acquire_ctx *)
                                         ^~~~~~~~~~~~~~~~~~~~~~~
    
    Restore the parameter name to clear up the warnings, renaming it
    "unused" to make it clear it is only needed to satisfy the prototype of
    drm_gem_lru_scan().
    
    Cc: stable@vger.kernel.org
    Fixes: 3392291fc509 ("drm/msm: Fix shrinker deadlock")
    Signed-off-by: Nathan Chancellor <nathan@kernel.org>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/v3d: Fix use-after-free of CPU job query arrays on error path [+ + +]

Author: Maíra Canal <mcanal@igalia.com>
Date:   Fri May 15 12:07:14 2026 -0300

    drm/v3d: Fix use-after-free of CPU job query arrays on error path
    
    commit b0fe80c0b9250b35e2211bf3117e7aca814a21b0 upstream.
    
    The CPU job ioctl's fail label calls kvfree() on cpu_job's timestamp and
    performance query arrays after v3d_job_cleanup(), which drops the job's
    last reference and frees cpu_job. Reading cpu_job at that point is a
    use-after-free. Also, on the early v3d_job_init() failure path, it is a
    NULL dereference, since v3d_job_deallocate() zeroes the local pointer.
    
    In the success path, the arrays are released from the scheduler's
    .free_job callback, but on the error path, they are freed manually, as
    the job was never pushed to the scheduler. While the success path deals
    with this correctly, the fail path doesn't.
    
    On top of that, the manual kvfree() calls only free the array storage;
    they don't drm_syncobj_put() the per-query syncobjs that
    v3d_timestamp_query_info_free() and v3d_performance_query_info_free()
    release on the success path. So the same fail path that triggers the
    use-after-free also leaks one syncobj reference per query.
    
    Unify the CPU job teardown into the CPU job's kref destructor, mirroring
    v3d_render_job_free(). The scheduler's .free_job slot reverts to the
    generic v3d_sched_job_free() and the fail label drops the manual
    kvfree() calls, leaving a single teardown path that is reached from both
    the scheduler and the ioctl error path. That removes the use-after-free,
    the NULL dereference, and the syncobj leak by construction.
    
    Cc: stable@vger.kernel.org
    Fixes: 9ba0ff3e083f ("drm/v3d: Create a CPU job extension for the timestamp query job")
    Assisted-by: Claude:claude-opus-4.7
    Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
    Link: https://patch.msgid.link/20260515-v3d-cpu-job-leaks-v1-1-7f147cbbf935@igalia.com
    Signed-off-by: Maíra Canal <mcanal@igalia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/v3d: Release indirect CSD GEM reference on CPU job free [+ + +]

Author: Maíra Canal <mcanal@igalia.com>
Date:   Fri May 15 12:07:15 2026 -0300

    drm/v3d: Release indirect CSD GEM reference on CPU job free
    
    commit 6eb6e5acafa46854d4363e6c34981289995f3ace upstream.
    
    v3d_get_cpu_indirect_csd_params() takes a reference to the indirect BO via
    drm_gem_object_lookup() and stashes it in cpu_job->indirect_csd.indirect,
    but nothing on the CPU job teardown path ever drops that reference.
    
    Drop the extra reference in v3d_cpu_job_free(). The NULL check covers ioctl
    errors before the lookup ran and CPU job types other than
    V3D_CPU_JOB_TYPE_INDIRECT_CSD, which leave the field zero-initialised.
    
    Cc: stable@vger.kernel.org
    Fixes: 18b8413b25b7 ("drm/v3d: Create a CPU job extension for a indirect CSD job")
    Assisted-by: Claude:claude-opus-4.7
    Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
    Link: https://patch.msgid.link/20260515-v3d-cpu-job-leaks-v1-2-7f147cbbf935@igalia.com
    Signed-off-by: Maíra Canal <mcanal@igalia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/virtio: use uninterruptible resv lock for plane updates [+ + +]

Author: Deepanshu Kartikey <kartikey406@gmail.com>
Date:   Tue May 19 13:52:47 2026 +0530

    drm/virtio: use uninterruptible resv lock for plane updates
    
    commit 9af1b6e175c82daf4b423da339a722d8e67a735a upstream.
    
    virtio_gpu_cursor_plane_update() and virtio_gpu_resource_flush() lock
    the framebuffer BO's dma_resv via virtio_gpu_array_lock_resv() and
    ignore its return value. The function can fail with -EINTR from
    dma_resv_lock_interruptible() (signal during lock wait) or with
    -ENOMEM from dma_resv_reserve_fences() (fence slot allocation),
    leaving the resv lock not held. The queue path then walks the object
    array and calls dma_resv_add_fence(), which requires the lock held;
    with lockdep enabled this trips dma_resv_assert_held():
    
      WARNING: drivers/dma-buf/dma-resv.c:296 at dma_resv_add_fence+0x71e/0x840
      Call Trace:
       virtio_gpu_array_add_fence
       virtio_gpu_queue_ctrl_sgs
       virtio_gpu_queue_fenced_ctrl_buffer
       virtio_gpu_cursor_plane_update
       drm_atomic_helper_commit_planes
       drm_atomic_helper_commit_tail
       commit_tail
       drm_atomic_helper_commit
       drm_atomic_commit
       drm_atomic_helper_update_plane
       __setplane_atomic
       drm_mode_cursor_universal
       drm_mode_cursor_common
       drm_mode_cursor_ioctl
       drm_ioctl
       __x64_sys_ioctl
    
    Beyond the WARN, mutating the dma_resv fence list without the lock
    races with concurrent readers/writers and can corrupt the list.
    
    Both call sites run inside the .atomic_update plane callback, which
    DRM atomic helpers do not allow to fail (by the time it runs, the
    commit has been signed off to userspace and there is no clean
    rollback path). Moving the lock acquisition to .prepare_fb was
    rejected because the broader lock scope deadlocks against other BO
    locking paths in the same atomic commit.
    
    Introduce virtio_gpu_lock_one_resv_uninterruptible() that uses
    dma_resv_lock() instead of dma_resv_lock_interruptible(). This
    eliminates the -EINTR failure mode -- the realistic syzbot trigger
    -- without extending the lock hold across the commit. The helper
    locks a single BO and rejects nents > 1 with -EINVAL; both fix
    sites lock exactly one BO.
    
    Use it from virtio_gpu_cursor_plane_update() and
    virtio_gpu_resource_flush(); check the return value to handle the
    remaining -ENOMEM case from dma_resv_reserve_fences() by freeing
    the objs and skipping the plane update for that frame. The
    framebuffer BOs touched here are not shared with other contexts
    and lock contention is expected to be brief, so the loss of
    signal-interruptibility is acceptable.
    
    Other callers of virtio_gpu_array_lock_resv() (the ioctl paths)
    continue to use the interruptible variant.
    
    The bug was reported by syzbot, triggered via fault injection
    (fail_nth) on the DRM_IOCTL_MODE_CURSOR path, which forces the
    -ENOMEM branch in dma_resv_reserve_fences().
    
    Reported-by: syzbot+72bd3dd3a5d5f39a0271@syzkaller.appspotmail.com
    Closes: https://syzkaller.appspot.com/bug?extid=72bd3dd3a5d5f39a0271
    Fixes: 5cfd31c5b3a3 ("drm/virtio: fix virtio_gpu_cursor_plane_update().")
    Cc: stable@vger.kernel.org
    Signed-off-by: Deepanshu Kartikey <kartikey406@gmail.com>
    Signed-off-by: Dmitry Osipenko <dmitry.osipenko@collabora.com>
    Link: https://patch.msgid.link/20260519082247.34470-1-kartikey406@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/xe/gsc: Fix double-free of managed BO in error path [+ + +]

Author: Shuicheng Lin <shuicheng.lin@intel.com>
Date:   Mon May 11 15:41:34 2026 +0000

    drm/xe/gsc: Fix double-free of managed BO in error path
    
    [ Upstream commit d3ded53fab90996e7d94a39049e11962dd066725 ]
    
    The error path in xe_gsc_init_post_hwconfig() explicitly frees a BO
    allocated with xe_managed_bo_create_pin_map() via
    xe_bo_unpin_map_no_vm(). Since the managed BO already has a devm
    cleanup action registered, this causes a double-free when devm
    unwinds during probe failure.
    
    Remove the explicit free and let devm handle it, consistent with
    all other xe_managed_bo_create_pin_map() callers.
    
    Fixes: 2e5d47fe7839 ("drm/xe/uc: Use managed bo for HuC and GSC objects")
    Reviewed-by: Daniele Ceraolo Spurio <daniele.ceraolospurio@intel.com>
    Assisted-by: Claude:claude-opus-4.6
    Link: https://patch.msgid.link/20260511154134.223696-1-shuicheng.lin@intel.com
    Signed-off-by: Shuicheng Lin <shuicheng.lin@intel.com>
    (cherry picked from commit 71d61e3e299a17139e47f980a4d6f425b2c59bf7)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/multi_queue: Fix secondary queue error case [+ + +]

Author: Niranjana Vishwanathapura <niranjana.vishwanathapura@intel.com>
Date:   Mon May 18 12:16:40 2026 -0700

    drm/xe/multi_queue: Fix secondary queue error case
    
    commit 00907da2126ed785451b2a2f0fef282246dad104 upstream.
    
    If xe_lrc_create() fails, the secondary queue added to the
    multi-queue group list is not removed before freeing the
    queue. Fix error path handling for secondary queues by
    removing it from the multi-queue group list at the right
    place.
    
    Reported-by: Sebastian Österlund <sebastian.osterlund@intel.com>
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/work_items/7979
    Fixes: d716a5088c88 ("drm/xe/multi_queue: Handle tearing down of a multi queue")
    Cc: stable@vger.kernel.org # v7.0+
    Signed-off-by: Niranjana Vishwanathapura <niranjana.vishwanathapura@intel.com>
    Reviewed-by: Matthew Auld <matthew.auld@intel.com>
    Link: https://patch.msgid.link/20260518191639.320890-2-niranjana.vishwanathapura@intel.com
    (cherry picked from commit d2d23c12789cf69eddc35b8d38cd8eaabd0168f1)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

drm/xe/oa: Fix exec_queue leak on width check in stream open [+ + +]

Author: Shuicheng Lin <shuicheng.lin@intel.com>
Date:   Thu May 14 20:32:10 2026 +0000

    drm/xe/oa: Fix exec_queue leak on width check in stream open
    
    [ Upstream commit 4d25342543c01310fc4e0cba7cb17c775e2421e2 ]
    
    In xe_oa_stream_open_ioctl(), when param.exec_q->width > 1 the
    function returns -EOPNOTSUPP directly, skipping the existing
    err_exec_q cleanup path. The exec_queue reference obtained by
    xe_exec_queue_lookup() is leaked.
    
    The exec queue holds a reference on the xe_file, which is only
    dropped during queue teardown. The leaked lookup ref is not on
    the file's exec_queue xarray, so file close cannot release it.
    This keeps both the exec queue and the file private state pinned
    indefinitely.
    
    Jump to err_exec_q instead of returning directly so the reference
    is released.
    
    Fixes: f0ed39830e60 ("xe/oa: Fix query mode of operation for OAR/OAC")
    Assisted-by: Claude:claude-opus-4.6
    Reviewed-by: Ashutosh Dixit <ashutosh.dixit@intel.com>
    Link: https://patch.msgid.link/20260514203210.593488-1-shuicheng.lin@intel.com
    Signed-off-by: Shuicheng Lin <shuicheng.lin@intel.com>
    (cherry picked from commit 339fa0be9e4a5d69fa47e91f4a36574224fb478f)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/pf: Fix CFI failure in debugfs access [+ + +]

Author: Mohanram Meenakshisundaram <mohanram.meenakshisundaram@intel.com>
Date:   Thu May 14 23:19:18 2026 +0530

    drm/xe/pf: Fix CFI failure in debugfs access
    
    [ Upstream commit 96bf49b526e2d03a2b7f6e861925a08f46ed0d28 ]
    
    Reading debugfs file (/sys/kernel/debug/dri/0/gt*/pf/adverse_events)
    with CFI (Control Flow Integrity) enabled, the kernel panics at
    xe_gt_debugfs_simple_show+0x82/0xc0.
    
    xe_gt_debugfs_simple_show() declare a function pointer expecting int
    return type, but xe_gt_sriov_pf_monitor_print_events() is void return
    type, leading to CFI failure and kernel panic.
    
    [507620.973657] CFI failure at xe_gt_debugfs_simple_show+0x82/0xc0 [xe]
    (target: xe_gt_sriov_pf_monitor_print_events+0x0/0x130 [xe]; expected
    type: 0xd72c7139)
    
    Fix xe_gt_sriov_pf_monitor_print_events() function by updating to return
    an int type.
    
    Fixes: 1c99d3d3edab ("drm/xe/pf: Expose PF monitor details via debugfs")
    Signed-off-by: Mohanram Meenakshisundaram <mohanram.meenakshisundaram@intel.com>
    Reviewed-by: Michal Wajdeczko <michal.wajdeczko@intel.com>
    Signed-off-by: Michal Wajdeczko <michal.wajdeczko@intel.com>
    Link: https://patch.msgid.link/20260514174918.1556357-2-mohanram.meenakshisundaram@intel.com
    (cherry picked from commit ff1d386a8359746d9699ac30336e3b0684c68958)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/tuning: Apply windower hardware filtering setting on Xe3 and Xe3p [+ + +]

Author: Matt Roper <matthew.d.roper@intel.com>
Date:   Tue Feb 24 15:50:56 2026 -0800

    drm/xe/tuning: Apply windower hardware filtering setting on Xe3 and Xe3p
    
    [ Upstream commit 8ccf5f6b2295164962bbee5b0770f4366fd9bee2 ]
    
    A recent bspec tuning guide update asks us to program
    COMMON_SLICE_CHICKEN4[5] on Xe3 and Xe3p platforms.  Add this setting to
    our LRC tuning RTP table so that the setting will become part of each
    context's LRC.
    
    Bspec: 72161, 55902
    Reviewed-by: Shuicheng Lin <shuicheng.lin@intel.com>
    Link: https://patch.msgid.link/20260224235055.3038710-2-matthew.d.roper@intel.com
    Signed-off-by: Matt Roper <matthew.d.roper@intel.com>
    Stable-dep-of: 6df5678b6a94 ("drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN4")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe/vf: Fix signature of print functions [+ + +]

Author: Michal Wajdeczko <michal.wajdeczko@intel.com>
Date:   Thu May 14 17:57:26 2026 +0200

    drm/xe/vf: Fix signature of print functions
    
    [ Upstream commit 9bb2f1d7e6e58b8e434ddc2048c661bf87ccdf2a ]
    
    We have plugged-in existing VF print functions into our GT debugfs
    show helper as-is, but we missed that the helper expects functions
    to return int, while they were defined as void. This can lead to
    errors being reported when CFI is enabled.
    
    Fixes: 63d8cb8fe3dd ("drm/xe/vf: Expose SR-IOV VF attributes to GT debugfs")
    Signed-off-by: Michal Wajdeczko <michal.wajdeczko@intel.com>
    Cc: Mohanram Meenakshisundaram <mohanram.meenakshisundaram@intel.com>
    Reviewed-by: Shuicheng Lin <shuicheng.lin@intel.com>
    Link: https://patch.msgid.link/20260514155726.7165-1-michal.wajdeczko@intel.com
    (cherry picked from commit 314e31c9a8a1c421ee4f7f755b9348aefbbca090)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe: Consolidate workaround entries for Wa_14019988906 [+ + +]

Author: Matt Roper <matthew.d.roper@intel.com>
Date:   Fri Feb 20 09:27:40 2026 -0800

    drm/xe: Consolidate workaround entries for Wa_14019988906
    
    [ Upstream commit c2142a1a841525d897ef69b3e6a5ab48183e1fcf ]
    
    Wa_14019988906 applies to all graphics versions from 20.01 through 20.04
    (inclusive).  Consolidate the RTP entries into a single range-based entry.
    
    Reviewed-by: Shuicheng Lin <shuicheng.lin@intel.com>
    Link: https://patch.msgid.link/20260220-forupstream-wa_cleanup-v2-18-b12005a05af6@intel.com
    Signed-off-by: Matt Roper <matthew.d.roper@intel.com>
    Stable-dep-of: a4660bd94973 ("drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN1")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe: Consolidate workaround entries for Wa_18033852989 [+ + +]

Author: Matt Roper <matthew.d.roper@intel.com>
Date:   Fri Feb 20 09:27:41 2026 -0800

    drm/xe: Consolidate workaround entries for Wa_18033852989
    
    [ Upstream commit fe681e7b44d78fd77d79de21eca58c3b6bdcda0e ]
    
    Wa_18033852989 applies to all graphics versions from 20.01 through 20.04
    (inclusive).  Consolidate the RTP entries into a single range-based entry.
    
    Reviewed-by: Shuicheng Lin <shuicheng.lin@intel.com>
    Link: https://patch.msgid.link/20260220-forupstream-wa_cleanup-v2-19-b12005a05af6@intel.com
    Signed-off-by: Matt Roper <matthew.d.roper@intel.com>
    Stable-dep-of: a4660bd94973 ("drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN1")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN1 [+ + +]

Author: Gustavo Sousa <gustavo.sousa@intel.com>
Date:   Thu May 14 18:44:45 2026 -0300

    drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN1
    
    [ Upstream commit a4660bd949733fd6ea621fdb50fabac2608155e9 ]
    
    The register COMMON_SLICE_CHICKEN1 is a MCR register on Xe2.
    Let's make sure to define a MCR version of it and use it for the
    relevant IP versions.
    
    Use XEHP_ as prefix for the register name, since it is MCR as of Xe_HP.
    
    Fixes: a5d221924e13 ("drm/xe/xe2_hpg: Add set of workarounds")
    Fixes: 9f18b55b6d3f ("drm/xe/xe2: Add workaround 18033852989")
    Bspec: 66534, 71185
    Reviewed-by: Matt Roper <matthew.d.roper@intel.com>
    Link: https://patch.msgid.link/20260514-rtp-mcr-check-v3-2-30dd47855fee@intel.com
    Signed-off-by: Gustavo Sousa <gustavo.sousa@intel.com>
    (cherry picked from commit a672725fdbfc3ea430130039d677c7dc98d59df8)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN4 [+ + +]

Author: Gustavo Sousa <gustavo.sousa@intel.com>
Date:   Thu May 14 18:44:46 2026 -0300

    drm/xe: Define and use MCR version of COMMON_SLICE_CHICKEN4
    
    [ Upstream commit 6df5678b6a94ac80e31e847074c4b30c21025b1f ]
    
    The register COMMON_SLICE_CHICKEN4 is a MCR register on both Xe2 and
    Xe3. Let's make sure to define a MCR version of it and use it for the
    relevant IP versions.
    
    Use XEHP_ as prefix for the register name, since it is MCR as of Xe_HP.
    
    v2:
      - Also change for one entry in lrc_tunnings, which was caught by
        manual testing and add corresponging Fixes tag in commit message.
        (Gustavo)
    
    Fixes: 8d6f16f1f082 ("drm/xe: Extend Wa_22021007897 to Xe3 platforms")
    Fixes: e5c13e2c505b ("drm/xe/xe2hpg: Add Wa_22021007897")
    Fixes: 8ccf5f6b2295 ("drm/xe/tuning: Apply windower hardware filtering setting on Xe3 and Xe3p")
    Bspec: 66534, 71185, 74417
    Reviewed-by: Matt Roper <matthew.d.roper@intel.com>
    Link: https://patch.msgid.link/20260514-rtp-mcr-check-v3-3-30dd47855fee@intel.com
    Signed-off-by: Gustavo Sousa <gustavo.sousa@intel.com>
    (cherry picked from commit 75f65f1a4c06da1d87f28570a9d4cdad28f13360)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

efi: Allocate runtime workqueue before ACPI init [+ + +]

Author: Ard Biesheuvel <ardb@kernel.org>
Date:   Tue May 19 10:03:00 2026 +0200

    efi: Allocate runtime workqueue before ACPI init
    
    commit 13c6da02e767152c9ac4330962247a5e47011035 upstream.
    
    Since commit
    
      5894cf571e14 ("acpi/prmt: Use EFI runtime sandbox to invoke PRM handlers")
    
    ACPI PRM calls are delegated to a workqueue which runs in a kernel
    thread, making it easier to detect and mitigate faulting memory accesses
    performed by the firmware.
    
    Rafael reports that such PRM accesses may occur before efisubsys_init()
    executes, which is where the workqueue is allocated, leading to NULL
    pointer dereferences. Since acpi_init() [which triggers the early PRM
    accesses] executes as a subsys_initcall() as well, and has its own
    dependencies that may be sensitive to initcall ordering, deferring
    acpi_init() is not an option.
    
    So instead, split off the workqueue allocation into its own postcore
    initcall, as this is the only missing piece to allow EFI runtime calls
    to be made. This ensures that EFI runtime call (including PRM calls) are
    accessible to all code running at subsys_initcall() level.
    
    Cc: <stable@vger.kernel.org>
    Fixes: 5894cf571e14 ("acpi/prmt: Use EFI runtime sandbox to invoke PRM handlers")
    Reviewed-by: Rafael J. Wysocki (Intel) <rafael@kernel.org>
    Signed-off-by: Ard Biesheuvel <ardb@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

erofs: fix managed cache race for unaligned extents [+ + +]

Author: Gao Xiang <xiang@kernel.org>
Date:   Tue Apr 28 12:34:31 2026 +0800

    erofs: fix managed cache race for unaligned extents
    
    [ Upstream commit 649932fc3815eda2f24eb4de4b3a5e94886ee0b9 ]
    
    After unaligned compressed extents were introduced, the following race
    could occur:
    
    [Thread 1]                                   [Thread 2]
    (z_erofs_fill_bio_vec)
    <handle a Z_EROFS_PREALLOCATED_FOLIO folio>
    ...
    filemap_add_folio (1)
                                                 (z_erofs_bind_cache)
                                                 <the same folio is found..>
                                                 ..
                                                 ..
    folio_attach_private (2)
                                                 filemap_add_folio (3) again
    
    Since (1) is executed but (2) hasn't been executed yet, it's possible
    that another thread finds the same managed folio in z_erofs_bind_cache()
    for a different pcluster and calls filemap_add_folio() again since
    folio->private is still Z_EROFS_PREALLOCATED_FOLIO.
    
    Fix this by explicitly clearing folio->private before making the folio
    visible in the managed cache so that another pcluster can simply wait
    on the locked managed folio as what we did for other shared cases [1].
    
    This only impacts unaligned data compression (`-E48bit` with zstd,
    for example).
    
    [1] Commit 9e2f9d34dd12 ("erofs: handle overlapped pclusters out of
     crafted images properly") was originally introduced to handle crafted
     overlapped extents, but it addresses unaligned extents as well.
    
    Fixes: 7361d1e3763b ("erofs: support unaligned encoded data")
    Reported-by: Arseniy Krasnov <avkrasnov@salutedevices.com>
    Closes: https://lore.kernel.org/r/4a2f3801-fac1-42fe-ae75-da315822e088@salutedevices.com
    Tested-by: Arseniy Krasnov <avkrasnov@salutedevices.com>
    Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

erofs: fix metabuf leak in inode xattr initialization [+ + +]

Author: Jia Zhu <zhujia.zj@bytedance.com>
Date:   Wed May 20 12:46:07 2026 +0800

    erofs: fix metabuf leak in inode xattr initialization
    
    [ Upstream commit 79b09c54c6563df9846ca3094bcfd72082c3e1d7 ]
    
    commit bb88e8da0025 ("erofs: use meta buffers for xattr operations")
    converted xattr operations to use on-stack erofs_buf instances.
    erofs_init_inode_xattrs() uses such a metabuf while reading the inline
    xattr header and shared xattr id array.
    
    Some error paths after erofs_read_metabuf() leave through out_unlock
    without dropping the metabuf, so the folio reference can leak.
    
    Consolidate the cleanup at out_unlock. erofs_put_metabuf() is a
    no-op if no folio has been acquired, and this keeps all paths after
    taking EROFS_I_BL_XATTR_BIT covered by a single cleanup site.
    
    Fixes: bb88e8da0025 ("erofs: use meta buffers for xattr operations")
    Signed-off-by: Jia Zhu <zhujia.zj@bytedance.com>
    Reviewed-by: Gao Xiang <hsiangkao@linux.alibaba.com>
    Fixes: bb88e8da0025 ("erofs: use meta buffers for xattr operations")
    Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

erofs: harden h_shared_count in erofs_init_inode_xattrs() [+ + +]

Author: Utkal Singh <singhutkal015@gmail.com>
Date:   Tue Mar 17 15:24:39 2026 +0000

    erofs: harden h_shared_count in erofs_init_inode_xattrs()
    
    [ Upstream commit 6a01f5478d208544c8ba5ddbd674ea660f1b7047 ]
    
    `u8 h_shared_count` indicates the shared xattr count of an inode. It is
    read from the on-disk xattr ibody header, which should be corrupted if
    the size of the shared xattr array exceeds the space available in
    `xattr_isize`.
    
    It does not cause harmful consequence (e.g. crashes), since the image is
    already considered corrupted, it indeed results in the silent processing
    of garbage metadata.
    
    Let's harden it to report -EFSCORRUPTED earlier.
    
    Signed-off-by: Utkal Singh <singhutkal015@gmail.com>
    Reviewed-by: Gao Xiang <hsiangkao@linux.alibaba.com>
    Reviewed-by: Chao Yu <chao@kernel.org>
    Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com>
    Stable-dep-of: 79b09c54c656 ("erofs: fix metabuf leak in inode xattr initialization")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ethtool: fix ethnl_bitmap32_not_zero() bit interval semantics [+ + +]

Author: Chenguang Zhao <zhaochenguang@kylinos.cn>
Date:   Mon May 11 09:43:43 2026 +0800

    ethtool: fix ethnl_bitmap32_not_zero() bit interval semantics
    
    [ Upstream commit 3d042592ebd4c7e44974d556de0b727cb7db4dab ]
    
    ethnl_bitmap32_not_zero() should return true if some bit in [start, end)
    is set:
    
    - Fix inverted memchr_inv() sense: return true when the scan finds a
      non-zero byte, not when the middle words are all zero.
    - Return false for an empty interval (end <= start).
    - When end is 32-bit aligned, indices in [start, end) do not include any
      bits from map[end_word]; return false after earlier checks found no
      non-zero data.
    
    Fixes: 10b518d4e6dd ("ethtool: netlink bitset handling")
    Signed-off-by: Chenguang Zhao <zhaochenguang@kylinos.cn>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Align RxTx buffer size before mapping [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:33 2026 +0100

    firmware: arm_ffa: Align RxTx buffer size before mapping
    
    [ Upstream commit 0399e3f872ca3d78044bb715a73ea645806d2c7b ]
    
    Commit 83210251fd70 ("firmware: arm_ffa: Use the correct buffer size during
    RXTX_MAP") advertises PAGE_ALIGN(rxtx_bufsz) to firmware when mapping the
    buffers but the driver continues to stores the minimum FF-A buffer size
    in drv_info->rxtx_bufsz which is used elsewhere in the driver.
    
    Align the size before storing it so that the allocation, validation and
    FFA_RXTX_MAP all use the same buffer size.
    
    Fixes: 83210251fd70 ("firmware: arm_ffa: Use the correct buffer size during RXTX_MAP")
    Cc: Sebastian Ene <sebastianene@google.com>
    Link: https://sashiko.dev/#/patchset/20260402113939.930221-1-sebastianene@google.com
    Reviewed-by: Sebastian Ene <sebastianene@google.com>
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-9-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Bound PARTITION_INFO_GET_REGS copies [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:30 2026 +0100

    firmware: arm_ffa: Bound PARTITION_INFO_GET_REGS copies
    
    [ Upstream commit 3974ea1938406f9bfa7c1f48d4e43533f447bb08 ]
    
    The register-based PARTITION_INFO_GET path trusted the firmware-provided
    indices when copying partition descriptors into the caller buffer.
    Reject inconsistent counts or index progressions so the copy loop cannot
    write past the allocated array.
    
    Fixes: ba85c644ac8d ("firmware: arm_ffa: Add support for FFA_PARTITION_INFO_GET_REGS")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-6-8595ae450034@kernel.org
    (fixed cur_idx when exactly one descriptor in the first fragment)
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Check for NULL FF-A ID table while driver registration [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:25 2026 +0100

    firmware: arm_ffa: Check for NULL FF-A ID table while driver registration
    
    [ Upstream commit 0a5e695095c557d2380131b613dea4e8d90371be ]
    
    The bus match callback assumes that every FF-A driver provides an
    id_table and dereferences it unconditionally. Enforce that contract at
    registration time so a buggy client driver cannot crash the bus during
    match.
    
    Fixes: 92743071464f ("firmware: arm_ffa: Ensure drivers provide a probe function")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-1-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Fix per-vcpu self notifications handling in workqueue [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:28 2026 +0100

    firmware: arm_ffa: Fix per-vcpu self notifications handling in workqueue
    
    [ Upstream commit 9985d5357ed93af0d1933969c247e966957730e1 ]
    
    Per-vcpu notification handling already runs from a per-cpu work item on
    the target cpu. Routing that path back through smp_call_function_single()
    re-enters the call-function IPI path and executes the notification
    handler with interrupts disabled. That makes the framework path unsafe,
    since it takes a mutex, allocates memory with GFP_KERNEL, and invokes
    client callbacks.
    
    Handle per-vcpu self notifications directly from the existing per-cpu
    work item instead. This keeps the per-vcpu path in task context and
    avoids the extra IPI hop entirely.
    
    Fixes: 3a3e2b83e805 ("firmware: arm_ffa: Avoid queuing work when running on the worker queue")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-4-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Fix sched-recv callback partition lookup [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:35 2026 +0100

    firmware: arm_ffa: Fix sched-recv callback partition lookup
    
    [ Upstream commit a6848a50404eefb6f0b131c21881a2d8d21b31a9 ]
    
    ffa_sched_recv_cb_update() used list_for_each_entry_safe() to search for
    a matching partition and then tested the iterator against NULL. That is
    not a valid end-of-list check for circular lists and can fall through
    with an invalid pointer. Use a normal iterator and detect the not-found
    case correctly before touching the partition state.
    
    Fixes: be61da938576 ("firmware: arm_ffa: Allow multiple UUIDs per partition to register SRI callback")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-11-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Keep framework RX release under lock [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:31 2026 +0100

    firmware: arm_ffa: Keep framework RX release under lock
    
    [ Upstream commit 2af18f8e36b277730527cacc2256b1332f56aa28 ]
    
    The framework notification handler drops rx_lock before issuing
    FFA_RX_RELEASE, leaving a window where another RX-buffer user can
    start a new FF-A transaction before ownership has actually been
    returned to firmware.
    
    Move the FFA_RX_RELEASE calls so they execute while rx_lock is still
    held on both the kmemdup() failure path and the normal success path.
    While doing that, switch the handler to scoped_guard() to keep the
    critical section explicit.
    
    Fixes: 285a5ea0f542 ("firmware: arm_ffa: Add support for handling framework notifications")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-7-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Skip free_pages on RX buffer alloc failure [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:26 2026 +0100

    firmware: arm_ffa: Skip free_pages on RX buffer alloc failure
    
    [ Upstream commit 09527e2c534911619d7e098729711100290bc3e1 ]
    
    If the RX buffer allocation fails in ffa_init(), the error path jumps to
    free_pages even though no buffer has been allocated yet. Route that case
    directly to free_drv_info so the cleanup path is only used after at
    least one RX/TX buffer allocation has succeeded.
    
    Fixes: 3bbfe9871005 ("firmware: arm_ffa: Add initial Arm FFA driver support")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-2-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Snapshot notifier callbacks under lock [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:34 2026 +0100

    firmware: arm_ffa: Snapshot notifier callbacks under lock
    
    [ Upstream commit 38290b180a4d5746baed796d49f88d56d2f336cd ]
    
    Both notification handlers currently look up a notifier callback under
    notify_lock, drop the lock, and then dereference the returned
    notifier entry. A concurrent unregister can delete and free that
    entry in the gap, leaving the handler to dereference stale memory.
    
    Copy the callback pointer and callback data while notify_lock is
    still held and invoke the callback only after the lock is dropped.
    This keeps the existing callback execution model while removing the
    use-after-free window in both the framework and non-framework
    notification paths.
    
    Fixes: 285a5ea0f542 ("firmware: arm_ffa: Add support for handling framework notifications")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-10-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Unregister bus notifier on teardown for FF-A v1.0 [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:29 2026 +0100

    firmware: arm_ffa: Unregister bus notifier on teardown for FF-A v1.0
    
    [ Upstream commit 6d3daa9b8d313f42d52e75590310f26a29b61b44 ]
    
    For FF-A v1.0 the driver registers a bus notifier to backfill UUID
    matching, but the notifier was never unregistered on cleanup paths.
    Track the registration state and unregister it during teardown and early
    partition-setup failure.
    
    Fixes: 9dd15934f60d ("firmware: arm_ffa: Move the FF-A v1.0 NULL UUID workaround to bus notifier")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-5-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

firmware: arm_ffa: Validate framework notification message layout [+ + +]

Author: Sudeep Holla <sudeep.holla@kernel.org>
Date:   Tue Apr 28 19:33:32 2026 +0100

    firmware: arm_ffa: Validate framework notification message layout
    
    [ Upstream commit 4a1cc9e96b311d2609a6f963a5e35bd4ae730d97 ]
    
    Framework notifications carry an indirect message in the shared RX
    buffer. Validate the reported offset and size before using them, reject
    zero-length payloads, and ensure that any non-header payload starts at
    the UUID field rather than in the middle of the message header.
    
    Use the validated offset and size values for both kmemdup() and the UUID
    parsing path so malformed firmware data cannot drive an out-of-bounds
    read or an oversized allocation.
    
    Fixes: 285a5ea0f542 ("firmware: arm_ffa: Add support for handling framework notifications")
    Link: https://patch.msgid.link/20260428-ffa_fixes-v2-8-8595ae450034@kernel.org
    Signed-off-by: Sudeep Holla <sudeep.holla@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fprobe: Fix unregister_fprobe() to wait for RCU grace period [+ + +]

Author: Masami Hiramatsu (Google) <mhiramat@kernel.org>
Date:   Thu May 7 16:46:29 2026 +0900

    fprobe: Fix unregister_fprobe() to wait for RCU grace period
    
    [ Upstream commit 657b594b2084b39a4bc6d8493aa2140cb00cea49 ]
    
    Commit 4346ba1604093 ("fprobe: Rewrite fprobe on function-graph tracer")
    changed fprobe to register struct fprobe to an rcu-hlist, but it forgot
    to wait for RCU GP. Thus there can be use-after-free if the fprobe is
    released right after unregistering. This can be happened on fprobe
    event and sample module code.
    
    To fix this issue, add synchronize_rcu() in unregister_fprobe().
    
    Note that BPF is OK because fprobe is used as a part of
    bpf_kprobe_multi_link. This unregisters its fprobe in
    bpf_kprobe_multi_link_release() and it is deallocated via
    bpf_kprobe_multi_link_dealloc(), which is invoked from
    bpf_link_defer_dealloc_rcu_gp() RCU callback.
    
    For BPF, this also introduced unregister_fprobe_async() which does
    NOT wait for RCU grace priod.
    
    Link: https://lore.kernel.org/all/177813998919.256460.2809243930741138224.stgit@mhiramat.tok.corp.google.com/
    
    Fixes: 4346ba1604093 ("fprobe: Rewrite fprobe on function-graph tracer")
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs/statmount: fix slab out-of-bounds write in statmount_mnt_idmap [+ + +]

Author: Junyoung Jang <graypanda.inzag@gmail.com>
Date:   Mon May 4 20:26:49 2026 +0900

    fs/statmount: fix slab out-of-bounds write in statmount_mnt_idmap
    
    [ Upstream commit a3bf0f28d4ba16e1f35f8c983bb04426b87e2a78 ]
    
    statmount_mnt_idmap() writes one mapping with seq_printf() and then
    manually advances seq->count to include the NUL separator.
    
    If seq_printf() overflows, seq_set_overflow() sets seq->count to
    seq->size. The manual seq->count++ changes this to seq->size + 1.
    seq_has_overflowed() then no longer detects the overflow. The corrupted
    count returns to statmount_string(), which later executes:
    
        seq->buf[seq->count++] = '\0';
    
    This causes a 1-byte NULL out-of-bounds write on the dynamically
    allocated seq buffer.
    
    Fix this by checking for overflow immediately after seq_printf().
    
    Fixes: 37c4a9590e1e ("statmount: allow to retrieve idmappings")
    Signed-off-by: Junyoung Jang <graypanda.inzag@gmail.com>
    Link: https://patch.msgid.link/20260504112649.1862936-1-graypanda.inzag@gmail.com
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs: fix forced iversion increment on lazytime timestamp updates [+ + +]

Author: Pankaj Raghav <p.raghav@samsung.com>
Date:   Mon May 11 13:19:18 2026 +0200

    fs: fix forced iversion increment on lazytime timestamp updates
    
    [ Upstream commit 834e98acb748025c04fed3cac9c8954454f4b520 ]
    
    When updating timestamps with lazytime enabled, if only I_DIRTY_TIME is
    set (pure lazytime update), inode_maybe_inc_iversion() should not be
    forced to increment i_version. The force parameter should only be true
    when actual data or metadata changes require an iversion bump.
    
    The current code uses "!!dirty" which evaluates to true whenever dirty
    has any bits set, including the I_DIRTY_TIME bit alone. This forces an
    iversion increment on every lazytime timestamp update, which then sets
    I_DIRTY_SYNC, triggering expensive log flushes on subsequent fdatasync
    calls. Andres reported this issue when he noticed a perf regression[1].
    
    Fix this by using "dirty != I_DIRTY_TIME" as the force parameter. This
    passes false for pure lazytime updates (allowing the I_VERSION_QUERIED
    optimization to work), while still forcing the increment when dirty
    contains other flags indicating real changes that require iversion
    updates.
    
    [1] https://lore.kernel.org/linux-xfs/7ys6erh3nnyeerv2nybyfvp7dmaknuxrlxv74wx56ocdothkc6@ekfiadtkfn2r/
    
    Fixes: 85c871a02b03 ("fs: add support for non-blocking timestamp updates")
    Signed-off-by: Pankaj Raghav <p.raghav@samsung.com>
    Link: https://patch.msgid.link/20260511111918.1793689-1-p.raghav@samsung.com
    Reviewed-by: Jeff Layton <jlayton@kernel.org>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Reviewed-by: Carlos Maiolino <cmaiolino@redhat.com>
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fs: Fix return in jfs_mkdir and orangefs_mkdir [+ + +]

Author: Hongling Zeng <zenghongling@kylinos.cn>
Date:   Fri May 1 15:10:58 2026 +0800

    fs: Fix return in jfs_mkdir and orangefs_mkdir
    
    [ Upstream commit a7cf1da7ac016490d6a1106f2aa6b602d34e9a12 ]
    
    Return NULL instead of passing to ERR_PTR while err is zero
    Fixes these smatch warnings:
      - fs/jfs/namei.c:311 jfs_mkdir() warn: passing zero to 'ERR_PTR'
      - fs/orangefs/namei.c:369 orangefs_mkdir() warn: passing zero
        to 'ERR_PTR'
    
    Fixes: 88d5baf69082 ("Change inode_operations.mkdir to return struct dentry *")
    Signed-off-by: Hongling Zeng <zenghongling@kylinos.cn>
    Link: https://patch.msgid.link/20260501071058.1243245-1-zenghongling@kylinos.cn
    Reviewed-by: Jori Koolstra <jkoolstra@xs4all.nl>
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

fwctl: pds: Validate RPC input size before parsing [+ + +]

Author: Heechan Kang <gganji11@naver.com>
Date:   Sun May 17 15:22:32 2026 +0900

    fwctl: pds: Validate RPC input size before parsing
    
    commit e7537735028c3ad4b0bfc02ff8fa2a1a28aa04fe upstream.
    
    The fwctl core allocates the device-specific RPC input buffer with
    fwctl_rpc.in_len and passes that buffer to the driver callback.
    
    pdsfc_fw_rpc() casts the buffer to struct fwctl_rpc_pds and then calls
    pdsfc_validate_rpc(), which reads fields from that structure before
    checking that the input buffer is large enough to contain it. A short
    in_len can make pds_fwctl read beyond the allocation.
    
    Reject pds RPC buffers that are smaller than struct fwctl_rpc_pds before
    parsing any pds-specific fields.
    
    Fixes: 92c66ee829b9 ("pds_fwctl: add rpc and query support")
    Link: https://patch.msgid.link/r/20260517062232.1858747-1-gganji11@naver.com
    Cc: stable@vger.kernel.org # v6.15+
    Signed-off-by: Heechan Kang <gganji11@naver.com>
    Reviewed-by: Dave Jiang <dave.jiang@intel.com>
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

gcc-plugins: Always define CONST_CAST_GIMPLE and CONST_CAST_TREE [+ + +]

Author: Kees Cook <kees@kernel.org>
Date:   Sat Mar 14 14:24:56 2026 +0100

    gcc-plugins: Always define CONST_CAST_GIMPLE and CONST_CAST_TREE
    
    [ Upstream commit 905c559e51497b8bfdbb68df8be56d2f70f0de8e ]
    
    For gcc-16, the CONST_CAST macro family was removed. Add back what
    we were using in gcc-common.h, as they are simple wrappers.
    
    See GCC commits:
      c3d96ff9e916c02584aa081f03ab999292efbb50
      458c7926d48959abcb2c1adaa22458e27459a551
    
    Suggested-by: Ingo Saitz <ingo@hannover.ccc.de>
    Link: https://lore.kernel.org/lkml/ab6OKoay0OWkywjK@spatz.zoo
    Fixes: 6b90bd4ba40b ("GCC plugin infrastructure")
    Tested-by: Ivan Bulatovic <combuster@archlinux.us>
    Tested-by: Christopher Cradock <christopher@cradock.myzen.co.uk>
    Signed-off-by: Kees Cook <kees@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: aggregator: fix a potential use-after-free [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Wed May 20 10:49:11 2026 +0200

    gpio: aggregator: fix a potential use-after-free
    
    [ Upstream commit 30c073cab97afb31901f94de9605177b6b84367e ]
    
    On error we free aggr->lookups->dev_id before removing the entry from
    the lookup table. If a concurrent thread calls gpiod_find() before we
    remove the entry, it could iterate over the list and call
    gpiod_match_lookup_table() which unconditionally dereferences dev_id
    when calling strcmp(). Reverse the order of cleanup.
    
    Fixes: 86f162e73d2d ("gpio: aggregator: introduce basic configfs interface")
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/20260520084911.27938-1-bartosz.golaszewski@oss.qualcomm.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: aggregator: lock device when calling device_is_bound() [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Mon May 18 11:53:18 2026 +0200

    gpio: aggregator: lock device when calling device_is_bound()
    
    [ Upstream commit 598a2b3e2e0e6aa2e9f7843c96c45b5ea11e0411 ]
    
    The kerneldoc for device_is_bound() says it must be called with the
    device lock taken. Add missing synchronization to this driver.
    
    Fixes: 3a27f40b4570 ("gpio: aggregator: stop using dev-sync-probe")
    Link: https://patch.msgid.link/20260518-gpio-dev-lock-v1-2-cc4736f3ff0b@oss.qualcomm.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: aggregator: remove the software node when deactivating the aggregator [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Wed May 20 14:16:31 2026 +0200

    gpio: aggregator: remove the software node when deactivating the aggregator
    
    [ Upstream commit 61fef83f239ecace1cce716135762a2d9b7b1fc6 ]
    
    The dynamic software node we create for the aggregator platform device
    when using configfs is leaked when the device is deactivated. Destroy it
    as the last step in the tear-down path.
    
    Fixes: 86f162e73d2d ("gpio: aggregator: introduce basic configfs interface")
    Reported-by: Geert Uytterhoeven <geert@linux-m68k.org>
    Closes: https://lore.kernel.org/all/CAMuHMdVZ=XUvJTGdDAjnkxgtw7Uvnn61iOy3XN_5XNZM2anctw@mail.gmail.com/
    Link: https://patch.msgid.link/20260520121631.33976-1-bartosz.golaszewski@oss.qualcomm.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: aggregator: stop using dev-sync-probe [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Fri Mar 27 11:31:12 2026 +0100

    gpio: aggregator: stop using dev-sync-probe
    
    [ Upstream commit 3a27f40b457053e6112a63d14590e4a3ff553b44 ]
    
    dev-err-probe is an overengineered solution to a simple problem. Use a
    combination of wait_for_probe() and device_is_bound() to synchronously
    wait for the platform device to probe.
    
    Reviewed-by: Linus Walleij <linusw@kernel.org>
    Link: https://patch.msgid.link/20260327-gpio-kill-dev-sync-probe-v1-2-efac254f1a1d@oss.qualcomm.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Stable-dep-of: 61fef83f239e ("gpio: aggregator: remove the software node when deactivating the aggregator")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

gpio: cdev: check if uAPI v2 config attributes are correctly zeroed [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Thu May 21 10:42:16 2026 +0200

    gpio: cdev: check if uAPI v2 config attributes are correctly zeroed
    
    [ Upstream commit 3e6ccd790ed69bedd3d9626d01dd35cf9821c121 ]
    
    We check the padding of other uAPI v2 structures but not that of line
    config attributes. For used attributes: check if their padding is
    zeroed, for unused: check if the entire structure is zeroed.
    
    Fixes: 3c0d9c635ae2 ("gpiolib: cdev: support GPIO_V2_GET_LINE_IOCTL and GPIO_V2_LINE_GET_VALUES_IOCTL")
    Reviewed-by: Kent Gibson <warthog618@gmail.com>
    Link: https://patch.msgid.link/20260521-gpio-cdev-attr-padding-check-v3-1-ec3bcbe2e358@oss.qualcomm.com
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

HID: intel-thc-hid: Intel-quickspi: Fix some error codes [+ + +]

Author: Dan Carpenter <error27@gmail.com>
Date:   Thu Apr 23 10:10:02 2026 +0300

    HID: intel-thc-hid: Intel-quickspi: Fix some error codes
    
    [ Upstream commit ae4ac077332ea3341a0f4c0973556c6b7ac5b7a1 ]
    
    If we have a partial read that is supposed to be treated as failure but
    in this code we forgot to set the error code.  Return -EINVAL.
    
    Fixes: 9d8d51735a3a ("HID: intel-thc-hid: intel-quickspi: Add HIDSPI protocol implementation")
    Signed-off-by: Dan Carpenter <error27@gmail.com>
    Reviewed-by: Even Xu <even.xu@intel.com>
    Reviewed-by: Mark Pearson <mpearson-lenovo@squebb.ca>
    Signed-off-by: Jiri Kosina <jkosina@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

HID: quirks: really enable the intended work around for appledisplay [+ + +]

Author: Lukas Bulwahn <lukas.bulwahn@redhat.com>
Date:   Thu Feb 5 09:11:31 2026 +0100

    HID: quirks: really enable the intended work around for appledisplay
    
    [ Upstream commit 5f90dcfa8dc32a488581b78e575cdd7808ba5c78 ]
    
    Commit c7fabe4ad921 ("HID: quirks: work around VID/PID conflict for
    appledisplay") intends to add a quirk for kernels built with Apple Cinema
    Display support, but it refers to the non-existing config option
    CONFIG_APPLEDISPLAY, whereas the config option for Apple Cinema Display
    support is named CONFIG_USB_APPLEDISPLAY.
    
    Refer to the intended config option CONFIG_USB_APPLEDISPLAY in the ifdef
    directive.
    
    Fixes: c7fabe4ad921 ("HID: quirks: work around VID/PID conflict for appledisplay")
    Signed-off-by: Lukas Bulwahn <lukas.bulwahn@redhat.com>
    Signed-off-by: Jiri Kosina <jkosina@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

HID: uclogic: Fix regression of input name assignment [+ + +]

Author: Takashi Iwai <tiwai@suse.de>
Date:   Tue Apr 28 10:33:16 2026 +0200

    HID: uclogic: Fix regression of input name assignment
    
    [ Upstream commit 487359284509a6745e14b8c0518768bc277809b0 ]
    
    The previous fix for adding the devm_kasprintf() return check in the
    commit bd07f751208b ("HID: uclogic: Add NULL check in
    uclogic_input_configured()") changed the condition of hi->input->name
    assignment, and it resulted in missing the proper input device name
    when no custom suffix is defined.
    
    Restore the conditional to the original content to address the
    regression.
    
    Fixes: bd07f751208b ("HID: uclogic: Add NULL check in uclogic_input_configured()")
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Jiri Kosina <jkosina@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hwmon: (lm90) Add lock protection to lm90_alert [+ + +]

Author: Guenter Roeck <linux@roeck-us.net>
Date:   Thu May 14 14:41:00 2026 -0700

    hwmon: (lm90) Add lock protection to lm90_alert
    
    [ Upstream commit 873e919e3101063a7a75989510ccfc125a4391cf ]
    
    Sashiko reports:
    
    lm90_alert() executes in the smbus alert context and calls
    lm90_update_confreg() to disable the hardware alert line, without
    acquiring hwmon_lock.
    
    Concurrently, sysfs write operations (such as lm90_write_convrate) hold
    the hwmon_lock, temporarily modify data->config, and then restore it.
    
    If an alert interrupt occurs concurrently with a sysfs write, the sysfs
    path will overwrite the alert handler's modifications to data->config
    and the hardware register.
    
    This unintentionally re-enables the hardware alert line while the alarm is
    still active, causing an interrupt storm.
    
    Add the missing lock to lm90_alert() to solve the problem.
    
    Fixes: 7a1d220ccb0cc ("hwmon: (lm90) Introduce function to update configuration register")
    Reported-by: Sashiko <sashiko-bot@kernel.org>
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hwmon: (lm90) Stop work before releasing hwmon device [+ + +]

Author: Guenter Roeck <linux@roeck-us.net>
Date:   Thu May 14 14:31:49 2026 -0700

    hwmon: (lm90) Stop work before releasing hwmon device
    
    [ Upstream commit b09a45601094c7f4ec4db8090b825fa61e169d93 ]
    
    Sashiko reports:
    
    In lm90_probe(), the devm action to cancel the alert_work and report_work
    (lm90_restore_conf) is registered in lm90_init_client() before
    devm_hwmon_device_register_with_info() is called.
    
    Because devm executes cleanup actions in reverse order during module
    unbind or probe failure, the hwmon device is unregistered and freed first.
    
    If lm90_alert_work() or lm90_report_alarms() runs in the window between
    the hwmon device being freed and the delayed works being cancelled,
    lm90_update_alarms() will dereference the freed data->hwmon_dev here.
    
    Fix the problem by canceling the workers separately after registering
    the hwmon device and before registering the interrupt handler. This ensures
    that the workers are canceled after interrupts are disabled and before
    the hwmon device is released. Add "shutdown" flag to indicate that device
    shutdown is in progress to prevent workers from being re-armed.
    
    Fixes: f6d0775119fb9 ("hwmon: (lm90) Rework alarm/status handling")
    Reported-by: Sashiko <sashiko-bot@kernel.org>
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

hwmon: (pmbus/adm1266) bounce blackbox records through a protocol-sized buffer [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Fri May 15 15:11:51 2026 -0700

    hwmon: (pmbus/adm1266) bounce blackbox records through a protocol-sized buffer
    
    commit 43cae21424ff8e33894a0f86c6b80b840c049fd7 upstream.
    
    adm1266_pmbus_block_xfer() copies the device-supplied block payload
    into the caller-provided buffer using the device-supplied length:
    
            memcpy(data_r, &msgs[1].buf[1], msgs[1].buf[0]);
    
    The helper does not know how large data_r is and trusts the device to
    return at most one record's worth of bytes.  adm1266_nvmem_read_blackbox()
    violates that contract: it advances read_buff inside data->dev_mem in
    ADM1266_BLACKBOX_SIZE (64-byte) strides while the helper is willing to
    write up to ADM1266_PMBUS_BLOCK_MAX (255) bytes.  A device that returns
    more than 64 bytes on the trailing record (read_buff offset 1984 in
    the 2048-byte dev_mem allocation) overflows dev_mem by up to 191 bytes
    before the post-call
    
            if (ret != ADM1266_BLACKBOX_SIZE)
                    return -EIO;
    
    can reject the response.
    
    Contain the fix in the caller without changing the helper signature:
    read each record into a 255-byte local bounce buffer that matches the
    helper's maximum output, validate the returned length, and only then
    copy exactly ADM1266_BLACKBOX_SIZE bytes into the dev_mem slot.
    
    Fixes: 407dc802a9c0 ("hwmon: (pmbus/adm1266) Add Block process call")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Link: https://lore.kernel.org/r/20260515-adm1266-fixes-v1-5-1c1ea1349cfe@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) cap PDIO scan in get_multiple at ADM1266_PDIO_NR [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Mon May 18 17:52:25 2026 -0700

    hwmon: (pmbus/adm1266) cap PDIO scan in get_multiple at ADM1266_PDIO_NR
    
    commit d7834d92251baade796812876e95555e2066fa9f upstream.
    
    adm1266_gpio_get_multiple() iterates the PDIO portion of the
    caller-supplied mask using
    
            for_each_set_bit_from(gpio_nr, mask,
                                  ADM1266_GPIO_NR + ADM1266_PDIO_STATUS) {
                    ...
            }
    
    where ADM1266_PDIO_STATUS is the PMBus command code (0xE9, i.e. 233),
    not the number of PDIO pins.  The intended upper bound is
    ADM1266_GPIO_NR + ADM1266_PDIO_NR = 25.
    
    gpiolib hands in a mask sized for gc.ngpio (= 25 bits on this chip),
    so the iteration walks find_next_bit() up to 242, reading up to 217
    extra bits (a handful of unsigned-long words: four on 64-bit, seven
    on 32-bit) of whatever lives past the end of the mask in the
    caller's stack.  Any incidental set bit in that range then drives a
    set_bit(gpio_nr, bits) call that writes past the end of the
    caller-supplied bits array too -- both out-of-bounds.
    
    Substitute ADM1266_PDIO_NR for the constant so the scan stops at the
    last real PDIO bit.
    
    Fixes: d98dfad35c38 ("hwmon: (pmbus/adm1266) Add support for GPIOs")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Reviewed-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Reviewed-by: Linus Walleij <linusw@kernel.org>
    Link: https://lore.kernel.org/r/20260518-adm1266-gpio-fixes-v3-1-e425e4f88139@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) don't clobber GPIO bits before PDIO read in get_multiple [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Mon May 18 17:52:26 2026 -0700

    hwmon: (pmbus/adm1266) don't clobber GPIO bits before PDIO read in get_multiple
    
    commit 3327a12aee9e10ffa903e28b8445dfd1af5307c0 upstream.
    
    adm1266_gpio_get_multiple() zeroes *bits before the GPIO_STATUS loop
    and then a second time before the PDIO_STATUS loop:
    
            *bits = 0;
            for_each_set_bit(gpio_nr, mask, ADM1266_GPIO_NR) {
                    ...
                    set_bit(gpio_nr, bits);
            }
    
            ret = i2c_smbus_read_block_data(data->client, ADM1266_PDIO_STATUS, ...);
            ...
            *bits = 0;
            for_each_set_bit_from(gpio_nr, mask, ADM1266_GPIO_NR + ADM1266_PDIO_NR) {
                    ...
                    set_bit(gpio_nr, bits);
            }
    
    The second *bits = 0 throws away every GPIO bit the first loop just
    populated, so callers asking for any combination of GPIO and PDIO
    pins always see the GPIO portion of the returned bits as zero.
    
    Drop the redundant second assignment so both halves of the result
    survive.
    
    Fixes: d98dfad35c38 ("hwmon: (pmbus/adm1266) Add support for GPIOs")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Reviewed-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Reviewed-by: Linus Walleij <linusw@kernel.org>
    Link: https://lore.kernel.org/r/20260518-adm1266-gpio-fixes-v3-2-e425e4f88139@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) include PEC byte in pmbus_block_xfer read buffer [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Fri May 15 15:11:50 2026 -0700

    hwmon: (pmbus/adm1266) include PEC byte in pmbus_block_xfer read buffer
    
    commit 487566cb1ccdf3756fdd7bf8d875e612ff3169bb upstream.
    
    adm1266_pmbus_block_xfer() sets up the read transaction with
    
            .buf = data->read_buf,
            .len = ADM1266_PMBUS_BLOCK_MAX + 2,
    
    but read_buf in struct adm1266_data is declared as
    
            u8 read_buf[ADM1266_PMBUS_BLOCK_MAX + 1];
    
    For a max-length block response (length byte = 255 + up to 1 PEC
    byte), the i2c controller is told to write 257 bytes into a 256-byte
    buffer, putting one byte past the end of read_buf.  The same response
    also makes the subsequent PEC compare
    
            if (crc != msgs[1].buf[msgs[1].buf[0] + 1])
    
    read a byte beyond the array.
    
    Bump the read_buf declaration to ADM1266_PMBUS_BLOCK_MAX + 2 so the
    buffer can hold the length byte, up to 255 payload bytes, and the PEC
    byte the i2c_msg length already accounts for.
    
    Fixes: 407dc802a9c0 ("hwmon: (pmbus/adm1266) Add Block process call")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Link: https://lore.kernel.org/r/20260515-adm1266-fixes-v1-4-1c1ea1349cfe@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) register the gpio_chip after pmbus_do_probe() [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Mon May 18 17:52:28 2026 -0700

    hwmon: (pmbus/adm1266) register the gpio_chip after pmbus_do_probe()
    
    commit 491403b9b76cf66abd81301c5901aa4a4549f1e8 upstream.
    
    adm1266_probe() calls adm1266_config_gpio() -- which goes on to
    devm_gpiochip_add_data() and exposes the gpio_chip callbacks to
    gpiolib -- before pmbus_do_probe() has initialised the per-client
    PMBus state (notably the pmbus_lock mutex the core hands out via
    pmbus_get_data()).
    
    That ordering is already a latent hazard: any GPIO access that lands
    between adm1266_config_gpio() and the end of pmbus_do_probe() (for
    example a sysfs read from a user space agent that opens the gpiochip
    the instant gpiolib advertises it) races pmbus_do_probe()'s own
    device accesses with no serialisation.
    
    Move adm1266_config_gpio() down past pmbus_do_probe() so the chip
    isn't reachable from userspace until the PMBus state it depends on
    is fully initialised.
    
    Fixes: d98dfad35c38 ("hwmon: (pmbus/adm1266) Add support for GPIOs")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Reviewed-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20260518-adm1266-gpio-fixes-v3-4-e425e4f88139@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) register the nvmem device after pmbus_do_probe() [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Mon May 18 17:52:29 2026 -0700

    hwmon: (pmbus/adm1266) register the nvmem device after pmbus_do_probe()
    
    commit 6af713af91d5c34ec049eb3cc2c5b3f5eba953b8 upstream.
    
    adm1266_probe() calls adm1266_config_nvmem() -- which goes on to
    devm_nvmem_register() and exposes adm1266_nvmem_read() to userspace --
    before pmbus_do_probe() has initialised the per-client PMBus state.
    
    Same latent hazard as the gpio_chip one fixed in the previous patch:
    once the nvmem device is registered, gpiolib's nvmem char-dev / sysfs
    interface is reachable, and any concurrent read triggers
    adm1266_nvmem_read() -> adm1266_nvmem_read_blackbox(), which issues
    PMBus traffic that races pmbus_do_probe()'s own device accesses with
    no serialisation.
    
    Move adm1266_config_nvmem() down past pmbus_do_probe() so the nvmem
    device isn't reachable from userspace until the PMBus state the
    nvmem accessors depend on is fully initialised.
    
    Fixes: 15609d189302 ("hwmon: (pmbus/adm1266) read blackbox")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Link: https://lore.kernel.org/r/20260518-adm1266-gpio-fixes-v3-5-e425e4f88139@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) reject implausible blackbox record_count [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Fri May 15 15:11:49 2026 -0700

    hwmon: (pmbus/adm1266) reject implausible blackbox record_count
    
    commit 4afca954622d672ea65ed961bed01cf91caa034e upstream.
    
    adm1266_nvmem_read_blackbox() loops over a record_count that comes
    straight from byte 3 of the BLACKBOX_INFO response.  The destination
    buffer is data->dev_mem, sized for the nvmem cell's declared 2048
    bytes (ADM1266_BLACKBOX_MAX_RECORDS * ADM1266_BLACKBOX_SIZE = 32 * 64).
    A device that reports a record_count greater than 32 -- whether due
    to firmware bugs, bus corruption, or a non-responsive slave returning
    0xff -- would walk read_buff past the end of the dev_mem allocation
    on the trailing iterations.
    
    Cap record_count at ADM1266_BLACKBOX_MAX_RECORDS (introduced here)
    before entering the loop and return -EIO on any larger value, so a
    malformed BLACKBOX_INFO response cannot drive the loop out of bounds.
    
    Fixes: 15609d189302 ("hwmon: (pmbus/adm1266) read blackbox")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Link: https://lore.kernel.org/r/20260515-adm1266-fixes-v1-3-1c1ea1349cfe@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) reject short block-read responses in the GPIO accessors [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Mon May 18 17:52:27 2026 -0700

    hwmon: (pmbus/adm1266) reject short block-read responses in the GPIO accessors
    
    commit a7232f68c43ca62f545049b7f5fbfc75137b843b upstream.
    
    adm1266_gpio_get() and adm1266_gpio_get_multiple() both compose the
    pin-status word as
    
            pins_status = read_buf[0] + (read_buf[1] << 8);
    
    right after i2c_smbus_read_block_data(), guarding only against an
    error return.  A well-behaved device returns 2 bytes for
    GPIO_STATUS/PDIO_STATUS, but the helper happily reports a 0- or
    1-byte response too.  If the device returns 0 bytes, both read_buf
    slots are uninitialized stack memory; if it returns 1 byte, read_buf[1]
    is.
    
    The composed value then flows through set_bit() into the caller's
    *bits in adm1266_gpio_get_multiple(), or into the return value of
    adm1266_gpio_get(), and ends up in userspace via gpiolib (sysfs and
    the char-dev ioctls).  That leaks a few bits of kernel stack per
    request on any device whose firmware glitch, bus error, or hostile
    slave produces a short block-read response.
    
    Add the missing length check to both call sites and surface a short
    response as -EIO.
    
    Fixes: d98dfad35c38 ("hwmon: (pmbus/adm1266) Add support for GPIOs")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Reviewed-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Link: https://lore.kernel.org/r/20260518-adm1266-gpio-fixes-v3-3-e425e4f88139@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) seed timestamp from the real-time clock [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Fri May 15 15:11:47 2026 -0700

    hwmon: (pmbus/adm1266) seed timestamp from the real-time clock
    
    commit b86095e3d7dcf2bf80c747349a35912a87a85098 upstream.
    
    adm1266_set_rtc() seeds the chip's SET_RTC register from
    ktime_get_seconds(), which returns CLOCK_MONOTONIC -- i.e. seconds
    since the host last booted, not seconds since the Unix epoch.
    
    The chip stamps that value into every blackbox record it captures.
    Userspace reading those timestamps back expects wall-clock seconds:
    that's what the SET_RTC frame layout documents (datasheet Rev. D,
    Table 84) and what every other consumer of "seconds since epoch"
    assumes.  Seeding from CLOCK_MONOTONIC gives blackbox records a
    timestamp that is only meaningful within a single boot of the host
    and silently resets to small values on every reboot.
    
    Switch to ktime_get_real_seconds() so the seed matches what the
    register is documented to hold.
    
    Fixes: 15609d189302 ("hwmon: (pmbus/adm1266) read blackbox")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Link: https://lore.kernel.org/r/20260515-adm1266-fixes-v1-1-1c1ea1349cfe@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

hwmon: (pmbus/adm1266) widen blackbox-info buffer to I2C_SMBUS_BLOCK_MAX [+ + +]

Author: Abdurrahman Hussain <abdurrahman@nexthop.ai>
Date:   Fri May 15 15:11:48 2026 -0700

    hwmon: (pmbus/adm1266) widen blackbox-info buffer to I2C_SMBUS_BLOCK_MAX
    
    commit eee213daa1e1b402eb631bcd1b8c5aa340a6b081 upstream.
    
    adm1266_nvmem_read_blackbox() declares a 5-byte stack buffer and
    passes it to i2c_smbus_read_block_data() to retrieve the 4-byte
    BLACKBOX_INFO response.  i2c_smbus_read_block_data() does not honour
    caller buffer sizes -- it memcpy()s data.block[0] bytes from the
    SMBus transaction (where data.block[0] is the length byte returned by
    the slave device, up to I2C_SMBUS_BLOCK_MAX = 32):
    
            memcpy(values, &data.block[1], data.block[0]);
    
    If the device returns any block length above 5, the call overflows
    the caller's 5-byte stack buffer before the post-call
    
            if (ret != 4)
                    return -EIO;
    
    check has a chance to reject the response.
    
    Widen the local buffer to I2C_SMBUS_BLOCK_MAX so the helper has room
    for any well-formed SMBus block response, matching the convention used
    by the other i2c_smbus_read_block_data() callers in this driver.
    
    Fixes: 15609d189302 ("hwmon: (pmbus/adm1266) read blackbox")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdurrahman Hussain <abdurrahman@nexthop.ai>
    Link: https://lore.kernel.org/r/20260515-adm1266-fixes-v1-2-1c1ea1349cfe@nexthop.ai
    Signed-off-by: Guenter Roeck <linux@roeck-us.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

i2c: tegra: fix pm_runtime leak on mutex_lock failure [+ + +]

Author: Saurav Sachidanand <sauravsc@amazon.com>
Date:   Thu May 7 22:11:44 2026 +0000

    i2c: tegra: fix pm_runtime leak on mutex_lock failure
    
    commit 57cf4e8d6a57dc2ef5810f4852a23ba4c71b74bb upstream.
    
    If tegra_i2c_mutex_lock() fails, the function returns without calling
    pm_runtime_put(), leaking the runtime PM reference acquired by the
    preceding pm_runtime_get_sync(). This prevents the device from ever
    entering runtime suspend.
    
    Add the missing pm_runtime_put() before returning on lock failure.
    
    Fixes: 6077cfd716fb ("i2c: tegra: Add support for SW mutex register")
    Signed-off-by: Saurav Sachidanand <sauravsc@amazon.com>
    Cc: <stable@vger.kernel.org> # v7.0+
    Reviewed-by: Jon Hunter <jonathanh@nvidia.com>
    Acked-by: Thierry Reding <treding@nvidia.com>
    Signed-off-by: Andi Shyti <andi.shyti@kernel.org>
    Link: https://lore.kernel.org/r/20260507221145.62183-2-sauravsc@amazon.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ice: dpll: fix misplaced header macros [+ + +]

Author: Ivan Vecera <ivecera@redhat.com>
Date:   Wed May 6 14:48:17 2026 -0700

    ice: dpll: fix misplaced header macros
    
    [ Upstream commit 30f1658fc5387384c7a60b9d15c79cb959512c1a ]
    
    The CGU register definitions (ICE_CGU_R10, ICE_CGU_R11 and related field
    masks) were placed after the #endif of the _ICE_DPLL_H_ include guard,
    leaving them unprotected. Move them inside the guard.
    
    Fixes: ad1df4f2d591 ("ice: dpll: Support E825-C SyncE and dynamic pin discovery")
    Signed-off-by: Ivan Vecera <ivecera@redhat.com>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Signed-off-by: Jacob Keller <jacob.e.keller@intel.com>
    Link: https://patch.msgid.link/20260506-jk-iwl-net-2026-05-04-v2-8-a5ea4dc837a9@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ice: dpll: fix rclk pin state get for E810 [+ + +]

Author: Ivan Vecera <ivecera@redhat.com>
Date:   Wed May 6 14:48:16 2026 -0700

    ice: dpll: fix rclk pin state get for E810
    
    [ Upstream commit cce709d8df6ba6d2a0a0dbf34acc2cdd9e23bd46 ]
    
    The refactoring of ice_dpll_rclk_state_on_pin_get() to use
    ice_dpll_pin_get_parent_idx() omitted the base_rclk_idx adjustment that was
    correctly added in the ice_dpll_rclk_state_on_pin_set() path. This breaks
    E810 devices where base_rclk_idx is non-zero, causing the wrong hardware
    index to be used for pin state lookup and incorrect recovered clock state
    to be reported via the DPLL subsystem. E825C is unaffected as its
    base_rclk_idx is 0.
    
    While at it, add bounds check against ICE_DPLL_RCLK_NUM_MAX on hw_idx after
    the base_rclk_idx subtraction in both ice_dpll_rclk_state_on_pin_{get,set}()
    to prevent out-of-bounds access on the pin state array.
    
    Fixes: ad1df4f2d591 ("ice: dpll: Support E825-C SyncE and dynamic pin discovery")
    Signed-off-by: Ivan Vecera <ivecera@redhat.com>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Signed-off-by: Jacob Keller <jacob.e.keller@intel.com>
    Link: https://patch.msgid.link/20260506-jk-iwl-net-2026-05-04-v2-7-a5ea4dc837a9@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ice: fix locking around wait_event_interruptible_locked_irq [+ + +]

Author: Jacob Keller <jacob.e.keller@intel.com>
Date:   Fri May 15 11:24:08 2026 -0700

    ice: fix locking around wait_event_interruptible_locked_irq
    
    commit 89bbff099bfc94888eb942d5b981592bbbe0c856 upstream.
    
    Commit 50327223a8bb ("ice: add lock to protect low latency interface")
    introduced a wait queue used to protect the low latency timer interface.
    The queue is used with the wait_event_interruptible_locked_irq macro, which
    unlocks the wait queue lock while sleeping. The irq variant uses
    spin_lock_irq and spin_unlock_irq to manage this. The wait queue lock was
    previously locked using spin_lock_irqsave. This difference in lock variants
    could lead to issues, since wait_event would unlock the wait queue and
    restore interrupts while sleeping.
    
    The ice_read_phy_tstamp_ll_e810() function is ultimately called through
    ice_read_phy_tstamp, which is called from ice_ptp_process_tx_tstamp or
    ice_ptp_clear_unexpected_tx_ready. The former is called through the
    miscellaneous IRQ thread function, while the latter is called from the
    service task work queue thread. Neither of these functions has interrupts
    disabled, so use spin_lock_irq instead of spin_lock_irqsave.
    
    Fixes: 50327223a8bb ("ice: add lock to protect low latency interface")
    Cc: stable@vger.kernel.org
    Reported-by: Jakub Kicinski <kuba@kernel.org>
    Closes: https://lore.kernel.org/netdev/20250109181823.77f44c69@kernel.org/
    Signed-off-by: Jacob Keller <jacob.e.keller@intel.com>
    Signed-off-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Rinitha S <sx.rinitha@intel.com> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-2-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ice: fix locking in ice_dcb_rebuild() [+ + +]

Author: Bart Van Assche <bvanassche@acm.org>
Date:   Wed May 6 14:48:15 2026 -0700

    ice: fix locking in ice_dcb_rebuild()
    
    [ Upstream commit 0ded1f36ba4021cba50513e80be6b6e173710168 ]
    
    Move the mutex_lock() call up to prevent that DCB settings change after
    the first ice_query_port_ets() call. The second ice_query_port_ets()
    call in ice_dcb_rebuild() is already protected by pf->tc_mutex.
    
    This also fixes a bug in an error path, as before taking the first
    "goto dcb_error" in the function jumped over mutex_lock() to
    mutex_unlock().
    
    This bug has been detected by the clang thread-safety analyzer.
    
    Cc: intel-wired-lan@lists.osuosl.org
    Fixes: 242b5e068b25 ("ice: Fix DCB rebuild after reset")
    Signed-off-by: Bart Van Assche <bvanassche@acm.org>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com>
    Tested-by: Arpana Arland <arpanax.arland@intel.com>
    Signed-off-by: Jacob Keller <jacob.e.keller@intel.com>
    Link: https://patch.msgid.link/20260506-jk-iwl-net-2026-05-04-v2-6-a5ea4dc837a9@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ice: fix setting promisc mode while adding VID filter [+ + +]

Author: Marcin Szycik <marcin.szycik@intel.com>
Date:   Fri May 15 11:24:10 2026 -0700

    ice: fix setting promisc mode while adding VID filter
    
    commit ebc8de716c9ec2be384abdc2dd866da26c6580d1 upstream.
    
    There are at least two paths through which VSI promiscuous mode can be
    independently configured via ice_fltr_set_vsi_promisc():
    - ice_vlan_rx_add_vid() (netdev op)
    - ice_service_task() -> ... -> ice_set_promisc()
    
    Both paths may try to program promiscuous mode concurrently. One such
    scenario is:
    
    1. Add ice netdev to bond
    2. Add the bond netdev to bridge
    3. ice netdev enters allmulticast mode (IFF_ALLMULTI)
    4. Service task programs promisc mode filter
    5. Bridge -> bond calls ice_vlan_rx_add_vid()
    
    Crucially, ice_vlan_rx_add_vid() fails if ice_fltr_set_vsi_promisc()
    returns any error, including -EEXIST. This causes VLAN filtering setup
    to fail on the bond interface. ice_set_promisc() already handles -EEXIST
    correctly.
    
    Fix by adding the same -EEXIST check to ice_vlan_rx_add_vid(): if the
    promisc filter is already programmed, continue without returning error.
    
    Fixes: 1273f89578f2 ("ice: Fix broken IFF_ALLMULTI handling")
    Cc: stable@vger.kernel.org
    Signed-off-by: Marcin Szycik <marcin.szycik@intel.com>
    Signed-off-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Rinitha S <sx.rinitha@intel.com> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-4-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ice: fix setting RSS VSI hash for E830 [+ + +]

Author: Marcin Szycik <marcin.szycik@linux.intel.com>
Date:   Wed May 6 14:48:14 2026 -0700

    ice: fix setting RSS VSI hash for E830
    
    [ Upstream commit b3cda96feb60d91fe88d52b974ff110dcfa91239 ]
    
    ice_set_rss_hfunc() performs a VSI update, in which it sets hashing
    function, leaving other VSI options unchanged. However, ::q_opt_flags is
    mistakenly set to the value of another field, instead of its original
    value, probably due to a typo. What happens next is hardware-dependent:
    
    On E810, only the first bit is meaningful (see
    ICE_AQ_VSI_Q_OPT_PE_FLTR_EN) and can potentially end up in a different
    state than before VSI update.
    
    On E830, some of the remaining bits are not reserved. Setting them
    to some unrelated values can cause the firmware to reject the update
    because of invalid settings, or worse - succeed.
    
    Reproducer:
      sudo ethtool -X $PF1 equal 8
    
    Output in dmesg:
      Failed to configure RSS hash for VSI 6, error -5
    
    Fixes: 352e9bf23813 ("ice: enable symmetric-xor RSS for Toeplitz hash function")
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com>
    Signed-off-by: Marcin Szycik <marcin.szycik@linux.intel.com>
    Signed-off-by: Jacob Keller <jacob.e.keller@intel.com>
    Link: https://patch.msgid.link/20260506-jk-iwl-net-2026-05-04-v2-5-a5ea4dc837a9@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ice: fix VF queue configuration with low MTU values [+ + +]

Author: Jose Ignacio Tornos Martinez <jtornosm@redhat.com>
Date:   Fri May 15 11:24:09 2026 -0700

    ice: fix VF queue configuration with low MTU values
    
    commit 3ba4dd024d26372733d1c02e13e076c6016e3320 upstream.
    
    The ice driver's VF queue configuration validation rejects
    databuffer_size values below 1024 bytes, which prevents VFs from
    using MTU values below 871 bytes.
    
    The iavf driver calculates databuffer_size based on the MTU using:
      databuffer_size = ALIGN(MTU + LIBETH_RX_LL_LEN, 128)
    
    where LIBETH_RX_LL_LEN = 26 (ETH_HLEN + 2*VLAN_HLEN + ETH_FCS_LEN).
    
    For MTU values below 871:
      MTU 870: 870 + 26 = 896, aligned to 128 = 896 (< 1024, rejected)
      MTU 871: 871 + 26 = 897, aligned to 128 = 1024 (>= 1024, accepted)
    
    The 1024-byte minimum seems unnecessarily restrictive, because the hardware
    supports databuffer_size as low as 128 bytes (the alignment boundary),
    which should allow MTU values down to the standard minimum of 68 bytes.
    
    I haven't found the reason why the limit was configured in the commit
    9c7dd7566d18 ("ice: add validation in OP_CONFIG_VSI_QUEUES VF message"), so
    with no more information and since it is working, change the minimum
    databuffer_size validation from 1024 to 128 bytes to allow standard low
    MTU values while still preventing invalid configurations.
    
    Fixes: 9c7dd7566d18 ("ice: add validation in OP_CONFIG_VSI_QUEUES VF message")
    cc: stable@vger.kernel.org
    Signed-off-by: Jose Ignacio Tornos Martinez <jtornosm@redhat.com>
    Reviewed-by: Jacob Keller <jacob.e.keller@intel.com>
    Reviewed-by: Michal Swiatkowski <michal.swiatkowski@linux.intel.com>
    Reviewed-by: Paul Menzel <pmenzel@molgen.mpg.de>
    Tested-by: Rafal Romanowski <rafal.romanowski@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-3-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ice: ptp: serialize E825 PHY timer start with PTP lock [+ + +]

Author: Grzegorz Nitka <grzegorz.nitka@intel.com>
Date:   Fri May 15 11:24:11 2026 -0700

    ice: ptp: serialize E825 PHY timer start with PTP lock
    
    [ Upstream commit 781ff8f2d575a794a2a4f11605288ae06757f5eb ]
    
    ice_start_phy_timer_eth56g() programs TIMETUS registers and issues
    INIT_INCVAL without holding the global PTP semaphore.
    
    This allows concurrent PTP command paths to interleave with PHY timer
    start, which can make the sequence fail and leave timer initialization
    inconsistent.
    
    Take the PTP lock around TIMETUS registers programming and INIT_INCVAL
    command execution, and make sure the lock is released on all error paths.
    
    Keep the subsequent sync step outside of this critical section, since
    ice_sync_phy_timer_eth56g() takes the same semaphore internally.
    
    Fixes: 7cab44f1c35f ("ice: Introduce ETH56G PHY model for E825C products")
    Reviewed-by: Arkadiusz Kubalewski <Arkadiusz.kubalewski@intel.com>
    Signed-off-by: Grzegorz Nitka <grzegorz.nitka@intel.com>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Tested-by: Alexander Nowlin <alexander.nowlin@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-5-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ice: ptp: use primary NAC semaphore on E825 [+ + +]

Author: Grzegorz Nitka <grzegorz.nitka@intel.com>
Date:   Fri May 15 11:24:12 2026 -0700

    ice: ptp: use primary NAC semaphore on E825
    
    [ Upstream commit 7b28523546c7e4adbb8436f2986efcfc8382985e ]
    
    For E825 2xNAC configurations, PTP semaphore operations must hit the
    primary NAC register block so both sides coordinate on the same lock.
    
    Commit e2193f9f9ec9 ("ice: enable timesync operation on 2xNAC E825
    devices") updated other primary-only PTP register accesses to
    use the primary NAC on non-primary functions, but left ice_ptp_lock()
    and ice_ptp_unlock() operating on the local NAC. As a result, secondary
    NAC PTP paths can take a different semaphore than the primary side.
    
    Select the primary hardware in ice_ptp_lock() and ice_ptp_unlock() when
    the current function is not primary, keeping semaphore operations
    symmetric and consistent with the rest of the 2xNAC PTP register access
    path.
    
    Fixes: e2193f9f9ec9 ("ice: enable timesync operation on 2xNAC E825 devices")
    Reviewed-by: Arkadiusz Kubalewski <Arkadiusz.kubalewski@intel.com>
    Signed-off-by: Grzegorz Nitka <grzegorz.nitka@intel.com>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Tested-by: Alexander Nowlin <alexander.nowlin@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-6-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ice: restore PTP Rx timestamp config after ethtool set-channels [+ + +]

Author: Grzegorz Nitka <grzegorz.nitka@intel.com>
Date:   Fri May 15 11:24:13 2026 -0700

    ice: restore PTP Rx timestamp config after ethtool set-channels
    
    commit 975b564d195b13ca6ee1ef5e6a9561734898eb17 upstream.
    
    When ethtool -L changes queue counts, ice_vsi_recfg_qs() closes and
    rebuilds the VSI, reallocating Rx rings. The newly allocated rings have
    ptp_rx cleared, so RX hardware timestamps are no longer attached to skb
    until hwtstamp configuration is applied again.
    
    Restore timestamp mode after ice_vsi_open() in the queue reconfiguration
    path, matching reset/rebuild behavior and ensuring newly rebuilt Rx rings
    have PTP RX timestamping re-enabled.
    
    Testing hints:
    - run ptp4l application in client synchronization mode:
             ptp4l -i ethX -m -s
    - run PTP traffic
    - change queue number on ethX netdev interface:
            ethtool -L ethX combined new_queue_size
    - observe ptp4l output
    - expected result: no "received DELAY_REQ without timestamp" messages
    
    Fixes: 77a781155a65 ("ice: enable receive hardware timestamping")
    Cc: stable@vger.kernel.org
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Signed-off-by: Grzegorz Nitka <grzegorz.nitka@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Alexander Nowlin <alexander.nowlin@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-7-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

idpf: fix read_dev_clk_lock spinlock init in idpf_ptp_init() [+ + +]

Author: Emil Tantilov <emil.s.tantilov@intel.com>
Date:   Wed May 6 14:48:12 2026 -0700

    idpf: fix read_dev_clk_lock spinlock init in idpf_ptp_init()
    
    [ Upstream commit da4f76b6a84ede14a71282ef841768299ead0221 ]
    
    In idpf_ptp_init(), read_dev_clk_lock is initialized after
    ptp_schedule_worker() had already been called (and after
    idpf_ptp_settime64() could reach the lock). The PTP aux worker
    fires immediately upon scheduling and can call into
    idpf_ptp_read_src_clk_reg_direct(), which takes
    spin_lock(&ptp->read_dev_clk_lock) on an uninitialized lock, triggering
    the lockdep "non-static key" warning:
    
    [12973.796587] idpf 0000:83:00.0: Device HW Reset initiated
    [12974.094507] INFO: trying to register non-static key.
    ...
    [12974.097208] Call Trace:
    [12974.097213]  <TASK>
    [12974.097218]  dump_stack_lvl+0x93/0xe0
    [12974.097234]  register_lock_class+0x4c4/0x4e0
    [12974.097249]  ? __lock_acquire+0x427/0x2290
    [12974.097259]  __lock_acquire+0x98/0x2290
    [12974.097272]  lock_acquire+0xc6/0x310
    [12974.097281]  ? idpf_ptp_read_src_clk_reg+0xb7/0x150 [idpf]
    [12974.097311]  ? lockdep_hardirqs_on_prepare+0xde/0x190
    [12974.097318]  ? finish_task_switch.isra.0+0xd2/0x350
    [12974.097330]  ? __pfx_ptp_aux_kworker+0x10/0x10 [ptp]
    [12974.097343]  _raw_spin_lock+0x30/0x40
    [12974.097353]  ? idpf_ptp_read_src_clk_reg+0xb7/0x150 [idpf]
    [12974.097373]  idpf_ptp_read_src_clk_reg+0xb7/0x150 [idpf]
    [12974.097391]  ? kthread_worker_fn+0x88/0x3d0
    [12974.097404]  ? kthread_worker_fn+0x4e/0x3d0
    [12974.097411]  idpf_ptp_update_cached_phctime+0x26/0x120 [idpf]
    [12974.097428]  ? _raw_spin_unlock_irq+0x28/0x50
    [12974.097436]  idpf_ptp_do_aux_work+0x15/0x20 [idpf]
    [12974.097454]  ptp_aux_kworker+0x20/0x40 [ptp]
    [12974.097464]  kthread_worker_fn+0xd5/0x3d0
    [12974.097474]  ? __pfx_kthread_worker_fn+0x10/0x10
    [12974.097482]  kthread+0xf4/0x130
    [12974.097489]  ? __pfx_kthread+0x10/0x10
    [12974.097498]  ret_from_fork+0x32c/0x410
    [12974.097512]  ? __pfx_kthread+0x10/0x10
    [12974.097519]  ret_from_fork_asm+0x1a/0x30
    [12974.097540]  </TASK>
    
    Move the call to spin_lock_init() up a bit to make sure read_dev_clk_lock
    is not touched before it's been initialized.
    
    Fixes: 5cb8805d2366 ("idpf: negotiate PTP capabilities and get PTP clock")
    Signed-off-by: Emil Tantilov <emil.s.tantilov@intel.com>
    Reviewed-by: Madhu Chittim <madhu.chittim@intel.com>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Samuel Salin <Samuel.salin@intel.com>
    Signed-off-by: Jacob Keller <jacob.e.keller@intel.com>
    Link: https://patch.msgid.link/20260506-jk-iwl-net-2026-05-04-v2-3-a5ea4dc837a9@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

igc: fix potential skb leak in igc_fpe_xmit_smd_frame() [+ + +]

Author: Kohei Enju <kohei@enjuk.jp>
Date:   Fri May 15 11:24:16 2026 -0700

    igc: fix potential skb leak in igc_fpe_xmit_smd_frame()
    
    commit e935c37b8a94bb256fada6395a5d05e1c0c6bdaf upstream.
    
    When igc_fpe_init_tx_descriptor() fails, no one takes care of an
    allocated skb, leaking it. [1]
    Use dev_kfree_skb_any() on failure.
    
    Tested on an I226 adapter with the following command, while injecting
    faults in igc_fpe_init_tx_descriptor() to trigger the error path.
     # ethtool --set-mm $DEV verify-enabled on tx-enabled on pmac-enabled on
    
    [1]
    unreferenced object 0xffff888113c6cdc0 (size 224):
    ...
      backtrace (crc be3d3fda):
        kmem_cache_alloc_node_noprof+0x3b1/0x410
        __alloc_skb+0xde/0x830
        igc_fpe_xmit_smd_frame.isra.0+0xad/0x1b0
        igc_fpe_send_mpacket+0x37/0x90
        ethtool_mmsv_verify_timer+0x15e/0x300
    
    Cc: stable@vger.kernel.org
    Fixes: 5422570c0010 ("igc: add support for frame preemption verification")
    Signed-off-by: Kohei Enju <kohei@enjuk.jp>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Faizal Rahim <faizal.abdul.rahim@linux.intel.com>
    Tested-by: Avigail Dahan <avigailx.dahan@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-10-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

igc: set tx buffer type for SMD frames [+ + +]

Author: Kohei Enju <kohei@enjuk.jp>
Date:   Fri May 15 11:24:15 2026 -0700

    igc: set tx buffer type for SMD frames
    
    [ Upstream commit 5acc641e590e008caaed480ed9ffae47cf7ecbdf ]
    
    Sashiko pointed out that igc_fpe_init_smd_frame() initializes
    igc_tx_buffer fields for an SMD skb, but does not set the buffer type:
    https://sashiko.dev/#/patchset/20260415025226.114115-1-kohei%40enjuk.jp
    
    Since igc_tx_buffer entries are reused, a stale XDP or XSK type can
    remain and make TX completion use the wrong cleanup path.
    
    Set the buffer type to IGC_TX_BUFFER_TYPE_SKB.
    
    Fixes: 5422570c0010 ("igc: add support for frame preemption verification")
    Signed-off-by: Kohei Enju <kohei@enjuk.jp>
    Reviewed-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Avigail Dahan <avigailx.dahan@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-9-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

io_uring/net: punt IORING_OP_BIND async if it needs file create [+ + +]

Author: Jens Axboe <axboe@kernel.dk>
Date:   Fri May 15 10:19:09 2026 -0600

    io_uring/net: punt IORING_OP_BIND async if it needs file create
    
    [ Upstream commit ccd25890f73c082fe2657ed227b497d6ac5fdc40 ]
    
    For two reasons:
    
    1) An opcode cannot block inside io_uring_enter() doing submissions, as
       it'll stall the submission side pipeline.
    
    2) Ending up in sb_start_write() -> __sb_start_write() ->
       percpu_down_read_freezable() introduces a new lockdep edge, which it
       correctly complains about.
    
    Check if the socket type is AF_UNIX and has a non-empty pathname. If it
    does, mark it REQ_F_FORCE_ASYNC to punt the submission to io-wq rather
    than attempt to do it inline.
    
    Fixes: 7481fd93fa0a ("io_uring: Introduce IORING_OP_BIND")
    Reviewed-by: Gabriel Krisman Bertazi <krisman@suse.de>
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

io_uring/nop: pass all errors to userspace [+ + +]

Author: Alexander A. Klimov <grandmaster@al2klimov.de>
Date:   Wed May 20 20:00:44 2026 +0200

    io_uring/nop: pass all errors to userspace
    
    [ Upstream commit e97ff8b62d4690c69297f0f6de874f0564cc01a4 ]
    
    This fixes an inconsistency where io_nop() called req_set_fail()
    based on ret, but passed just nop->result to userspace.
    Originally, ret is a even copy of nop->result, but is set to an error
    when such happens subsequently. Now that's also passed to userspace.
    
    Fixes: a85f31052bce ("io_uring/nop: add support for testing registered files and buffers")
    Signed-off-by: Alexander A. Klimov <grandmaster@al2klimov.de>
    Link: https://patch.msgid.link/20260520180045.538533-1-grandmaster@al2klimov.de
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

io_uring/waitid: clear waitid info before copying it to userspace [+ + +]

Author: Heechan Kang <gganji11@naver.com>
Date:   Sun May 17 03:47:09 2026 +0900

    io_uring/waitid: clear waitid info before copying it to userspace
    
    commit 93d93f5f8da791e98159795c6ef683f45bd95d13 upstream.
    
    IORING_OP_WAITID stores its result fields in struct io_waitid::info and
    later copies them to userspace siginfo. The prep path initializes the
    request arguments, but it does not initialize info itself.
    
    If the wait operation completes without reporting a child event, the common
    wait code can return without writing wo_info. In that case io_waitid_finish()
    still copies iw->info to userspace, exposing stale bytes from the reused
    io_kiocb command storage.
    
    Clear the result storage during prep so the io_uring path matches the
    regular waitid syscall, which uses a zero-initialized struct waitid_info.
    
    Fixes: f31ecf671ddc ("io_uring: add IORING_OP_WAITID support")
    Cc: stable@vger.kernel.org # 6.7+
    Signed-off-by: Heechan Kang <gganji11@naver.com>
    Link: https://patch.msgid.link/20260516184709.852814-1-gganji11@naver.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

io_uring: propagate array_index_nospec opcode into req->opcode [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Sun May 17 17:30:10 2026 -0400

    io_uring: propagate array_index_nospec opcode into req->opcode
    
    [ Upstream commit cf18e36455603d65d4745de83e2d1743c54ada47 ]
    
    Commit 1e988c3fe126 ("io_uring: prevent opcode speculation") added
    array_index_nospec() to io_init_req(), but applied it only to a local
    opcode variable. req->opcode is initialized from sqe->opcode before the
    bounds check and remains the raw value.
    
    Keep req->opcode as the canonical opcode in io_init_req(): reject
    out-of-range values architecturally, then write the array_index_nospec()
    result back to req->opcode before any table lookup. This keeps downstream
    users of req->opcode from observing the raw user byte on a mispredicted
    path.
    
    No functional change: array_index_nospec() is a no-op for opcodes in
    [0, IORING_OP_LAST), and out-of-range opcodes are still rejected at the
    bounds check above the assignment.
    
    Fixes: 1e988c3fe126 ("io_uring: prevent opcode speculation")
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/20260517213010.696135-1-michael.bommarito@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommu/amd: Fix illegal cap/mmio access in IOMMU debugfs [+ + +]

Author: Guanghui Feng <guanghuifeng@linux.alibaba.com>
Date:   Thu Mar 19 15:37:54 2026 +0800

    iommu/amd: Fix illegal cap/mmio access in IOMMU debugfs
    
    [ Upstream commit 0e59645683b7b6fa20eceb21a6f420e4f7412943 ]
    
    In the current AMD IOMMU debugfs, when multiple processes simultaneously
    access the IOMMU mmio/cap registers using the IOMMU debugfs, illegal
    access issues can occur in the following execution flow:
    
    1. CPU1: Sets a valid access address using iommu_mmio/capability_write,
    and verifies the access address's validity in iommu_mmio/capability_show
    
    2. CPU2: Sets an invalid address using iommu_mmio/capability_write
    
    3. CPU1: accesses the IOMMU mmio/cap registers based on the invalid
    address, resulting in an illegal access.
    
    This patch modifies the execution process to first verify the address's
    validity and then access it based on the same address, ensuring
    correctness and robustness.
    
    Signed-off-by: Guanghui Feng <guanghuifeng@linux.alibaba.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Stable-dep-of: 8dfd3d8d7443 ("iommu/amd: Remove latent out-of-bounds access in IOMMU debugfs")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommu/amd: Remove latent out-of-bounds access in IOMMU debugfs [+ + +]

Author: Eder Zulian <ezulian@redhat.com>
Date:   Fri Apr 10 14:55:50 2026 +0200

    iommu/amd: Remove latent out-of-bounds access in IOMMU debugfs
    
    [ Upstream commit 8dfd3d8d74435344ee8dc9237596959c8b2a6cbe ]
    
    In iommu_mmio_write() and iommu_capability_write(), the variables
    dbg_mmio_offset and dbg_cap_offset are declared as int. However, they
    are populated using kstrtou32_from_user(). If a user provides a
    sufficiently large value, it can become a negative integer.
    
    Prior to this patch, the AMD IOMMU debugfs implementation was already
    protected by different mechanisms.
    
    1. #define OFS_IN_SZ 8 ensures the user string <= 8 bytes, so
       e.g. 0xffffffff isn't a valid input.
    
      if (cnt > OFS_IN_SZ)
         return -EINVAL;
    
    2. Implicit type promotion in iommu_mmio_write(), dbg_mmio_offset is int
       and iommu->mmio_phys_end is u64
    
      if (dbg_mmio_offset > iommu->mmio_phys_end - sizeof(u64))
          return -EINVAL;
    
    3. The show handlers would currently catch the negative number and
       refuse to perform the read.
    
    Replace kstrtou32_from_user() with kstrtos32_from_user() to parse the
    input, and check for negative values to explicitly prevent out-of-bounds
    memory accesses directly in iommu_mmio_write() and
    iommu_capability_write().
    
    Signed-off-by: Eder Zulian <ezulian@redhat.com>
    Fixes: 7a4ee419e8c1 ("iommu/amd: Add debugfs support to dump IOMMU MMIO registers")
    Cc: stable@vger.kernel.org
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommu: Fix loss of errno on map failure for classic ops [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Tue May 12 13:46:13 2026 -0300

    iommu: Fix loss of errno on map failure for classic ops
    
    [ Upstream commit 6fc7e8a3b8115294f60f5c89de27330bf1b9c98e ]
    
    A typo, likely from a rebase, inverted the condition and caused
    errors to be lost. Fix it to be "if (ret)".
    
    This was breaking iommu_create_device_direct_mappings() on drivers
    that don't use iommupt and don't fully set up their domain in
    alloc_pages() (i.e., SMMUv2). In this case the first call of
    iommu_create_device_direct_mappings() should fail due to the
    incompletely initialized domain. Since it wrongly returns success,
    the second call to iommu_create_device_direct_mappings() doesn't
    happen and IOMMU_RESV_DIRECT is never set up.
    
    Cc: stable@vger.kernel.org
    Fixes: d6c65b0fd621 ("iommupt: Avoid rewalking during map")
    Reported-by: Josua Mayer <josua@solid-run.com>
    Closes: https://lore.kernel.org/all/321c2e57-6a17-4aef-ba42-d2ebd577e472@solid-run.com/
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Pranjal Shrivastava <praan@google.com>
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Reviewed-by: Mostafa Saleh <smostafa@google.com>
    Tested-by: Josua Mayer <josua@solid-run.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Stable-dep-of: 0735c54804c7 ("iommu: Handle unmap error when iommu_debug is enabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommu: Fix up map/unmap debugging for iommupt domains [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Tue May 12 13:46:14 2026 -0300

    iommu: Fix up map/unmap debugging for iommupt domains
    
    [ Upstream commit b948a87228482235afbaf5f4d8037860b5c470fd ]
    
    Sashiko noticed a few issues in this path, and a few more were
    found on review. Tidy them up further. These are intertwined
    because the debug code depends on some of the WARN_ONs to function
    right:
    
    Lift into iommu_map_nosync():
    - The might_sleep_if()
    - 0 pgsize_bitmap WARN_ON
    - Promote the illegal domain->type to a WARN_ON
    - WARN_ON for illegal gfp flags
    
    Then remove the return 0 since it is now safe to call
    iommu_debug_map().
    
    Lift into __iommu_unmap():
    - 0 pgsize_bitmap WARN_ON
    - Promote the illegal domain->type to a WARN_ON
    - iommu_debug_unmap_begin()
    
    This now pairs with the unconditional iommu_debug_map() on the
    mapping side. Thus iommu debugging now works for iommupt along
    with some of the other debugging features.
    
    Fixes: 99fb8afa16ad ("iommupt: Directly call iommupt's unmap_range()")
    Fixes: d6c65b0fd621 ("iommupt: Avoid rewalking during map")
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Pranjal Shrivastava <praan@google.com>
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Reviewed-by: Mostafa Saleh <smostafa@google.com>
    Tested-by: Josua Mayer <josua@solid-run.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Stable-dep-of: 0735c54804c7 ("iommu: Handle unmap error when iommu_debug is enabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommu: Handle unmap error when iommu_debug is enabled [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Tue May 12 13:46:15 2026 -0300

    iommu: Handle unmap error when iommu_debug is enabled
    
    [ Upstream commit 0735c54804c709d1b292f3b6947cfb560b2ce552 ]
    
    Sashiko noticed a latent bug where the map error flow called iommu_unmap()
    which calls iommu_debug_unmap_begin()/iommu_debug_unmap_end() however
    since this is an error path the map flow never actually established the
    original iommu_debug_map() it will malfunction.
    
    Lift the unmap error handling into iommu_map_nosync() and reorder it so
    the trace_map()/iommu_debug_map() records the partial mapping and then
    immediately unmaps it. This avoid creating the unbalanced tracking and
    provides saner tracing instead of a unmap unmatched to any map.
    
    Fixes: ccc21213f013 ("iommu: Add calls for IOMMU_DEBUG_PAGEALLOC")
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Pranjal Shrivastava <praan@google.com>
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Reviewed-by: Mostafa Saleh <smostafa@google.com>
    Tested-by: Josua Mayer <josua@solid-run.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommupt: Avoid rewalking during map [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Fri Feb 27 15:30:11 2026 -0400

    iommupt: Avoid rewalking during map
    
    [ Upstream commit d6c65b0fd6218bd21ed0be7a8d3218e8f6dc91de ]
    
    Currently the core code provides a simplified interface to drivers where
    it fragments a requested multi-page map into single page size steps after
    doing all the calculations to figure out what page size is
    appropriate. Each step rewalks the page tables from the start.
    
    Since iommupt has a single implementation of the mapping algorithm it can
    internally compute each step as it goes while retaining its current
    position in the walk.
    
    Add a new function pt_pgsz_count() which computes the same page size
    fragement of a large mapping operations.
    
    Compute the next fragment when all the leaf entries of the current
    fragement have been written, then continue walking from the current
    point.
    
    The function pointer is run through pt_iommu_ops instead of
    iommu_domain_ops to discourage using it outside iommupt. All drivers with
    their own page tables should continue to use the simplified map_pages()
    style interfaces.
    
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Reviewed-by: Kevin Tian <kevin.tian@intel.com>
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Lu Baolu <baolu.lu@linux.intel.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Stable-dep-of: 0735c54804c7 ("iommu: Handle unmap error when iommu_debug is enabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommupt: Check for missing PAGE_SIZE in the pgsize_bitmap [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Tue May 12 13:46:16 2026 -0300

    iommupt: Check for missing PAGE_SIZE in the pgsize_bitmap
    
    [ Upstream commit 8ef3f77c440005c7f04229a75976bfc078364247 ]
    
    Sashiko pointed out that the driver could drop PAGE_SIZE from the
    pgsize_bitmap. That is technically allowed but nothing does it, and
    such an iommu_domain would not be used with the DMA API today.
    
    Still, it is against the design and it is trivial to fix up. Lift
    the PT_WARN_ON to the if branch and just skip the fast path.
    
    Fixes: dcd6a011a8d5 ("iommupt: Add map_pages op")
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Pranjal Shrivastava <praan@google.com>
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Tested-by: Josua Mayer <josua@solid-run.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommupt: Directly call iommupt's unmap_range() [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Fri Feb 27 15:30:10 2026 -0400

    iommupt: Directly call iommupt's unmap_range()
    
    [ Upstream commit 99fb8afa16add85ed016baee9735231bca0c32b4 ]
    
    The common algorithm in iommupt does not require the iommu_pgsize()
    calculations, it can directly unmap any arbitrary range. Add a new function
    pointer to directly call an iommupt unmap_range op and make
    __iommu_unmap() call it directly.
    
    Gives about a 5% gain on single page unmappings.
    
    The function pointer is run through pt_iommu_ops instead of
    iommu_domain_ops to discourage using it outside iommupt. All drivers with
    their own page tables should continue to use the simplified
    map/unmap_pages() style interfaces.
    
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Reviewed-by: Kevin Tian <kevin.tian@intel.com>
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Lu Baolu <baolu.lu@linux.intel.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Stable-dep-of: 0735c54804c7 ("iommu: Handle unmap error when iommu_debug is enabled")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

iommupt: Fix the end_index calculation in __map_range_leaf() [+ + +]

Author: Jason Gunthorpe <jgg@ziepe.ca>
Date:   Tue May 12 13:46:17 2026 -0300

    iommupt: Fix the end_index calculation in __map_range_leaf()
    
    [ Upstream commit 58829512ad461af8f35941069c209941e3a97b65 ]
    
    Sashiko noticed a mismatch of units in this math: num_leaves is
    actually the number of leaf *entries* (so a 16-item contiguous leaf
    is one num_leaves), while index is in items. The mismatch in maths
    causes __map_range_leaf() to exit early instead of efficiently
    filling a larger range of contiguous PTEs.
    
    The early exit is caught by the functions above and then
    __map_range_leaf() is re-invoked, so there is no functional issue.
    
    Correct the misuse of units by adjusting num_leaves with the leaf
    size and avoid the performance cost of looping externally.
    
    There are also some mismatched types for num_leaves; simplify
    things to remove the duplicated calculations.
    
    Fixes: d6c65b0fd621 ("iommupt: Avoid rewalking during map")
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Reviewed-by: Samiullah Khawaja <skhawaja@google.com>
    Reviewd-by: Pranjal Shrivastava <praan@google.com>
    Tested-by: Josua Mayer <josua@solid-run.com>
    Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ipv4: raw: reject IP_HDRINCL packets with ihl < 5 [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Tue May 12 16:51:14 2026 -0400

    ipv4: raw: reject IP_HDRINCL packets with ihl < 5
    
    commit 915fab69823a14c170dbaa3b41978768e0fe62fc upstream.
    
    raw_send_hdrinc() validates that the caller-supplied IPv4 header
    fits within the message length:
    
        iphlen = iph->ihl * 4;
        err = -EINVAL;
        if (iphlen > length)
            goto error_free;
    
        if (iphlen >= sizeof(*iph)) {
            /* fix up saddr, tot_len, id, csum, transport_header */
        }
    
    It does not, however, reject ihl < 5.  For such a packet the
    "if (iphlen >= sizeof(*iph))" branch is skipped, leaving the
    crafted iphdr untouched, but the packet is still handed to
    __ip_local_out() and onward.  Downstream consumers that read
    iph->ihl assume a sane value: net/ipv4/ah4.c:ah_output() in
    particular subtracts sizeof(struct iphdr) from top_iph->ihl * 4
    and passes the (signed-int-negative, then cast to size_t)
    result to memcpy(), producing an OOB access of length close to
    SIZE_MAX and a host kernel panic.
    
    An IPv4 header with ihl < 5 is malformed by definition (RFC 791:
    "Internet Header Length is the length of the internet header in
    32 bit words ... Note that the minimum value for a correct header
    is 5.").  The kernel should not be willing to inject such a
    packet into its own output path.
    
    Reject "iphlen < sizeof(*iph)" alongside the existing
    "iphlen > length" check.  This matches the principle that locally
    constructed packets that re-enter the IP stack must pass the same
    basic sanity tests that a foreign packet would be subjected to.
    
    Once this lands, the "if (iphlen >= sizeof(*iph))" wrapper around
    the fixup branch becomes redundant; left in place to keep the
    patch minimal and backport-friendly.  A follow-up can unwrap it.
    
    Note that commit 86f4c90a1c5c ("ipv4, ipv6: ensure raw socket
    message is big enough to hold an IP header") ensures the message
    buffer is large enough to hold an iphdr, but does not constrain
    the self-reported iph->ihl.
    
    Reachability: the malformed packet source is any caller with
    CAP_NET_RAW, including an unprivileged process in a user+net
    namespace on a kernel with CONFIG_USER_NS=y.  The reproduced AH
    crash also requires a matching xfrm AH policy on the outgoing
    route; a container granted CAP_NET_ADMIN can install that state
    and policy in its netns.  Loopback bypasses xfrm_output, so the
    trigger uses a real netdev.
    
    Reproduced on UML + KASAN: kernel-mode fault at addr 0x0 with
    memcpy_orig at the crash site.  Same shape reproduces inside a
    rootless Docker container with --cap-add NET_ADMIN on a stock
    distro kernel.
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: stable@vger.kernel.org
    Suggested-by: Herbert Xu <herbert@gondor.apana.org.au>
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/77ec2b5e8111961c2c39883c92e8aa2709039c17.1778614451.git.michael.bommarito@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ipv6: ioam: add NULL check for idev in ipv6_hop_ioam() [+ + +]

Author: Justin Iurman <justin.iurman@gmail.com>
Date:   Sun May 17 20:30:59 2026 +0200

    ipv6: ioam: add NULL check for idev in ipv6_hop_ioam()
    
    commit d4ea0dfd75011b78cebf3808f98ac4c4f51a6fb9 upstream.
    
    Reported by Sashiko:
    
    The function ipv6_hop_ioam() accesses
    __in6_dev_get(skb->dev)->cnf.ioam6_enabled without validating the returned
    idev pointer. Because addrconf_ifdown() can concurrently clear dev->ip6_ptr
    via RCU, __in6_dev_get() can return NULL during interface teardown, which
    could cause a NULL pointer dereference when processing an IOAM Hop-by-Hop
    option.
    
    Let's add a check and use SKB_DROP_REASON_IPV6DISABLED accordingly.
    
    Fixes: 9ee11f0fff20 ("ipv6: ioam: Data plane support for Pre-allocated Trace")
    Cc: stable@vger.kernel.org
    Signed-off-by: Justin Iurman <justin.iurman@gmail.com>
    Reviewed-by: Ido Schimmel <idosch@nvidia.com>
    Link: https://patch.msgid.link/20260517183059.29140-1-justin.iurman@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ipv6: ioam: refresh hdr pointer before ioam6_event() [+ + +]

Author: Justin Iurman <justin.iurman@gmail.com>
Date:   Wed May 20 14:42:42 2026 +0200

    ipv6: ioam: refresh hdr pointer before ioam6_event()
    
    commit e46e6bc97fb1f339730ff1ba74267fbf48e7a422 upstream.
    
    Reported by Sashiko:
    
    In ipv6_hop_ioam(), the hdr pointer is initialized to point into the
    skb's linear data buffer. Later, the code calls skb_ensure_writable(),
    which might reallocate the buffer:
    
            if (skb_ensure_writable(skb, optoff + 2 + hdr->opt_len))
                    goto drop;
    
            /* Trace pointer may have changed */
            trace = (struct ioam6_trace_hdr *)(skb_network_header(skb)
                                               + optoff + sizeof(*hdr));
    
            ioam6_fill_trace_data(skb, ns, trace, true);
    
            ioam6_event(IOAM6_EVENT_TRACE, dev_net(skb->dev),
                        GFP_ATOMIC, (void *)trace, hdr->opt_len - 2);
    
    If the skb is cloned or lacks sufficient linear headroom,
    skb_ensure_writable() will invoke pskb_expand_head(), which reallocates
    the skb's data buffer and frees the old one, invalidating pointers to
    it. While the code recalculates the trace pointer immediately after the
    call to skb_ensure_writable(), it fails to recalculate the hdr pointer.
    
    This patch fixes the above by recalculating the hdr pointer before
    passing hdr->opt_len to ioam6_event(), so that we avoid any UaF.
    
    Fixes: f655c78d6225 ("net: exthdrs: ioam6: send trace event")
    Cc: stable@vger.kernel.org
    Signed-off-by: Justin Iurman <justin.iurman@gmail.com>
    Reviewed-by: Ido Schimmel <idosch@nvidia.com>
    Link: https://patch.msgid.link/20260520124242.32320-1-justin.iurman@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

irq_work: Fix use-after-free in irq_work_single() on PREEMPT_RT [+ + +]

Author: Jiayuan Chen <jiayuan.chen@linux.dev>
Date:   Mon Mar 30 15:32:29 2026 +0800

    irq_work: Fix use-after-free in irq_work_single() on PREEMPT_RT
    
    [ Upstream commit 91840be8f710370607f949a627e070896faeddb8 ]
    
    On PREEMPT_RT, non-HARD irq_work runs in per-CPU kthreads via
    run_irq_workd(), so irq_work_sync() uses rcuwait() to wait for BUSY==0.
    
    After irq_work_single() clears BUSY via atomic_cmpxchg(), it still
    dereferences @work for irq_work_is_hard() and rcuwait_wake_up().
    
    An irq_work_sync() caller on another CPU that enters after BUSY is cleared
    can observe BUSY==0 immediately, return, and free the work before those
    accesses complete — causing a use-after-free.
    
    Fix this by wrapping run_irq_workd() in guard(rcu)() so that the entire
    irq_work_single() execution is within an RCU read-side critical
    section. Then add synchronize_rcu() in irq_work_sync() after
    rcuwait_wait_event() to ensure the caller waits for the RCU grace period
    before returning, preventing premature frees.
    
    Fixes: 810979682ccc ("irq_work: Allow irq_work_sync() to sleep if irq_work() no IRQ support.")
    Suggested-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
    Suggested-by: Steven Rostedt <rostedt@goodmis.org>
    Signed-off-by: Jiayuan Chen <jiayuan.chen@linux.dev>
    Signed-off-by: Thomas Gleixner <tglx@kernel.org>
    Reviewed-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
    Link: https://patch.msgid.link/20260330073234.303732-1-jiayuan.chen@linux.dev
    Signed-off-by: Sasha Levin <sashal@kernel.org>

irqchip/ath79-cpu: Remove unused function [+ + +]

Author: Rosen Penev <rosenp@gmail.com>
Date:   Wed May 6 01:55:22 2026 -0700

    irqchip/ath79-cpu: Remove unused function
    
    [ Upstream commit 0fa10fb77069fb67aa51384868ef3702b7791465 ]
    
    ath79_cpu_irq_init() was part of the legacy pre-OF code that got removed a
    while back.
    
    Remove it to get rid of a missing prototype warning, reported by the kernel test
    robot.
    
    [ tglx: Fix the subject prefix. Sigh ... ]
    
    Fixes: 51fa4f8912c0 ("MIPS: ath79: drop legacy IRQ code")
    Reported-by: kernel test robot <lkp@intel.com>
    Signed-off-by: Rosen Penev <rosenp@gmail.com>
    Signed-off-by: Thomas Gleixner <tglx@kernel.org>
    Link: https://patch.msgid.link/20260506085522.1210143-1-rosenp@gmail.com
    Closes: https://lore.kernel.org/oe-kbuild-all/202412011509.kGQkDr1y-lkp@intel.com/
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ixgbevf: fix use-after-free in VEPA multicast source pruning [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Fri May 15 11:24:14 2026 -0700

    ixgbevf: fix use-after-free in VEPA multicast source pruning
    
    commit 5d49b568c188dc77199d8d2b959c91da8cc27cf1 upstream.
    
    ixgbevf_clean_rx_irq() prunes frames whose source MAC matches the VF's
    own address (VEPA multicast workaround) by freeing the skb and
    continuing to the next descriptor:
    
        dev_kfree_skb_irq(skb);
        continue;
    
    The skb pointer is declared outside the while loop and persists across
    iterations.  Because the continue skips the "skb = NULL" reset at the
    bottom of the loop, the next iteration enters the "else if (skb)" path
    and calls ixgbevf_add_rx_frag() on the freed skb, dereferencing
    skb_shinfo(skb)->nr_frags - a use-after-free in NAPI softirq context.
    
    The sibling driver iavf already handles this correctly by nulling the
    pointer before continuing.  Apply the same pattern here.
    
    I do not have ixgbevf hardware; the bug was found by static analysis
    (scan_drop_continue_loops.py + semgrep drop_continue_in_loop, multi-tool
    corroboration with the highest score in the scan).  The UAF was confirmed
    under KASAN by loading a test module that reproduces the exact code
    pattern (alloc skb, kfree_skb, then read skb_shinfo(skb)->nr_frags):
    
      BUG: KASAN: slab-use-after-free in ixgbevf_uaf_test_init+0x100/0x1000
      Read of size 8 at addr 000000006163ae78 by task insmod/30
      freed 208-byte region [000000006163adc0, 000000006163ae90)
    
    QEMU emulates igb (82576) but not ixgbe (82599), and the igbvf VF
    driver does not include the VEPA source pruning path, so a full
    end-to-end reproduction with emulated hardware was not possible.
    
    Fixes: bad17234ba70 ("ixgbevf: Change receive model to use double buffered page based receives")
    Cc: stable@vger.kernel.org
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Rafal Romanowski <rafal.romanowski@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Link: https://patch.msgid.link/20260515182419.1597859-8-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

kbuild: pacman-pkg: make "rc" releases adhere to pacman versioning scheme [+ + +]

Author: Viktor Jägersküpper <viktor_jaegerskuepper@freenet.de>
Date:   Fri May 15 23:58:45 2026 +0200

    kbuild: pacman-pkg: make "rc" releases adhere to pacman versioning scheme
    
    [ Upstream commit 202550713128da20d9381d6d2dc0f6b73839f434 ]
    
    The package versioning scheme does not enable smooth upgrades from "rc"
    releases to the corresponding stable releases (e.g. 7.0.0-rc7 -> 7.0.0)
    because pacman considers that a downgrade due to the underscore in
    pkgver (e.g. 7.0.0_rc7), see e.g. vercmp(8) for an explanation of the
    package version comparison used by pacman. Package versions which are
    derived from said releases (e.g. built from git revisions) are
    similarly affected. Fix this by modifying pkgver in order to remove the
    hyphen from kernel versions containing "-rcN", where N is a
    non-negative integer.
    
    Acked-by: Thomas Weißschuh <linux@weissschuh.net>
    Signed-off-by: Viktor Jägersküpper <viktor_jaegerskuepper@freenet.de>
    Reviewed-by: Nathan Chancellor <nathan@kernel.org>
    Tested-by: Nathan Chancellor <nathan@kernel.org>
    Link: https://patch.msgid.link/20260515215913.92481-1-viktor_jaegerskuepper@freenet.de
    Fixes: c8578539deba ("kbuild: add script and target to generate pacman package")
    Signed-off-by: Nicolas Schier <nsc@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kho: skip KHO for crash kernel [+ + +]

Author: Evangelos Petrongonas <epetron@amazon.de>
Date:   Fri Apr 10 01:16:05 2026 +0000

    kho: skip KHO for crash kernel
    
    [ Upstream commit a6715d7ec472a476db17787697a4abda62962284 ]
    
    kho_fill_kimage() unconditionally populates the kimage with KHO
    metadata for every kexec image type. When the image is a crash kernel,
    this can be problematic as the crash kernel can run in a small reserved
    region and the KHO scratch areas can sit outside it.
    The crash kernel then faults during kho_memory_init() when it
    tries phys_to_virt() on the KHO FDT address:
    
      Unable to handle kernel paging request at virtual address xxxxxxxx
      ...
        fdt_offset_ptr+...
        fdt_check_node_offset_+...
        fdt_first_property_offset+...
        fdt_get_property_namelen_+...
        fdt_getprop+...
        kho_memory_init+...
        mm_core_init+...
        start_kernel+...
    
    kho_locate_mem_hole() already skips KHO logic for KEXEC_TYPE_CRASH
    images, but kho_fill_kimage() was missing the same guard. As
    kho_fill_kimage() is the single point that populates image->kho.fdt
    and image->kho.scratch, fixing it here is sufficient for both arm64
    and x86 as the FDT and boot_params path are bailing out when these
    fields are unset.
    
    Fixes: d7255959b69a ("kho: allow kexec load before KHO finalization")
    Signed-off-by: Evangelos Petrongonas <epetron@amazon.de>
    Reviewed-by: Mike Rapoport (Microsoft) <rppt@kernel.org>
    Link: https://patch.msgid.link/20260410011609.1103-1-epetron@amazon.de
    Signed-off-by: Mike Rapoport (Microsoft) <rppt@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kprobes: skip non-symbol addresses in kprobe_add_ksym_blacklist() [+ + +]

Author: Jianpeng Chang <jianpeng.chang.cn@windriver.com>
Date:   Fri May 8 09:56:36 2026 +0900

    kprobes: skip non-symbol addresses in kprobe_add_ksym_blacklist()
    
    [ Upstream commit 307abfac04a254c09c5705d816b33354acee97a0 ]
    
    When kprobe_add_area_blacklist() iterates through a section like
    .kprobes.text, the start address may not correspond to a named symbol.
    On ARM64 with CONFIG_DYNAMIC_FTRACE_WITH_CALL_OPS=y (introduced by
    commit baaf553d3bc3 ("arm64: Implement
    HAVE_DYNAMIC_FTRACE_WITH_CALL_OPS")), the compiler flag
    -fpatchable-function-entry=4,2 inserts 2 NOPs before each function entry
    point for ftrace call_ops. These pre-function NOPs sit at the section base
    address, before the first named function symbol. The compiler emits a $x
    mapping symbol at offset 0x00 to mark the start of code, but
    find_kallsyms_symbol() ignores mapping symbols.
    
    Without CONFIG_DYNAMIC_FTRACE_WITH_CALL_OPS (e.g. defconfig), no
    pre-function NOPs are inserted, the first function starts at offset
    0x00, and the bug does not trigger.
    
    This only affects modules that have a .kprobes.text section (i.e. those
    using the __kprobes annotation). Modules using NOKPROBE_SYMBOL() instead
    (like kretprobe_example.ko) blacklist exact function addresses via the
    _kprobe_blacklist section and are not affected.
    
    For kprobe_example.ko on ARM64 with -fpatchable-function-entry=4,2,
    the .kprobes.text section layout is:
    
      offset 0x00: $x + 2 NOPs    (mapping symbol + ftrace preamble)
      offset 0x08: handler_post   (64 bytes)
      offset 0x50: handler_pre    (68 bytes)
    
    kprobe_add_area_blacklist() starts iterating from the section base
    address (offset 0x00), which only has the $x mapping symbol.
    kprobe_add_ksym_blacklist() then calls kallsyms_lookup_size_offset()
    for this address, which goes through:
    
      kallsyms_lookup_size_offset()
        -> module_address_lookup()
          -> find_kallsyms_symbol()
    
    find_kallsyms_symbol() scans all module symbols to find the closest
    preceding symbol.
    
    Since no named text symbol exists at offset 0x00,
    find_kallsyms_symbol() picks __UNIQUE_ID_vermagic (a .modinfo symbol
    whose address is in the temporary image) as the "best" match. The
    computed "size" = next_text_symbol - modinfo_symbol spans across
    these two unrelated memory regions, creating a blacklist entry with
    a bogus range of tens of terabytes.
    
    Whether this causes a visible failure depends on address randomization,
    here is what happens on Raspberry Pi 4/5:
    
      - On RPi5, the bogus size was ~35 TB. start + size stayed within
        64-bit range, so the blacklist entry covered the entire kernel
        text. register_kprobe() in the module's own init function failed
        with -EINVAL.
    
      - On RPi4, the bogus size was ~75 TB. start + size overflowed
        64 bits and wrapped to a small address near zero. The range
        check (addr >= start && addr < end) then failed because end
        wrapped around, so the bogus entry was accidentally harmless
        and kprobes worked by luck.
    
    The same bug exists on both machines, but randomization determines whether
    the integer overflow masks it or not.
    
    Fix this by adding notrace to the __kprobes macro. Functions in
    .kprobes.text are kprobe infrastructure handlers that should never be
    traced by ftrace. With notrace, the compiler stops inserting them and the
    non-symbol gap at the section start disappears entirely.
    
    Link: https://lore.kernel.org/all/20260506012706.2785785-1-jianpeng.chang.cn@windriver.com/
    
    Fixes: baaf553d3bc3 ("arm64: Implement HAVE_DYNAMIC_FTRACE_WITH_CALL_OPS")
    Signed-off-by: Jianpeng Chang <jianpeng.chang.cn@windriver.com>
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ksmbd: close durable scavenger races against m_fp_list lookups [+ + +]

Author: DaeMyung Kang <charsyam@gmail.com>
Date:   Tue Apr 28 23:08:56 2026 +0900

    ksmbd: close durable scavenger races against m_fp_list lookups
    
    [ Upstream commit bf736184d063da1a552ffeff0481813599a182cc ]
    
    ksmbd_durable_scavenger() has two related races against any walker
    that iterates f_ci->m_fp_list, including ksmbd_lookup_fd_inode()
    (used by ksmbd_vfs_rename) and the share-mode checks in
    fs/smb/server/smb_common.c.
    
    (1) fp->node list-head reuse.  Durable-preserved handles can remain
    linked on f_ci->m_fp_list after session teardown so share-mode checks
    still see them while the handle is reconnectable.  The scavenger
    collected expired handles by adding fp->node to a local
    scavenger_list after removing them from the global durable idr.
    Because fp->node is the same list_head used by m_fp_list,
    list_add(&fp->node, &scavenger_list) overwrites the m_fp_list links
    and corrupts both lists.  CONFIG_DEBUG_LIST can report this on the
    share-mode walk path.
    
    (2) Refcount race against m_fp_list walkers.  The scavenger qualifies
    an expired durable handle with atomic_read(&fp->refcount) > 1 and
    fp->conn under global_ft.lock, removes fp from global_ft, then drops
    global_ft.lock before unlinking fp from m_fp_list and freeing it.
    During that gap fp is still linked on m_fp_list with f_state ==
    FP_INITED.  ksmbd_lookup_fd_inode() under m_lock read calls
    ksmbd_fp_get() (atomic_inc_not_zero on refcount that is still 1) and
    takes a live reference; the scavenger then unlinks and frees fp
    while the holder owns a reference, leading to UAF on the holder's
    subsequent ksmbd_fd_put() and on any field reads performed by a
    concurrent share-mode walker that iterates m_fp_list without taking
    ksmbd_fp_get() (smb_check_perm_dleases-like paths).
    
    Fix both:
    
      * Stop reusing fp->node as a scavenger-private list node.  Remove
        one expired handle from global_ft under global_ft.lock, take an
        explicit transient reference, drop the lock, unlink fp->node
        from m_fp_list under f_ci->m_lock, then drop both the durable
        lifetime and transient references with atomic_sub_and_test(2,
        &fp->refcount).  If the scavenger is the last putter the close
        runs there; otherwise an in-flight holder that already raced
        through the m_fp_list lookup owns the final close via its
        ksmbd_fd_put() path.  The one-at-a-time disposal can rescan the
        durable idr when multiple handles expire in the same pass, but
        durable scavenging is a background expiration path and the final
        full scan recomputes min_timeout before the next wait.
    
      * Clear fp->persistent_id inside __ksmbd_remove_durable_fd() right
        after idr_remove(), so a delayed final close from a holder that
        snatched fp does not re-issue idr_remove() on a persistent id
        that idr_alloc_cyclic() in ksmbd_open_durable_fd() may have
        already handed out to a brand-new durable handle.
    
      * Bypass the per-conn open_files_count decrement in
        __put_fd_final() when fp is detached from any session table
        (fp->conn cleared by session_fd_check() at durable preserve --
        paired with the volatile_id clear at unpublish, so checking
        fp->conn alone is sufficient).  The walker that owns the final
        close runs from an unrelated work->conn whose
        stats.open_files_count never tracked this durable fp; without
        this guard the holder would underflow that unrelated counter.
    
    The two races are folded into one patch because patch (1) alone
    cleans up the corrupted list but leaves a deterministic UAF window
    for m_fp_list walkers that the transient-reference and
    persistent_id discipline in (2) close; bisecting onto an
    intermediate state would land on a UAF that pre-patch chaos merely
    made less reproducible.
    
    Validation:
      * CONFIG_DEBUG_LIST coverage for the list_head reuse path.
      * KASAN-enabled direct SMB2 durable-handle coverage that exercised
        ksmbd_durable_scavenger() and non-NULL ksmbd_lookup_fd_inode()
        returns while durable handles expired under concurrent rename
        lookups, with no KASAN, UAF, list-corruption, ODEBUG, or WARNING
        reports.
      * checkpatch --strict
      * make -j$(nproc) M=fs/smb/server
    
    Fixes: d484d621d40f ("ksmbd: add durable scavenger timer")
    Signed-off-by: DaeMyung Kang <charsyam@gmail.com>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ksmbd: fix durable reconnect error path file lifetime [+ + +]

Author: Junyi Liu <moss80199@gmail.com>
Date:   Mon May 18 23:27:19 2026 +0900

    ksmbd: fix durable reconnect error path file lifetime
    
    [ Upstream commit 3515503322f4819277091839eed46b695096aca5 ]
    
    After a durable reconnect succeeds, ksmbd_reopen_durable_fd() republishes
    the same ksmbd_file into the session volatile-id table. If smb2_open()
    then takes a later error path, cleanup first calls ksmbd_fd_put(work, fp)
    and then unconditionally calls ksmbd_put_durable_fd(dh_info.fp).
    
    In this case fp and dh_info.fp are the same object. The first put drops the
    reconnect lookup reference, but the final durable put can run
    __ksmbd_close_fd(NULL, fp). Because the final close is not session-aware,
    it can free the file object without removing the volatile-id entry that was
    just published into the session table.
    
    Use the session-aware put for the final reconnect drop when the reconnect
    had already succeeded and the error path is cleaning up the republished
    file. Earlier reconnect failures, before fp is assigned to dh_info.fp, keep
    using the durable-only put path.
    
    Fixes: 1baff47b81f9 ("ksmbd: fix use-after-free in smb2_open during durable reconnect")
    Signed-off-by: Junyi Liu <moss80199@gmail.com>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ksmbd: fix null pointer dereference in compare_guid_key() [+ + +]

Author: Jeremy Laratro <research@aradex.io>
Date:   Wed May 13 08:26:16 2026 +0900

    ksmbd: fix null pointer dereference in compare_guid_key()
    
    commit 4b83cbc4c15f09b000cc06f033f64b0824b6dc87 upstream.
    
    session_fd_check() walks the per-inode m_op_list during durable-handle
    session teardown and sets op->conn = NULL for every opinfo whose conn
    matched the closing session's connection. The matching opinfo, however,
    stays linked in its per-ClientGuid lease_table_list entry's lb->lease_list
    because destroy_lease_table() only runs on full TCP-connection teardown,
    not on SESSION_LOGOFF.
    
    If the same TCP connection then negotiates a fresh session with the
    same ClientGuid (ClientGuid is bound to NEGOTIATE, not the session, and
    is unchanged across LOGOFF + SETUP) and issues a SMB2 CREATE with a
    lease context on a different inode, find_same_lease_key() walks
    lb->lease_list, reaches the stale opinfo, and calls compare_guid_key(),
    which unconditionally dereferences opinfo->conn->ClientGUID. The conn
    pointer is NULL and the kernel panics.
    
    Reproducer requires only a successful SMB2 SESSION_SETUP and a share
    configured with 'durable handles = yes'. KASAN report on mainline
    70390501d194:
    
      general protection fault, probably for non-canonical address
      0xdffffc0000000069: 0000 [#1] SMP KASAN PTI
      KASAN: null-ptr-deref in range [0x0000000000000348-0x000000000000034f]
      Workqueue: ksmbd-io handle_ksmbd_work
      RIP: 0010:bcmp+0x5b/0x230
      Call Trace:
       compare_guid_key+0x4b/0xd0
       find_same_lease_key+0x324/0x690
       smb2_open+0x6aea/0x8e60
       handle_ksmbd_work+0x796/0xee0
       ...
    
    Faulting address 0x348 is the offset of ClientGUID within struct
    ksmbd_conn, confirming opinfo->conn was NULL.
    
    Read opinfo->conn once and bail out if it has been cleared by a
    concurrent session_fd_check(). A half-detached opinfo cannot be the
    owner of an active lease, so returning 0 is the correct match result.
    
    Fixes: c8efcc786146 ("ksmbd: add support for durable handles v1/v2")
    Cc: stable@vger.kernel.org
    Signed-off-by: Jeremy Laratro <research@aradex.io>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ksmbd: fix null pointer dereference in proc_show_files() [+ + +]

Author: Jeremy Laratro <research@aradex.io>
Date:   Wed May 13 08:23:26 2026 +0900

    ksmbd: fix null pointer dereference in proc_show_files()
    
    commit 904901561e61a2b559070b20c74a8c95491f30aa upstream.
    
    When a SMB2 client opens a file with a durable v2 handle and then issues
    SMB2 SESSION_LOGOFF, session_fd_check() clears fp->tcon = NULL on the
    reconnectable file pointer but leaves the fp registered in global_ft.idr
    until the durable scavenger fires (up to fp->durable_timeout seconds
    later).
    
    During that window any read of /proc/fs/ksmbd/files (mode 0400) panics
    the kernel because proc_show_files() walks global_ft.idr and
    unconditionally dereferences fp->tcon->id with no NULL guard.
    
    Reproducer requires only a successful SMB2 SESSION_SETUP and a share
    configured with 'durable handles = yes'. KASAN report on mainline
    70390501d194:
    
      general protection fault, probably for non-canonical address
      0xdffffc0000000000: 0000 [#1] SMP KASAN PTI
      KASAN: null-ptr-deref in range [0x0000000000000000-0x0000000000000007]
      RIP: 0010:proc_show_files+0x118/0x740
      Call Trace:
       proc_show_files+0x118/0x740
       seq_read_iter+0x4ef/0xe10
       proc_reg_read_iter+0x1b7/0x280
       ...
    
    Guard the dereference. A durable-disconnected fp legitimately has no
    tcon; report its tree id as 0 rather than oopsing.
    
    Fixes: b38f99c1217a ("ksmbd: add procfs interface for runtime monitoring and statistics")
    Cc: stable@vger.kernel.org
    Signed-off-by: Jeremy Laratro <research@aradex.io>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ksmbd: fix SID memory leak in set_posix_acl_entries_dacl() on overflow [+ + +]

Author: Ferry Meng <mengferry@linux.alibaba.com>
Date:   Mon May 11 21:18:16 2026 +0800

    ksmbd: fix SID memory leak in set_posix_acl_entries_dacl() on overflow
    
    commit af92ee994cc7f7e83a41c2025f32257a2f82a7ef upstream.
    
    Commit 299f962c0b02 ("ksmbd: use check_add_overflow() to prevent u16
    DACL size overflow") added check_add_overflow() guards that break out
    of the ACE-building loops in set_posix_acl_entries_dacl() when the
    accumulated DACL size would wrap past 65535.
    
    However, each iteration allocates a struct smb_sid via kmalloc_obj()
    at the top of the loop and relies on the kfree(sid) call at the end
    of the loop body (the 'pass_same_sid' label in the first loop, and
    the explicit kfree at the tail of the second loop) to release it.
    The newly introduced 'break' statements bypass those kfree() calls,
    leaking the sid buffer every time an overflow is detected.
    
    A malicious or malformed file with enough POSIX ACL entries to trip
    the overflow check will leak one or more struct smb_sid allocations
    on every request that touches the file's DACL, providing a trivial
    kernel memory exhaustion vector.
    
    Free sid before breaking out of the loops to plug the leak.
    
    Fixes: 299f962c0b02 ("ksmbd: use check_add_overflow() to prevent u16 DACL size overflow")
    Cc: stable@vger.kernel.org
    Signed-off-by: Ferry Meng <mengferry@linux.alibaba.com>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ksmbd: validate SID in parent security descriptor during ACL inheritance [+ + +]

Author: Junyi Liu <moss80199@gmail.com>
Date:   Tue May 19 16:12:04 2026 +0900

    ksmbd: validate SID in parent security descriptor during ACL inheritance
    
    commit 69f030cf95488ae1186c72ac8c66fd279664ea7f upstream.
    
    Introduce smb_validate_ntsd_sid() helper to safely validate Owner SID
    and Group SID inside the NT Security Descriptor (smb_ntsd) retrieved
    from the parent directory.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Junyi Liu <moss80199@gmail.com>
    Signed-off-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

kunit: config: Enable KUNIT_DEBUGFS by default [+ + +]

Author: David Gow <david@davidgow.net>
Date:   Sat Apr 25 11:41:53 2026 +0800

    kunit: config: Enable KUNIT_DEBUGFS by default
    
    [ Upstream commit 17e4c68ff35090d8cb743e3c82c09f92fda1ebda ]
    
    The KUNIT_DEBUGFS option is currently enabled based on the value of
    KUNIT_ALL_TESTS, but it really doesn't have anything to do with the set of
    enabled tests, so just enable it by default anyway. In particular, this
    shouldn't be only visible if KUNIT_ALL_TESTS is set, which is quite
    confusing.
    
    Link: https://lore.kernel.org/r/20260425034155.53913-1-david@davidgow.net
    Fixes: beaed42c427d ("kunit: default KUNIT_* fragments to KUNIT_ALL_TESTS")
    Signed-off-by: David Gow <david@davidgow.net>
    Signed-off-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

kunit: config: KUNIT_DEBUGFS should depend on DEBUG_FS [+ + +]

Author: David Gow <david@davidgow.net>
Date:   Sat Apr 25 11:41:54 2026 +0800

    kunit: config: KUNIT_DEBUGFS should depend on DEBUG_FS
    
    [ Upstream commit 8f80b5b227ef9ea422080487715c841856339aed ]
    
    CONFIG_KUNIT_DEBUGFS is totally useless without debugfs, so it should
    depend on CONFIG_DEBUG_FS.
    
    Link: https://lore.kernel.org/r/20260425034155.53913-2-david@davidgow.net
    Fixes: e2219db280e3 ("kunit: add debugfs /sys/kernel/debug/kunit/<suite>/results display")
    Signed-off-by: David Gow <david@davidgow.net>
    Signed-off-by: Shuah Khan <skhan@linuxfoundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

KVM: arm64: vgic-its: Reject restored DTE with out-of-range num_eventid_bits [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Tue May 19 09:25:19 2026 -0400

    KVM: arm64: vgic-its: Reject restored DTE with out-of-range num_eventid_bits
    
    commit 9ce754ed8e7ab4e3999767ce1505f85c449ccb07 upstream.
    
    Userspace can restore an ITS Device Table Entry whose Size field encodes
    more EventID bits than the virtual ITS supports.  The live MAPD path
    rejects that state, but vgic_its_restore_dte() accepts it and stores the
    out-of-range value in dev->num_eventid_bits.
    
    Reject restored DTEs with num_eventid_bits > VITS_TYPER_IDBITS before
    allocating the device.  This mirrors the MAPD check and prevents the
    restored state from reaching vgic_its_restore_itt(), where the unchecked
    value can be converted into an oversized scan_its_table() range.
    
    Fixes: 57a9a117154c ("KVM: arm64: vgic-its: Device table save/restore")
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://lore.kernel.org/r/20260519132519.2142458-1-michael.bommarito@gmail.com
    Signed-off-by: Marc Zyngier <maz@kernel.org>
    Cc: stable@vger.kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

KVM: arm64: vgic: Free private_irqs when init fails after allocation [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Tue May 19 09:50:42 2026 -0400

    KVM: arm64: vgic: Free private_irqs when init fails after allocation
    
    commit f19c354dbd457759dfcf1195ab4bdba2bb568323 upstream.
    
    Companion to commit 250f25367b58 ("KVM: arm64: Tear down vGIC on
    failed vCPU creation"), which added the missing kvm_vgic_vcpu_destroy()
    call to the kvm_share_hyp() failure path in kvm_arch_vcpu_create(). The
    kvm_vgic_vcpu_init() failure path immediately above it has the same
    shape and still needs the same cleanup.
    
    Call kvm_vgic_vcpu_destroy() when kvm_vgic_vcpu_init() fails so private
    IRQs allocated before a redistributor iodev registration failure are
    released before the failed vCPU is freed.
    
    Fixes: 03b3d00a70b5 ("KVM: arm64: vgic: Allocate private interrupts on demand")
    Cc: stable@vger.kernel.org
    Cc: Will Deacon <will@kernel.org>
    Reviewed-by: Yuan Yao <yaoyuan@linux.alibaba.com>
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://lore.kernel.org/r/20260519135042.2219239-1-michael.bommarito@gmail.com
    Signed-off-by: Marc Zyngier <maz@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

KVM: SVM: Disable AVIC IPI virtualization on Hygon Family 18h (erratum #1235) [+ + +]

Author: Tina Zhang <zhang_wei@open-hieco.net>
Date:   Fri May 22 12:00:14 2026 +0800

    KVM: SVM: Disable AVIC IPI virtualization on Hygon Family 18h (erratum #1235)
    
    commit 9a12fa5213cfc391e0eed63902d3be98f0913765 upstream.
    
    Hygon Family 18h CPUs are derived from AMD Family 17h (Zen1) silicon and
    share the same erratum #1235: hardware may read a stale IsRunning=1 bit
    during ICR write emulation and silently fail to generate an
    AVIC_IPI_FAILURE_TARGET_NOT_RUNNING VM-Exit on the sending vCPU.
    
    The absence of the VM-Exit causes KVM to miss the required wakeup of
    blocking target vCPUs, leading to hung vCPUs and unbounded delays in
    guest execution.
    
    Extend the existing AMD Family 17h erratum #1235 workaround to also cover
    Hygon Family 18h.  With IPI virtualization disabled, KVM never sets
    IsRunning=1 in the Physical ID table, so every non-self IPI generates a
    VM-Exit and is correctly emulated.
    
    Fixes: 8de4a1c8164e ("KVM: SVM: Disable (x2)AVIC IPI virtualization if CPU has erratum #1235")
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Tina Zhang <zhang_wei@open-hieco.net>
    Message-ID: <20260522040014.3380201-1-zhang_wei@open-hieco.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

l2tp: use list_del_rcu in l2tp_session_unhash [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Mon May 18 14:34:47 2026 -0400

    l2tp: use list_del_rcu in l2tp_session_unhash
    
    commit 979c017803c40829b03acd9e5236e354b7622360 upstream.
    
    An unprivileged local user can pin a host CPU indefinitely in
    l2tp_session_get_by_ifname() by issuing L2TP_CMD_SESSION_GET on
    L2TP_ATTR_IFNAME concurrently with L2TP_CMD_SESSION_CREATE and
    L2TP_CMD_SESSION_DELETE on the same tunnel. All three commands take
    GENL_UNS_ADMIN_PERM, so CAP_NET_ADMIN in the netns user namespace
    suffices; on any host that has l2tp_core loaded the trigger is
    reachable from a standard `unshare -Urn` sandbox.
    
    l2tp_session_unhash() removes a session from tunnel->session_list
    with list_del_init(), but that list is walked by
    l2tp_session_get_by_ifname() with list_for_each_entry_rcu() under
    rcu_read_lock_bh(). list_del_init() leaves the deleted entry's
    next/prev self-pointing; a reader that has loaded the entry and
    then advances pos->list.next reads &session->list, container_of()s
    back to the same session, and list_for_each_entry_rcu() never
    reaches the list head. The CPU stays in strcmp() inside the
    walker, with BH and preemption disabled, so RCU grace periods on
    the host stall behind it and the wedged thread cannot be killed
    (SIGKILL is delivered on syscall return).
    
    Use list_del_rcu() to match the existing list_add_rcu() in
    l2tp_session_register(); the deleted session remains visible to
    in-flight walkers with consistent next/prev pointers until
    kfree_rcu() in l2tp_session_free() releases it. tunnel->session_list
    has exactly one list_del_init() call site; the list_del_init
    (&session->clist) at l2tp_core.c:533 operates on the per-collision
    list, which is not walked under RCU. list_empty(&session->list) is
    not used anywhere in net/l2tp/ after the unhash point, so dropping
    the post-delete self-init is safe; the fix has no userspace-visible
    behavior change.
    
    Fixes: 89b768ec2dfef ("l2tp: use rcu list add/del when updating lists")
    Cc: stable@vger.kernel.org # 6.11+
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/20260518183447.64078-1-michael.bommarito@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Linux: Linux 7.0.11 [+ + +]

Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Mon Jun 1 17:54:55 2026 +0200

    Linux 7.0.11
    
    Link: https://lore.kernel.org/r/20260528194646.819809818@linuxfoundation.org
    Tested-by: Ronald Warsow <rwarsow@gmx.de>
    Tested-by: Takeshi Ogasawara <takeshi.ogasawara@futuring-girl.com>
    Tested-by: Ron Economos <re@w6rz.net>
    Tested-by: Luna Jernberg <droidbittin@gmail.com>
    Tested-by: Miguel Ojeda <ojeda@kernel.org>
    Tested-by: Brett A C Sheffield <bacs@librecast.net>
    Tested-by: Salvatore Bonaccorso <carnil@debian.org>
    Tested-by: Pavel Machek (CIP) <pavel@nabladev.com>
    Tested-by: Jeffrin Jose T <jeffrin@rajagiritech.edu.in>
    Tested-by: Peter Schneider <pschneider1968@googlemail.com>
    Tested-by: Masoud Aghasi <maghasi@disroot.org>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Tested-by: Mark Brown <broonie@kernel.org>
    Tested-by: Markus Reichelt <lkt+2023@mareichelt.com>
    Tested-by: Barry K. Nathan <barryn@pobox.com>
    Tested-by: Kalden Elphick <kalden.elphick@gmail.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: kprobes: Fix handling of fatal unrecoverable recursions [+ + +]

Author: Tiezhu Yang <yangtiezhu@loongson.cn>
Date:   Fri May 22 15:05:07 2026 +0800

    LoongArch: kprobes: Fix handling of fatal unrecoverable recursions
    
    [ Upstream commit 1c856e158fd34ef2c4475a81c1dc386329989938 ]
    
    KPROBE_HIT_SS and KPROBE_REENTER are two types of fatal recursions that
    can not be safely recovered in kprobes.
    
    KPROBE_HIT_SS means that a kprobe is hit during single-stepping. At
    this point, the architecture-specific single-step context is already
    active. Nested single-stepping would corrupt the state, as the kprobe
    control block (kcb) and hardware registers cannot safely store multiple
    levels of stepping state.
    
    KPROBE_REENTER means that a third-level recursion occurs when a probe
    is hit while the system is already handling a nested probe (second-
    level). The kcb only provides a single slot (prev_kprobe) to backup the
    state. When a third probe is hit, there is no more space to save the
    state without corrupting the first-level backup.
    
    Kprobes work by replacing instructions with breakpoints. In order to
    execute the original instruction and continue, it must be moved to a
    temporary "single-step" slot. Since there is no backup space left to
    set up this slot safely, the CPU would be forced to return to the same
    original breakpoint address, triggering an endless loop.
    
    Currently, the code only prints a warning and returns. This leads to
    an infinite re-entry loop as the CPU repeatedly hits the same trap and
    a "stuck" CPU core because preemption was disabled at the start of the
    handler and never re-enabled in this early return path.
    
    Fix the logic by:
    1. Merging KPROBE_HIT_SS and KPROBE_REENTER cases, as both represent
       fatal recursions that cannot be safely recovered.
    2. Replacing WARN_ON_ONCE() with BUG() to terminate the system. This
       aligns LoongArch with other architectures (x86, arm64, riscv) and
       prevents stack overflow while providing diagnostic information.
    
    Fixes: 6d4cc40fb5f5 ("LoongArch: Add kprobes support")
    Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

LoongArch: kprobes: Use larch_insn_text_copy() to patch instructions [+ + +]

Author: Tiezhu Yang <yangtiezhu@loongson.cn>
Date:   Fri May 22 15:05:07 2026 +0800

    LoongArch: kprobes: Use larch_insn_text_copy() to patch instructions
    
    commit e3ef9a28f558d1cbf0b42d6dcd16c60da557562b upstream.
    
    On SMP systems, kprobe handlers would occasionally fail to execute on
    certain CPU cores. The issue is hard to reproduce and typically occurs
    randomly under high system load.
    
    The root cause is a software-side instruction hazard. According to the
    LoongArch Reference Manual, while the cache coherency is maintained by
    hardware, software must explicitly use the "IBAR" instruction to ensure
    the instruction fetch unit (IFU) observes the effects of recent stores.
    
    The current arch_arm_kprobe() and arch_disarm_kprobe() only execute the
    "IBAR" barrier (via flush_insn_slot -> local_flush_icache_range) on the
    local CPU. This leaves a vulnerable window where remote CPU cores may
    continue executing stale instructions from their pipelines or prefetch
    buffers, as they have not executed an "IBAR" since the code modification.
    
    Switch to larch_insn_text_copy() to fix this:
    1. Synchronization: It uses stop_machine_cpuslocked() to synchronize all
       online CPUs, ensuring no CPU is executing the target code area during
       modification.
    2. Visibility: By passing cpu_online_mask to stop_machine_cpuslocked(),
       the callback text_copy_cb() is executed on all online cores. Each CPU
       core invokes local_flush_icache_range() to execute "IBAR", clearing
       instruction hazards system-wide and ensuring the "break" instruction
       is visible to the fetch units of all cores.
    3. Robustness: It properly manages memory write permissions (ROX/RW) for
       the kernel text segment during patching, ensuring compatibility with
       CONFIG_STRICT_KERNEL_RWX.
    
    Cc: <stable@vger.kernel.org>  # 6.18+
    Fixes: 6d4cc40fb5f5 ("LoongArch: Add kprobes support")
    Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

LoongArch: Remove unused code to avoid build warning [+ + +]

Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Thu May 21 20:58:40 2026 +0800

    LoongArch: Remove unused code to avoid build warning
    
    commit 0ccc9d47cf020994097ff51827cebd04aa2b0bf4 upstream.
    
    After commit feee6b2989165631b1 ("mm/memory_hotplug: shrink zones when
    offlining memory"), __remove_pages() doesn't need the "zone" parameter
    so the "page" variable is also unused. Remove the unused code to avoid
    such build warning:
    
    arch/loongarch/mm/init.c: In function 'arch_remove_memory':
    arch/loongarch/mm/init.c:134:22: warning: variable 'page' set but not used [-Wunused-but-set-variable=]
      134 |         struct page *page = pfn_to_page(start_pfn);
    
    Cc: <stable@vger.kernel.org>
    Reviewed-by: Guo Ren <guoren@kernel.org>
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

lsm: hold cred_guard_mutex for lsm_set_self_attr() [+ + +]

Author: Stephen Smalley <stephen.smalley.work@gmail.com>
Date:   Wed May 13 14:05:06 2026 -0400

    lsm: hold cred_guard_mutex for lsm_set_self_attr()
    
    commit 4a9b16541ad3faf8bccb398532bf3f8b6bbf1188 upstream.
    
    Just as proc_pid_attr_write() already does before calling the LSM
    hook. This only matters for SELinux and AppArmor which check
    whether the process is being ptraced and if so, whether to
    allow the transition.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Stephen Smalley <stephen.smalley.work@gmail.com>
    Acked-by: Casey Schaufler <casey@schaufler-ca.com>
    Signed-off-by: Paul Moore <paul@paul-moore.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/damon/sysfs-schemes: call missing mem_cgroup_iter_break() [+ + +]

Author: SeongJae Park <sj@kernel.org>
Date:   Sun Apr 26 10:36:12 2026 -0700

    mm/damon/sysfs-schemes: call missing mem_cgroup_iter_break()
    
    commit d4e7b5c4cc353f154d5ab8bb2e1ce7714d77a6e9 upstream.
    
    damon_sysfs_memcg_path_to_id() breaks mem_cgroup_iter() loop without
    calling mem_cgroup_iter_break().  This leaks the cgroup reference.  Fix
    the issue by calling mem_cgroup_iter_break() before the break.
    
    The issue was discovered [1] by Sashiko.
    
    Link: https://lore.kernel.org/20260426173625.86521-1-sj@kernel.org
    Link: https://lore.kernel.org/20260423004148.74722-1-sj@kernel.org [1]
    Fixes: 29cbb9a13f05 ("mm/damon/sysfs-schemes: implement scheme filters")
    Signed-off-by: SeongJae Park <sj@kernel.org>
    Cc: <stable@vger.kernel.org> # 6.3.x
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/damon: fix damos_stat tracepoint format for sz_applied [+ + +]

Author: SeongJae Park <sj@kernel.org>
Date:   Sun Apr 26 12:31:17 2026 -0700

    mm/damon: fix damos_stat tracepoint format for sz_applied
    
    commit 620072fd783290ad92c2d445a47b0a61b161f352 upstream.
    
    The print format is wrongly marking sz_applied as sz_tried.  Fix it.
    
    Link: https://lore.kernel.org/20260426193119.88095-1-sj@kernel.org
    Fixes: 804c26b961da ("mm/damon/core: add trace point for damos stat per apply interval")
    Signed-off-by: SeongJae Park <sj@kernel.org>
    Cc: "Masami Hiramatsu (Google)" <mhiramat@kernel.org>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Cc: Steven Rostedt <rostedt@goodmis.org>
    Cc: <stable@vger.kernel.org> # 7.0.x
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/memfd_luo: report error when restoring a folio fails mid-loop [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Wed Apr 15 06:23:00 2026 +0100

    mm/memfd_luo: report error when restoring a folio fails mid-loop
    
    [ Upstream commit 0fb1daf0b78d0e23b63b6b65de56d4a3fd83bc14 ]
    
    memfd_luo_retrieve_folios() initialises err to -EIO, but the per-iteration
    calls to mem_cgroup_charge(), shmem_add_to_page_cache() and
    shmem_inode_acct_blocks() reuse and overwrite err.  Once any iteration
    completes successfully, err becomes zero.
    
    If a later iteration's kho_restore_folio() returns NULL, the failure path
    jumps to put_folios without resetting err, so the function returns 0.
    The caller memfd_luo_retrieve() then takes the success path, sets
    args->file and reports the restore as successful, leaving userspace with
    a partially populated memfd and no indication that anything went wrong.
    
    Set err to -EIO in the kho_restore_folio() failure branch so the error
    is propagated to the caller.
    
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Reviewed-by: Pratyush Yadav <pratyush@kernel.org>
    Fixes: b3749f174d68 ("mm: memfd_luo: allow preserving memfd")
    Link: https://patch.msgid.link/20260415052300.362539-1-devnexen@gmail.com
    Signed-off-by: Mike Rapoport (Microsoft) <rppt@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

mm/memory: fix spurious warning when unmapping device-private/exclusive pages [+ + +]

Author: Alistair Popple <apopple@nvidia.com>
Date:   Fri May 1 16:51:16 2026 +1000

    mm/memory: fix spurious warning when unmapping device-private/exclusive pages
    
    commit be3f38d05cc5a7c3f13e51994c5dd043ab604d28 upstream.
    
    Device private and exclusive entries are only supported for anonymous
    folios.  This condition is tested in __migrate_device_pages() and
    make_device_exclusive() using folio_test_anon().  However the unmap path
    tests this assumption using vma_is_anonymous().
    
    This is wrong because whilst anonymous VMAs can only contain folios where
    folio_test_anon() is true the opposite relation does not hold.  A folio
    for which folio_test_anon() is true does not imply vma_is_anonymous() is
    true.  Such a condition can occur if for example a folio is part of a
    private filebacked mapping.
    
    In this case vma_is_anonymous() is false as the mapping is filebacked, but
    folio_test_anon() may be true, thus permitting devices to migrate the
    folio to device private memory.  This can lead to the following spurious
    warnings during process teardown:
    
    [  772.737706] ------------[ cut here ]------------
    [  772.739201] WARNING: mm/memory.c:1754 at unmap_page_range.cold+0x26/0x18a, CPU#17: hmm-tests/2041
    [  772.742050] Modules linked in: test_hmm nvidia_uvm(O) nvidia(O)
    [  772.743959] CPU: 17 UID: 0 PID: 2041 Comm: hmm-tests Tainted: G        W  O        7.0.0+ #387 PREEMPT(full)
    [  772.747104] Tainted: [W]=WARN, [O]=OOT_MODULE
    [  772.748509] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.17.0-0-gb52ca86e094d-prebuilt.qemu.org 04/01/2014
    [  772.752117] RIP: 0010:unmap_page_range.cold+0x26/0x18a
    [  772.753780] Code: 7e fe ff ff 48 89 4c 24 78 4c 89 44 24 38 e8 f2 ff b1 00 48 8b 4c 24 78 4c 8b 44 24 38 48 8b 44 24 18 48 83 78 48 00 74 04 90 <0f> 0b 90 48 89 ca b8 ff ff 37 00 48 c1 ea 03 48 c1 e0 2a 80 3c 02
    [  772.759602] RSP: 0018:ffff888112607550 EFLAGS: 00010286
    [  772.761310] RAX: ffff88811bbf4dc0 RBX: dffffc0000000000 RCX: ffffea03e9bfffd8
    [  772.763583] RDX: 1ffff1102377e9c1 RSI: 0000000000000008 RDI: ffff88811bbf4e08
    [  772.765914] RBP: 0000000000000006 R08: ffff8881059f7448 R09: ffffed10224c0e68
    [  772.768184] R10: ffff888112607347 R11: 0000000000000001 R12: 0000000000000001
    [  772.770461] R13: ffffea03e9bfffc0 R14: ffff888112607908 R15: ffffea03e9bfffc0
    [  772.772782] FS:  00007f327caa2780(0000) GS:ffff888427b7d000(0000) knlGS:0000000000000000
    [  772.775328] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [  772.777187] CR2: 00007f327ca89000 CR3: 00000001994d5000 CR4: 00000000000006f0
    [  772.779135] Call Trace:
    [  772.779792]  <TASK>
    [  772.780317]  ? dmirror_interval_invalidate+0x1a3/0x290 [test_hmm]
    [  772.781873]  ? vm_normal_page_pud+0x2b0/0x2b0
    [  772.782992]  ? __rwlock_init+0x150/0x150
    [  772.784006]  ? lock_release+0x216/0x2b0
    [  772.785008]  ? __mmu_notifier_invalidate_range_start+0x505/0x6e0
    [  772.786522]  ? lock_release+0x216/0x2b0
    [  772.787498]  ? unmap_single_vma+0xb6/0x210
    [  772.788573]  unmap_vmas+0x27d/0x520
    [  772.789506]  ? unmap_single_vma+0x210/0x210
    [  772.790607]  ? mas_update_gap.part.0+0x620/0x620
    [  772.791834]  unmap_region+0x19e/0x350
    [  772.792769]  ? remove_vma+0x130/0x130
    [  772.793684]  ? mas_alloc_nodes+0x1f2/0x300
    [  772.794730]  vms_complete_munmap_vmas+0x8c1/0xe20
    [  772.795926]  ? unmap_region+0x350/0x350
    [  772.796917]  do_vmi_align_munmap+0x36a/0x4e0
    [  772.798018]  ? lock_release+0x216/0x2b0
    [  772.799024]  ? vma_shrink+0x620/0x620
    [  772.799983]  do_vmi_munmap+0x150/0x2c0
    [  772.800939]  __vm_munmap+0x161/0x2c0
    [  772.801872]  ? expand_downwards+0xd60/0xd60
    [  772.802948]  ? clockevents_program_event+0x1ef/0x540
    [  772.804217]  ? lock_release+0x216/0x2b0
    [  772.805158]  __x64_sys_munmap+0x59/0x80
    [  772.805776]  do_syscall_64+0xfc/0x670
    [  772.806336]  ? irqentry_exit+0xda/0x580
    [  772.806976]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    [  772.807772] RIP: 0033:0x7f327cbb2717
    [  772.808323] Code: 73 01 c3 48 8b 0d f9 76 0d 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 b8 0b 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d c9 76 0d 00 f7 d8 64 89 01 48
    [  772.811337] RSP: 002b:00007ffde7f57d38 EFLAGS: 00000202 ORIG_RAX: 000000000000000b
    [  772.812564] RAX: ffffffffffffffda RBX: 00007f327cc9c000 RCX: 00007f327cbb2717
    [  772.813733] RDX: 0000000000000000 RSI: 0000000000400000 RDI: 00007f327c289000
    [  772.814867] RBP: 0000000000421360 R08: 000000000000001a R09: 0000000000000000
    [  772.815991] R10: 0000000000000003 R11: 0000000000000202 R12: 00007ffde7f57d74
    [  772.817121] R13: 00007f327c689010 R14: 0000000000100000 R15: 00007f327c289000
    [  772.818272]  </TASK>
    [  772.818614] irq event stamp: 0
    [  772.819159] hardirqs last  enabled at (0): [<0000000000000000>] 0x0
    [  772.820174] hardirqs last disabled at (0): [<ffffffff82a57ab3>] copy_process+0x19f3/0x6440
    [  772.821511] softirqs last  enabled at (0): [<ffffffff82a57b00>] copy_process+0x1a40/0x6440
    [  772.822869] softirqs last disabled at (0): [<0000000000000000>] 0x0
    [  772.823871] ---[ end trace 0000000000000000 ]---
    
    Fix this by using the same check for folio_test_anon() in
    zap_nonpresent_ptes(). Also add a hmm-test case for this.
    
    Link: https://lore.kernel.org/20260501065116.2057242-1-apopple@nvidia.com
    Fixes: 999dad824c39 ("mm/shmem: persist uffd-wp bit across zapping for file-backed")
    Signed-off-by: Alistair Popple <apopple@nvidia.com>
    Reported-by: Arsen Arsenović <aarsenovic@baylibre.com>
    Reviewed-by: Balbir Singh <balbirs@nvidia.com>
    Cc: David Hildenbrand <david@kernel.org>
    Cc: Jason Gunthorpe <jgg@ziepe.ca>
    Cc: John Hubbard <jhubbard@nvidia.com>
    Cc: Leon Romanovsky <leon@kernel.org>
    Cc: Liam R. Howlett <liam@infradead.org>
    Cc: Lorenzo Stoakes <ljs@kernel.org>
    Cc: Peter Xu <peterx@redhat.com>
    Cc: Matthew Brost <matthew.brost@intel.com>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Mike Rapoport <rppt@kernel.org>
    Cc: Shuah Khan <shuah@kernel.org>
    Cc: Suren Baghdasaryan <surenb@google.com>
    Cc: Thomas Hellström <thomas.hellstrom@linux.intel.com>
    Cc: Vlastimil Babka <vbabka@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/memory_hotplug: fix memory block reference leak on remove [+ + +]

Author: Muchun Song <muchun.song@linux.dev>
Date:   Tue Apr 28 16:52:17 2026 +0800

    mm/memory_hotplug: fix memory block reference leak on remove
    
    commit 93866f55f7e292fe3d47d36c9efe5ee10213a06b upstream.
    
    Patch series "mm: Fix memory block leaks and locking", v2.
    
    This series fixes two memory block device reference leaks and one locking
    issue around the per-memory_block hwpoison counter.
    
    
    This patch (of 2):
    
    remove_memory_blocks_and_altmaps() looks up each memory block with
    find_memory_block(), which acquires a reference to the memory block
    device.
    
    That reference is never dropped on this path, resulting in a leaked device
    reference when removing memory blocks and their altmaps.  Drop the
    reference after retrieving mem->altmap and clearing mem->altmap, before
    removing the memory block device.
    
    Link: https://lore.kernel.org/20260428085219.1316047-1-songmuchun@bytedance.com
    Link: https://lore.kernel.org/20260428085219.1316047-2-songmuchun@bytedance.com
    Fixes: 6b8f0798b85a ("mm/memory_hotplug: split memmap_on_memory requests across memblocks")
    Signed-off-by: Muchun Song <songmuchun@bytedance.com>
    Acked-by: Oscar Salvador <osalvador@suse.de>
    Acked-by: David Hildenbrand (Arm) <david@kernel.org>
    Cc: Danilo Krummrich <dakr@kernel.org>
    Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Cc: "Huang, Ying" <huang.ying.caritas@gmail.com>
    Cc: Miaohe Lin <linmiaohe@huawei.com>
    Cc: Naoya Horiguchi <nao.horiguchi@gmail.com>
    Cc: "Rafael J. Wysocki" <rafael@kernel.org>
    Cc: Vishal Verma <vishal.l.verma@intel.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/migrate_device: fix spinlock leak in migrate_vma_insert_huge_pmd_page [+ + +]

Author: Sunny Patel <nueralspacetech@gmail.com>
Date:   Sat Apr 25 19:05:27 2026 +0530

    mm/migrate_device: fix spinlock leak in migrate_vma_insert_huge_pmd_page
    
    commit 63451de16e0a08be40f9ab5e7c5c8f5c79676fb1 upstream.
    
    When check_stable_address_space() fails after the PMD spinlock has
    been acquired via pmd_lock(), the code jumps directly to the abort
    label, bypassing the spin_unlock() call in unlock_abort. This causes
    the PMD spinlock to be permanently held, leading to a deadlock.
    
    Change the goto target from abort to unlock_abort to ensure the
    spinlock is always released on this error path.
    
    Link: https://lore.kernel.org/20260425133537.17463-1-nueralspacetech@gmail.com
    Fixes: a30b48bf1b24 ("mm/migrate_device: implement THP migration of zone device pages")
    Signed-off-by: Sunny Patel <nueralspacetech@gmail.com>
    Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
    Acked-by: Zi Yan <ziy@nvidia.com>
    Acked-by: Balbir Singh <balbirs@nvidia.com>
    Acked-by: David Hildenbrand (Arm) <david@kernel.org>
    Cc: Alistair Popple <apopple@nvidia.com>
    Cc: Byungchul Park <byungchul@sk.com>
    Cc: Gregory Price <gourry@gourry.net>
    Cc: "Huang, Ying" <ying.huang@linux.alibaba.com>
    Cc: Joshua Hahn <joshua.hahnjy@gmail.com>
    Cc: Matthew Brost <matthew.brost@intel.com>
    Cc: Rakie Kim <rakie.kim@sk.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/page_alloc: fix initialization of tags of the huge zero folio with init_on_free [+ + +]

Author: David Hildenbrand (Arm) <david@kernel.org>
Date:   Tue Apr 21 17:39:07 2026 +0200

    mm/page_alloc: fix initialization of tags of the huge zero folio with init_on_free
    
    commit 6a288a4ddb4a994490505ab5f41c445f8e6b6467 upstream.
    
    __GFP_ZEROTAGS semantics are currently a bit weird, but effectively this
    flag is only ever set alongside __GFP_ZERO and __GFP_SKIP_KASAN.
    
    If we run with init_on_free, we will zero out pages during
    __free_pages_prepare(), to skip zeroing on the allocation path.
    
    However, when allocating with __GFP_ZEROTAG set, post_alloc_hook() will
    consequently not only skip clearing page content, but also skip clearing
    tag memory.
    
    Not clearing tags through __GFP_ZEROTAGS is irrelevant for most pages that
    will get mapped to user space through set_pte_at() later: set_pte_at() and
    friends will detect that the tags have not been initialized yet
    (PG_mte_tagged not set), and initialize them.
    
    However, for the huge zero folio, which will be mapped through a PMD
    marked as special, this initialization will not be performed, ending up
    exposing whatever tags were still set for the pages.
    
    The docs (Documentation/arch/arm64/memory-tagging-extension.rst) state
    that allocation tags are set to 0 when a page is first mapped to user
    space.  That no longer holds with the huge zero folio when init_on_free is
    enabled.
    
    Fix it by decoupling __GFP_ZEROTAGS from __GFP_ZERO, passing to
    tag_clear_highpages() whether we want to also clear page content.
    
    Invert the meaning of the tag_clear_highpages() return value to have
    clearer semantics.
    
    Reproduced with the huge zero folio by modifying the check_buffer_fill
    arm64/mte selftest to use a 2 MiB area, after making sure that pages have
    a non-0 tag set when freeing (note that, during boot, we will not actually
    initialize tags, but only set KASAN_TAG_KERNEL in the page flags).
    
            $ ./check_buffer_fill
            1..20
            ...
            not ok 17 Check initial tags with private mapping, sync error mode and mmap memory
            not ok 18 Check initial tags with private mapping, sync error mode and mmap/mprotect memory
            ...
    
    This code needs more cleanups; we'll tackle that next, like
    decoupling __GFP_ZEROTAGS from __GFP_SKIP_KASAN.
    
    [akpm@linux-foundation.org: s/__GPF_ZERO/__GFP_ZERO/, per David]
    Link: https://lore.kernel.org/20260421-zerotags-v2-1-05cb1035482e@kernel.org
    Fixes: adfb6609c680 ("mm/huge_memory: initialise the tags of the huge zero folio")
    Signed-off-by: David Hildenbrand (Arm) <david@kernel.org>
    Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
    Tested-by: Lance Yang <lance.yang@linux.dev>
    Cc: Brendan Jackman <jackmanb@google.com>
    Cc: Dev Jain <dev.jain@arm.com>
    Cc: Johannes Weiner <hannes@cmpxchg.org>
    Cc: Liam Howlett <liam@infradead.org>
    Cc: Lorenzo Stoakes (Oracle) <ljs@kernel.org>
    Cc: Mark Brown <broonie@kernel.org>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Mike Rapoport <rppt@kernel.org>
    Cc: Ryan Roberts <ryan.roberts@arm.com>
    Cc: Suren Baghdasaryan <surenb@google.com>
    Cc: Will Deacon <will@kernel.org>
    Cc: Zi Yan <ziy@nvidia.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm/slub: hold cpus_read_lock around flush_rcu_sheaves_on_cache() [+ + +]

Author: Qing Wang <wangqing7171@gmail.com>
Date:   Tue May 12 11:50:35 2026 +0800

    mm/slub: hold cpus_read_lock around flush_rcu_sheaves_on_cache()
    
    commit 67ea9d353d0ba12bdbc9183ff568dead9e949b80 upstream.
    
    flush_rcu_sheaves_on_cache() calls queue_work_on() in a
    for_each_online_cpu() loop, which requires the cpu to stay online.
    But cpus_read_lock() is not held in kvfree_rcu_barrier_on_cache() and the
    set of "online cpus" is subject to change.
    
    There are two paths that call flush_rcu_sheaves_on_cache():
    
      // has cpus_read_lock()
      flush_all_rcu_sheaves()
        -> flush_rcu_sheaves_on_cache()
    
      // no cpus_read_lock()
      kvfree_rcu_barrier_on_cache()
        -> flush_rcu_sheaves_on_cache()
    
    Fix this by holding cpus_read_lock() in kvfree_rcu_barrier_on_cache().
    
    Why not move cpus_read_lock() from flush_all_rcu_sheaves() into
    flush_rcu_sheaves_on_cache()? The reason is it would introduce a new lock
    order (slab_mutex -> cpu_hotplug_lock). The reverse order
    (cpu_hotplug_lock -> slab_mutex) is established by
    
    - cpuhp_setup_state_nocalls(..., slub_cpu_setup, ...)
    - kmem_cache_destroy()
    
    The two orders together would form an AB-BA deadlock.
    
    Finally, add lockdep_assert_cpus_held() in flush_rcu_sheaves_on_cache()
    to catch the same problem in the future.
    
    Fixes: 0f35040de593 ("mm/slab: introduce kvfree_rcu_barrier_on_cache() for cache destruction")
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Qing Wang <wangqing7171@gmail.com>
    Link: https://patch.msgid.link/20260512035035.762317-1-wangqing7171@gmail.com
    Signed-off-by: Vlastimil Babka (SUSE) <vbabka@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mm: fix __vm_normal_page() to handle missing support for pmd_special()/pud_special() [+ + +]

Author: David Hildenbrand (Arm) <david@kernel.org>
Date:   Thu Apr 30 13:31:22 2026 +0200

    mm: fix __vm_normal_page() to handle missing support for pmd_special()/pud_special()
    
    commit c0c6ccd9828c3a1950623b546fa57292a77b5c73 upstream.
    
    On x86 32-bit with THP enabled, zap_huge_pmd() is seen to generate a
    "WARNING: mm/memory.c:735 at __vm_normal_page+0x6a/0x7d", from the
    VM_WARN_ON_ONCE(is_zero_pfn(pfn) || is_huge_zero_pfn(pfn)); followed by
    "BUG: Bad rss-counter state"s, then later "BUG: Bad page state"s when
    reclaim gets to call shrink_huge_zero_folio_scan().
    
    It's as if the _PAGE_SPECIAL bit never got set in the huge_zero pmd: and
    indeed, whereas pte_special() and pte_mkspecial() are subject to a
    dedicated CONFIG_ARCH_HAS_PTE_SPECIAL, pmd_special() and pmd_mkspecial()
    are subject to CONFIG_ARCH_SUPPORTS_PMD_PFNMAP, which is never enabled on
    any 32-bit architecture.
    
    While the problem was exposed through commit d80a9cb1a64a
    ("mm/huge_memory: add and use normal_or_softleaf_folio_pmd()"), it was an
    oversight in commit af38538801c6 ("mm/memory: factor out common code from
    vm_normal_page_*()") and would result in other problems:
    * huge zero folio accounted in smaps, pagemap (PAGE_IS_FILE) and
      numamaps as file-backed THP
    * folio_walk_start() returning the folio even without FW_ZEROPAGE set.
      Callers seem to tolerate that, though.
    
    ... and triggering the VM_WARN_ON_ONE(), although never reported so far.
    
    To fix it, teach vm_normal_page_pmd()/vm_normal_page_pud() to consider
    whether pmd_special/pud_special is actually implemented.
    
    Link: https://lore.kernel.org/20260430-pmd_special-v1-1-dbcbcfd72c20@kernel.org
    Fixes: af38538801c6 ("mm/memory: factor out common code from vm_normal_page_*()")
    Signed-off-by: David Hildenbrand (Arm) <david@kernel.org>
    Reported-by: Hugh Dickins <hughd@google.com>
    Closes: https://lore.kernel.org/r/74a75b59-2e13-3985-ee99-d5521f39df2a@google.com
    Reported-by: Bibo Mao <maobibo@loongson.cn>
    Closes: https://lore.kernel.org/r/20260430041121.2839350-1-maobibo@loongson.cn
    Debugged-by: Hugh Dickins <hughd@google.com>
    Reviewed-by: Lance Yang <lance.yang@linux.dev>
    Tested-by: Bibo Mao <maobibo@loongson.cn>
    Reviewed-by: Baolin Wang <baolin.wang@linux.alibaba.com>
    Reviewed-by: Oscar Salvador <osalvador@suse.de>
    Reviewed-by: Lorenzo Stoakes <ljs@kernel.org>
    Cc: Liam R. Howlett <liam@infradead.org>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Mike Rapoport <rppt@kernel.org>
    Cc: Suren Baghdasaryan <surenb@google.com>
    Cc: Vlastimil Babka <vbabka@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mptcp: do not drop partial packets [+ + +]

Author: Shardul Bankar <shardul.b@mpiricsoftware.com>
Date:   Fri May 15 06:27:32 2026 +0200

    mptcp: do not drop partial packets
    
    commit 50c2d91c5dfa0e465826ec1f8dbad9cdc254bd85 upstream.
    
    When a packet arrives with map_seq < ack_seq < end_seq, the beginning
    of the packet has already been acknowledged but the end contains new
    data. Currently the entire packet is dropped as "old data," forcing
    the sender to retransmit.
    
    Instead, skip the already-acked bytes by adjusting the skb offset and
    enqueue only the new portion. Update bytes_received and ack_seq to
    reflect the new data consumed.
    
    A previous attempt at this fix has been sent by Paolo Abeni [1], but had
    issues [2]: it also added a zero-window check and changed rcv_wnd_sent
    initialization, which caused test regressions. This version addresses
    only the partial packet handling without modifying receive window
    accounting.
    
    Fixes: ab174ad8ef76 ("mptcp: move ooo skbs into msk out of order queue.")
    Cc: stable@vger.kernel.org
    Link: https://lore.kernel.org/c9b426a4e163aa3c4fe8b80c79f1a610f47ae7d8.1763075056.git.pabeni@redhat.com [1]
    Closes: https://github.com/multipath-tcp/mptcp_net-next/issues/600 [2]
    Signed-off-by: Shardul Bankar <shardul.b@mpiricsoftware.com>
    [pabeni@redhat.com: update map]
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Reviewed-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Signed-off-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Link: https://patch.msgid.link/20260515-net-mptcp-misc-fixes-7-1-rc4-v2-1-701e96419f2f@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mptcp: pm: fix ADD_ADDR timer infinite retry on option space insufficient [+ + +]

Author: Li Xiasong <lixiasong1@huawei.com>
Date:   Fri May 15 06:27:33 2026 +0200

    mptcp: pm: fix ADD_ADDR timer infinite retry on option space insufficient
    
    commit 51e398a3b8961b26a8c0a4ba9a777c5339791707 upstream.
    
    When TCP option space is insufficient (e.g., when sending ADD_ADDR with an
    IPv6 address and port while tcp_timestamps is enabled), the original code
    jumped to out_unlock without clearing the addr_signal flag. This caused
    mptcp_pm_add_timer to keep rescheduling indefinitely, not sending ADD_ADDR,
    preventing subsequent addresses in the endpoint list from being announced.
    
    Handle this case by clearing the ADD_ADDR signal and skipping the matching
    ADD_ADDR retransmission entry. The skip path cancels the matching timer
    (with id check) and advances PM state progression, preserving forward
    progress to subsequent PM work.
    
    This cancellation is inherently best-effort. A concurrent add_timer
    callback may already be running and may acquire pm.lock before the
    cancel path updates entry state. In that case, one final ADD_ADDR
    transmit attempt can still be executed.
    
    Once the cancel path sets entry->retrans_times to ADD_ADDR_RETRANS_MAX,
    the callback-side retrans_times check suppresses further ADD_ADDR
    retransmissions.
    
    Note that when an ADD_ADDR is being prepared, a pure-ACK is queued. On
    the output side, it means that it is fine to skip non-pure-ACK packets,
    when drop_other_suboptions is set: a pure-ACK will be processed soon
    after.
    
    Fixes: 00cfd77b9063 ("mptcp: retransmit ADD_ADDR when timeout")
    Cc: stable@vger.kernel.org
    Signed-off-by: Li Xiasong <lixiasong1@huawei.com>
    Reviewed-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Signed-off-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Link: https://patch.msgid.link/20260515-net-mptcp-misc-fixes-7-1-rc4-v2-2-701e96419f2f@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

mptcp: reset rcv wnd on disconnect [+ + +]

Author: Paolo Abeni <pabeni@redhat.com>
Date:   Fri May 15 06:27:35 2026 +0200

    mptcp: reset rcv wnd on disconnect
    
    commit 0981f90e1a05773a4c29c6e720f5ea1e3c8f1876 upstream.
    
    If the MPTCP socket fallback to TCP before the MP handshake completion,
    the IASN remain 0, and the rcv_wnd_sent field is not explicitly
    initialized, just incremented over time with the data transfer.
    
    At disconnect time such value is not cleared. If the next connection falls
    back to TCP before the MP handshake completion, the data transfer will
    keep incrementing the receive window end sequence starting from the last
    value used in the previous connection: the announced window will be
    unrelated from the actual receiver buffer size and likely too big.
    
    Address the issue zeroing the field at disconnect time.
    
    Fixes: b29fcfb54cd7 ("mptcp: full disconnect implementation")
    Cc: stable@vger.kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Reviewed-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Signed-off-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Link: https://patch.msgid.link/20260515-net-mptcp-misc-fixes-7-1-rc4-v2-4-701e96419f2f@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net/mlx5: Do not restore destination-less TC rules [+ + +]

Author: Jeroen Massar <jmassar@nvidia.com>
Date:   Wed May 13 09:33:02 2026 +0300

    net/mlx5: Do not restore destination-less TC rules
    
    [ Upstream commit 8d0a5af8b1ba598e7340761729801624e7a9330e ]
    
    After IPsec policy/state TX rules are added, any TC flow rule, which
    forwards packets to uplink, is modified to forward to IPsec TX tables.
    As these tables are destroyed dynamically, whenever there is no
    reference to them, the destinations of this kind of rules must be
    restored to uplink, unless there is no destination for that rule.
    
    The flow rules FLOW_ACTION_ACCEPT, DROP, TRAP, GOTO and SAMPLE do not
    have a destination port, and thus out_count = 0.
    
    At cleanup time of the rules in mlx5_esw_ipsec_modify_flow_dests
    we call mlx5_eswitch_restore_ipsec_rule but as the above types
    do not have a destination we get an underflow of out_count, as
    the port is passed, which is esw_attr->out_count - 1.
    
    This change avoids calling mlx5_eswitch_restore_ipsec_rule when
    there are no output destinations and thus avoids the underflow.
    
    Fixes: d1569537a837 ("net/mlx5e: Modify and restore TC rules for IPSec TX rules")
    Signed-off-by: Jeroen Massar <jmassar@nvidia.com>
    Reviewed-by: Jianbo Liu <jianbol@nvidia.com>
    Reviewed-by: Cosmin Ratiu <cratiu@nvidia.com>
    Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
    Link: https://patch.msgid.link/20260513063302.333761-1-tariqt@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/mlx5: Skip disabled vports when setting max TX speed [+ + +]

Author: Or Har-Toov <ohartoov@nvidia.com>
Date:   Wed May 13 09:36:40 2026 +0300

    net/mlx5: Skip disabled vports when setting max TX speed
    
    [ Upstream commit c6df9a65cbb0fe7808a4b2872095f4c849b3196a ]
    
    When setting vports max TX speed during LAG activation or bond state
    changes, the code iterates over all eswitch vports. However, some
    vports may not be enabled yet.
    
    Skip vports that are not enabled to avoid sending FW commands for
    uninitialized vports. Save the LAG aggregated speed in the vport
    struct so it can be applied when the vport is enabled later.
    
    Fixes: 50f1d188c580 ("net/mlx5: Propagate LAG effective max_tx_speed to vports")
    Signed-off-by: Or Har-Toov <ohartoov@nvidia.com>
    Reviewed-by: Mark Bloch <mbloch@nvidia.com>
    Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
    Link: https://patch.msgid.link/20260513063640.334132-1-tariqt@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/mlx5e: Fix eswitch mode block underflow on IPsec acquire SA [+ + +]

Author: Prathamesh Deshpande <prathameshdeshpande7@gmail.com>
Date:   Sun May 10 23:59:00 2026 +0100

    net/mlx5e: Fix eswitch mode block underflow on IPsec acquire SA
    
    [ Upstream commit abe003b33223ff33552f291644bf35d9c2f992fb ]
    
    mlx5e_xfrm_add_state() handles acquire-flow temporary SAs by allocating
    software state and skipping hardware offload setup.
    
    That path jumps to the common success label before taking the eswitch mode
    block. After tunnel-mode validation was moved earlier, the common success
    label unconditionally calls mlx5_eswitch_unblock_mode(). For acquire SAs,
    this decrements esw->offloads.num_block_mode without a matching increment.
    
    Return directly after installing the acquire SA offload handle, so only the
    paths that successfully called mlx5_eswitch_block_mode() call the matching
    unblock.
    
    Fixes: 22239eb258bc ("net/mlx5e: Prevent tunnel reformat when tunnel mode not allowed")
    Signed-off-by: Prathamesh Deshpande <prathameshdeshpande7@gmail.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
    Link: https://patch.msgid.link/20260510225903.13184-1-prathameshdeshpande7@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/mlx5e: Fix use-after-free in mlx5e_tx_reporter_timeout_recover [+ + +]

Author: Matt Fleming <mfleming@cloudflare.com>
Date:   Wed May 13 12:22:26 2026 +0100

    net/mlx5e: Fix use-after-free in mlx5e_tx_reporter_timeout_recover
    
    commit 7d260c5d2d89eb2c8c528d54b576b3aae3e20231 upstream.
    
    mlx5e_tx_reporter_timeout_recover() accesses sq->netdev after
    mlx5e_safe_reopen_channels() has torn down and freed the channel (and
    its embedded SQs). Replace the three sq->netdev references with
    priv->netdev which is safe because priv outlives channel teardown.
    
    The netdev_err() call already used priv->netdev for this reason; make
    the trylock/unlock and health_channel_eq_recover calls consistent.
    
    This fixes the following KASAN splat:
    
      BUG: KASAN: use-after-free in mlx5e_tx_reporter_timeout_recover+0x1dd/0x360 [mlx5_core]
      Read of size 8 at addr ffff889860ed0b28 by task kworker/u113:2/5277
    
      Call Trace:
       mlx5e_tx_reporter_timeout_recover+0x1dd/0x360 [mlx5_core]
       devlink_health_reporter_recover+0xa2/0x150
       devlink_health_report+0x254/0x7c0
       mlx5e_reporter_tx_timeout+0x297/0x380 [mlx5_core]
       mlx5e_tx_timeout_work+0x109/0x170 [mlx5_core]
       process_one_work+0x677/0xf20
       worker_thread+0x51f/0xd90
       kthread+0x3a5/0x810
       ret_from_fork+0x208/0x400
       ret_from_fork_asm+0x1a/0x30
    
    Fixes: 83ac0304a2d7 ("net/mlx5e: Fix deadlocks between devlink and netdev instance locks")
    Cc: stable@vger.kernel.org
    Reviewed-by: Cosmin Ratiu <cratiu@nvidia.com>
    Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
    Signed-off-by: Matt Fleming <mfleming@cloudflare.com>
    Link: https://patch.msgid.link/20260513112226.140512-1-matt@readmodwrite.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net/mlx5e: xsk: Fix unlocked writing to ICOSQ [+ + +]

Author: Dragos Tatulea <dtatulea@nvidia.com>
Date:   Wed May 13 09:46:13 2026 +0300

    net/mlx5e: xsk: Fix unlocked writing to ICOSQ
    
    [ Upstream commit c326f9c68921e2f14dfcecb2f6b4216313d50248 ]
    
    During napi poll, when the affinity changes and there's still XSK work
    to be done, we trigger an ICOSQ interrupt on the new CPU. However, this
    triggering on the ICOSQ is done unprotected.
    
    There are 2 such races:
    
    A) mlx5e_trigger_irq() is called while mlx5e_xsk_alloc_rx_mpwqe() is
    running from a different CPU due to affinity change. This can happen
    because IRQ triggering is done after napi_complete_done(). At this point
    the NAPI can be scheduled on a different CPU. Like this:
    
      CPU A (old affinity, NAPI tail)    CPU B (new affinity, fresh NAPI)
      -------------------------------    --------------------------------
      napi_complete_done()  clears SCHED
      mlx5e_cq_arm(...)
                                         napi_schedule_prep() sets SCHED
                                         mlx5e_napi_poll()
                                           mlx5e_xsk_alloc_rx_mpwqe()
                                             mlx5e_icosq_sync_lock() // noop
                                             memcpy 640 B UMR body
                                             advance sq->pc by 10
      mlx5e_trigger_irq(&c->icosq)
        wqe_info[pi] = {NOP, 1}
        mlx5e_post_nop() advances sq->pc
    
    B) mlx5e_trigger_irq() is called on the ICOSQ when
    mlx5e_trigger_napi_icosq() is running.
    
    The obvious fix would be to lock the ICOSQ. But ICOSQ has an optimized
    locking scheme that doesn't work for this scenario. Kick the async ICOSQ
    instead which is always locked.
    
    This issue was noticed in the wild with the following splat:
    
      netdevice: ge-0-0-1: Bad OP in ICOSQ CQE: 0xd
      WARNING: drivers/net/ethernet/mellanox/mlx5/core/en_rx.c:826 [...]
      [...]
      Call Trace:
       <IRQ>
       mlx5e_napi_poll+0x11d/0x7f0 [mlx5_core]
       __napi_poll+0x30/0x200
       ? skb_defer_free_flush+0x9c/0xc0
       net_rx_action+0x2fe/0x3f0
       handle_softirqs+0xd8/0x340
       __irq_exit_rcu+0xbc/0xe0
       common_interrupt+0x85/0xa0
       </IRQ>
       <TASK>
       asm_common_interrupt+0x26/0x40
      [...]
      ---[ end trace 0000000000000000 ]---
      mlx5_core 0000:08:00.0 ge-0-0-1: Error cqe on cqn 0x548, ci 0x2022, qn 0x8f4,
      opcode 0xd, syndrome 0x2, vendor syndrome 0x68
      00000000: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      00000010: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      00000020: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      00000030: 00 00 00 00 01 00 68 02 01 00 08 f4 de 14 59 d2
      WQE DUMP: WQ size 16384 WQ cur size 0, WQE index 0x1e14, len: 64
      00000000: 00 00 00 01 d9 ed 80 02 00 00 00 01 d9 ed 90 02
      00000010: 00 00 00 01 d9 ed a0 02 00 00 00 01 d9 ed b0 02
      00000020: 00 00 00 01 d9 ed c0 02 00 00 00 01 d9 ed d0 02
      00000030: 00 00 00 01 d9 ed e0 02 00 00 00 01 d9 ed f0 02
      mlx5_core 0000:08:00.0 ge-0-0-1: Error cqe on cqn 0x548, ci 0x2023, qn 0x8f4,
      opcode 0xd, syndrome 0x5, vendor syndrome 0xf9
      00000000: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      00000010: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      00000020: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
      00000030: 00 00 00 00 01 00 f9 05 01 00 08 f4 de 15 cf d2
    
    Fixes: db05815b36cb ("net/mlx5e: Add XSK zero-copy support")
    Reported-by: Paul Saab <ps@mu.org>
    Signed-off-by: Dragos Tatulea <dtatulea@nvidia.com>
    Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20260513064613.334602-1-tariqt@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/smc: avoid NULL deref of conn->lnk in smc_msg_event tracepoint [+ + +]

Author: Xiang Mei <xmei5@asu.edu>
Date:   Sun May 10 15:26:40 2026 -0700

    net/smc: avoid NULL deref of conn->lnk in smc_msg_event tracepoint
    
    [ Upstream commit 7bf563badd37cb796df5477d2b78bb64148a1268 ]
    
    The smc_msg_event tracepoint class, shared by smc_tx_sendmsg and
    smc_rx_recvmsg, unconditionally dereferences smc->conn.lnk:
    
            __string(name, smc->conn.lnk->ibname)
    
    conn->lnk is only set for SMC-R; for SMC-D it is NULL. Other code on
    these paths already handles this (e.g. !conn->lnk in
    SMC_STAT_RMB_TX_SIZE_SMALL()). With the tracepoint enabled, the first
    sendmsg()/recvmsg() on an SMC-D socket crashes:
    
      Oops: general protection fault, probably for non-canonical address
      KASAN: null-ptr-deref in range [...]
      RIP: 0010:strlen+0x1e/0xa0
      Call Trace:
       trace_event_raw_event_smc_msg_event (net/smc/smc_tracepoint.h:44)
       smc_rx_recvmsg (net/smc/smc_rx.c:515)
       smc_recvmsg (net/smc/af_smc.c:2859)
       __sys_recvfrom (net/socket.c:2315)
       __x64_sys_recvfrom (net/socket.c:2326)
       do_syscall_64
    
    The faulting address 0x3e0 is offsetof(struct smc_link, ibname),
    confirming the NULL ->lnk deref. Enabling the tracepoint requires
    root, but the trigger itself is unprivileged: socket(AF_SMC, ...) has
    no capability check, and SMC-D negotiation needs no admin step on
    s390 or on x86 with the loopback ISM device loaded.
    
    Log an empty device name for SMC-D instead of dereferencing NULL.
    
    Fixes: aff3083f10bf ("net/smc: Introduce tracepoints for tx and rx msg")
    Reported-by: Weiming Shi <bestswngs@gmail.com>
    Signed-off-by: Xiang Mei <xmei5@asu.edu>
    Reviewed-by: Dust Li <dust.li@linux.alibaba.com>
    Reviewed-by: Sidraya Jayagond <sidraya@linux.ibm.com>
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net/smc: reject CHID-0 ACCEPT that matches an empty ism_dev slot [+ + +]

Author: Xiang Mei <xmei5@asu.edu>
Date:   Sun May 10 23:21:38 2026 -0700

    net/smc: reject CHID-0 ACCEPT that matches an empty ism_dev slot
    
    [ Upstream commit 277740023def559a4a2ddc3e8e784ee37a0f16a9 ]
    
    On the SMC-D client, slot 0 of ini->ism_dev[]/ini->ism_chid[] is
    reserved for an SMC-Dv1 device. smc_find_ism_v2_device_clnt()
    populates V2 entries starting at index 1, so when no V1 device is
    selected slot 0 is left in its kzalloc()'ed state with ism_dev[0] ==
    NULL and ism_chid[0] == 0.
    
    smc_v2_determine_accepted_chid() then matches the peer's CHID against
    the array starting from index 0 using the CHID alone. A malicious
    peer replying to a SMC-Dv2-only proposal with d1.chid == 0 matches
    the empty slot, ini->ism_selected becomes 0, and the subsequent
    ism_dev[0]->lgr_lock dereference in smc_conn_create() faults at
    offsetof(struct smcd_dev, lgr_lock) == 0x68:
    
      BUG: KASAN: null-ptr-deref in _raw_spin_lock_bh+0x79/0xe0
      Write of size 4 at addr 0000000000000068 by task exploit/144
      Call Trace:
       _raw_spin_lock_bh
       smc_conn_create (net/smc/smc_core.c:1997)
       __smc_connect (net/smc/af_smc.c:1447)
       smc_connect (net/smc/af_smc.c:1720)
       __sys_connect
       __x64_sys_connect
       do_syscall_64
    
    Require ism_dev[i] to be non-NULL before accepting a CHID match.
    
    Fixes: a7c9c5f4af7f ("net/smc: CLC accept / confirm V2")
    Reported-by: Weiming Shi <bestswngs@gmail.com>
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Xiang Mei <xmei5@asu.edu>
    Link: https://patch.msgid.link/20260511062138.2839584-1-xmei5@asu.edu
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ag71xx: check error for platform_get_irq [+ + +]

Author: Rosen Penev <rosenp@gmail.com>
Date:   Sat May 16 14:26:16 2026 -0700

    net: ag71xx: check error for platform_get_irq
    
    [ Upstream commit e7c70bf97e90d974cd575e4c90f8f9b07d056da3 ]
    
    Complete error handling for a failed platform_get_irq() call
    
    Fixes: d51b6ce441d3 ("net: ethernet: add ag71xx driver")
    Signed-off-by: Rosen Penev <rosenp@gmail.com>
    Reviewed-by: Oleksij Rempel <o.rempel@pengutronix.de>
    Link: https://patch.msgid.link/20260516212616.11758-1-rosenp@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: airoha: Disable GDM2 forwarding before configuring GDM2 loopback [+ + +]

Author: Lorenzo Bianconi <lorenzo@kernel.org>
Date:   Wed May 20 15:12:02 2026 +0200

    net: airoha: Disable GDM2 forwarding before configuring GDM2 loopback
    
    [ Upstream commit 985d4a55e64e43bd86eeb896b81ceba453301989 ]
    
    Hw design requires to disable GDM2 forwarding before configuring GDM2
    loopback in airoha_set_gdm2_loopback routine.
    
    Fixes: 9cd451d414f6e ("net: airoha: Add loopback support for GDM2")
    Tested-by: Madhur Agrawal <madhur.agrawal@airoha.com>
    Signed-off-by: Lorenzo Bianconi <lorenzo@kernel.org>
    Link: https://patch.msgid.link/20260520-airoha-disable-gdm2-fwd-v1-1-1eeea5dffc2f@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: airoha: Fix NPU RX DMA descriptor bits [+ + +]

Author: Christian Marangi <ansuelsmth@gmail.com>
Date:   Mon May 18 15:44:57 2026 +0200

    net: airoha: Fix NPU RX DMA descriptor bits
    
    [ Upstream commit 0cb5a74faa3bdcfa3b18735d554e12c0f615e35d ]
    
    In an internal review from Airoha, it was notice that the RX DMA descriptor
    bits and mask are wrong. These values probably refer to an old NPU firmware
    never published. The previous value works correctly but it was reported
    that in some specific condition in mixed scenario with both Ethernet and
    WiFi offload it's possible that RX DMA descriptor signal wrong value with
    the problem to the RX ring or packets getting dropped.
    
    To handle these specific scenario, apply the new suggested bits mask from
    Airoha.
    
    Correct functionality of both AN7581 NPU and MT7996 variant were verified
    and confirmed working.
    
    Fixes: a7fc8c641cab ("net: airoha: Fix npu rx DMA definitions")
    Signed-off-by: Christian Marangi <ansuelsmth@gmail.com>
    Acked-by: Lorenzo Bianconi <lorenzo@kernel.org>
    Link: https://patch.msgid.link/20260518134530.3683-1-ansuelsmth@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: bcmgenet: keep RBUF EEE/PM disabled [+ + +]

Author: Nicolai Buchwitz <nb@tipi-net.de>
Date:   Wed May 20 20:43:20 2026 +0200

    net: bcmgenet: keep RBUF EEE/PM disabled
    
    commit 9a1730245e416d11ad5c0f2c100061d61cc43f60 upstream.
    
    Setting RBUF_EEE_EN | RBUF_PM_EN in RBUF_ENERGY_CTRL breaks the RX
    path on GENET hardware once MAC EEE becomes active. RX traffic stops
    flowing while the link stays up and the usual descriptor/RX error
    counters remain quiet. In that state the MAC still accepts frames
    (rbuf_ovflow_cnt keeps climbing) but RBUF no longer forwards them to
    DMA, so rx_packets is no longer incremented at the netdev level. On
    some boards the corruption ends up as a paging fault in
    skb_release_data via bcmgenet_rx_poll on an LPI exit.
    
    Reproduced on Pi 4B (BCM2711 + BCM54213PE) and confirmed by Florian
    Fainelli on an internal Broadcom 4908-family board with the same crash
    signature. RBUF_PM_EN is not publicly documented.
    
    This shows up more often now that phy_support_eee() enables EEE by
    default, but it also affects older kernels as soon as TX LPI is
    turned on via ethtool, so it is not specific to recent changes.
    
    Always clear RBUF_EEE_EN | RBUF_PM_EN in bcmgenet_eee_enable_set so
    the bits stay off across resets. UMAC and TBUF setup is left alone so
    TX-side EEE keeps working.
    
    Link: https://github.com/raspberrypi/linux/issues/7304
    Fixes: 6ef398ea60d9 ("net: bcmgenet: add EEE support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Nicolai Buchwitz <nb@tipi-net.de>
    Reviewed-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Link: https://patch.msgid.link/20260520184320.652053-1-nb@tipi-net.de
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: devmem: reject dma-buf bind with non-page-aligned size or SG length [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Tue May 19 21:35:30 2026 +0100

    net: devmem: reject dma-buf bind with non-page-aligned size or SG length
    
    commit 4eb82ba543421e9e38cc14e4e82058b78850df50 upstream.
    
    net_devmem_bind_dmabuf() trusts dmabuf->size and sg_dma_len() to be
    PAGE_SIZE multiples without checking:
    
      - tx_vec is sized dmabuf->size / PAGE_SIZE, and
        net_devmem_get_niov_at() only bounds-checks virt_addr < dmabuf->size
        before indexing tx_vec[virt_addr / PAGE_SIZE]. With size =
        N*PAGE_SIZE + r (1 <= r < PAGE_SIZE), sendmsg() at iov_base =
        N*PAGE_SIZE passes the bound check and reads tx_vec[N] -- one past.
    
      - owner->area.num_niovs = len / PAGE_SIZE while gen_pool_add_owner()
        covers the full byte len, so a non-page-multiple non-final sg
        desyncs num_niovs from the gen_pool region for every later sg, on
        both RX and TX.
    
    dma-buf does not require page-aligned sizes, so the bind path has to
    enforce what its own indexing assumes. Reject both with -EINVAL.
    
    The size check is TX-only (only tx_vec is sized off dmabuf->size); the
    SG-length check covers both directions.
    
    Fixes: bd61848900bf ("net: devmem: Implement TX path")
    Cc: stable@vger.kernel.org
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Reviewed-by: Bobby Eshleman <bobbyeshleman@meta.com>
    Acked-by: Stanislav Fomichev <sdf@fomichev.me>
    Reviewed-by: Mina Almasry <almasrymina@google.com>
    Link: https://patch.msgid.link/20260519203530.66310-1-devnexen@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: dsa: mt7530: fix FDB entries not aging out with short timeout [+ + +]

Author: Daniel Golle <daniel@makrotopia.org>
Date:   Thu May 14 15:04:21 2026 +0100

    net: dsa: mt7530: fix FDB entries not aging out with short timeout
    
    [ Upstream commit e824e40d0e841fab66ab7897d6c7b14dc81c66a7 ]
    
    The DSA forwarding selftests bridge_vlan_aware.sh and
    bridge_vlan_unaware.sh configure the bridge with ageing_time set to
    LOW_AGEING_TIME (1000 centiseconds, i.e. 10 seconds) and then run
    learning_test() in lib.sh, which expects a learned FDB entry to be
    removed after ageing_time + 10 seconds. On MT7530/MT7531 the entry
    persisted past the deadline and the "Found FDB record when should
    not" assertion failed.
    
    With msecs=10000, the algorithm in mt7530_set_ageing_time() finds
    AGE_CNT=0 and AGE_UNIT=9 as the first exact match (starting the
    search from tmp_age_count=0). The per-entry aging counter is
    initialized to AGE_CNT when a MAC address is learned, so with
    AGE_CNT=0 new entries start with a counter value of 0, which the
    hardware treats as "already aged" and never removes, effectively
    disabling aging.
    
    Fix this by starting the search from tmp_age_count=1 to ensure
    entries always have a non-zero initial aging counter. For a
    10-second ageing time this yields AGE_CNT=1 and AGE_UNIT=4 instead:
    the timer ticks every 5 seconds and entries are removed after 2
    ticks.
    
    Starting the search at AGE_CNT=1 raises the minimum representable
    ageing time from 1 to 2 seconds. Without bounds, a stale ageing_time
    of 1 second would now make the loop fall through without setting
    age_count and age_unit, leaving them uninitialized when written to
    the MT7530_AAC hardware register. Set ds->ageing_time_min and
    ds->ageing_time_max so the DSA core validates the range before the
    callback is invoked, and drop the now-redundant range check from
    mt7530_set_ageing_time().
    
    Fixes: ea6d5c924e39 ("net: dsa: mt7530: support setting ageing time")
    Signed-off-by: Daniel Golle <daniel@makrotopia.org>
    Link: https://patch.msgid.link/7788ded12dc07b1bce329ec35fa70f4b45f3f9b7.1778766629.git.daniel@makrotopia.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: dsa: mt7530: preserve VLAN tags on trapped link-local frames [+ + +]

Author: Daniel Golle <daniel@makrotopia.org>
Date:   Thu May 14 15:04:35 2026 +0100

    net: dsa: mt7530: preserve VLAN tags on trapped link-local frames
    
    [ Upstream commit 3ac85bcfd404b588298c95c6fba8aad4ad334f57 ]
    
    The BPC, RGAC1 and RGAC2 registers control the handling of link-local
    frames with reserved MAC DAs (01:80:C2:00:00:0x). These frames are
    correctly trapped to the CPU port, but the egress VLAN tag attribute was
    set to MT7530_VLAN_EG_UNTAGGED which causes the switch to strip any
    VLAN tags from trapped frames before they reach the CPU.
    
    This causes VLAN-tagged link-local frames (STP BPDUs, LLDP, PTP Peer
    Delay Requests) to arrive at the CPU without their VLAN tag, so they
    are delivered to the base network interface instead of the VLAN
    sub-interface. The DSA local_termination selftest confirms this: all
    link-local protocol tests on VLAN upper interfaces fail.
    
    Set the EG_TAG attribute to MT7530_VLAN_EG_DISABLED (system default)
    so that the switch does not modify VLAN tags in trapped frames. This
    way VLAN-tagged frames retain their original tag and are delivered to
    the correct VLAN sub-interface, matching the behavior of non-trapped
    frames which pass through without VLAN tag modification.
    
    Fixes: 69ddba9d170b ("net: dsa: mt7530: fix handling of all link-local frames")
    Signed-off-by: Daniel Golle <daniel@makrotopia.org>
    Acked-by: Chester A. Unal <chester.a.unal@arinc9.com>
    Link: https://patch.msgid.link/891e0cd34db2a5fe20ceb73283a81fb5f71427ca.1778766629.git.daniel@makrotopia.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: enetc: fix missing error code when pf->vf_state allocation fails [+ + +]

Author: Wei Fang <wei.fang@nxp.com>
Date:   Wed May 20 14:44:14 2026 +0800

    net: enetc: fix missing error code when pf->vf_state allocation fails
    
    [ Upstream commit 5027266dea471e140f93dd534845c9c4f43219a3 ]
    
    In enetc_pf_probe(), when the memory allocation for pf->vf_state fails,
    the code jumps to the error handling label but the variable 'err' is not
    assigned an appropriate error code beforehand. This causes the function
    to return 0 (success) on an allocation failure path, misleading the
    caller into thinking the probe succeeded. So set err to -ENOMEM before
    jumping to the error handling label when the allocation for pf->vf_state
    returns NULL.
    
    Fixes: e15c5506dd39 ("net: enetc: allocate vf_state during PF probes")
    Signed-off-by: Wei Fang <wei.fang@nxp.com>
    Reviewed-by: Harshitha Ramamurthy <hramamurthy@google.com>
    Link: https://patch.msgid.link/20260520064421.91569-3-wei.fang@nxp.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ethernet: cortina: Carry over frag counter [+ + +]

Author: Linus Walleij <linusw@kernel.org>
Date:   Sat May 9 00:13:38 2026 +0200

    net: ethernet: cortina: Carry over frag counter
    
    [ Upstream commit ebd8ec2b309e3a447851b456ccaf8fb39f3661e7 ]
    
    The gmac_rx() NAPI poll function assembles packets in an
    SKB from a ring buffer.
    
    If the ring buffer gets completely emptied during a poll cycle,
    we exit gmac_rx(), but the packet is not yet completely
    assembled in the SKB, yet the fragment counter frag_nr is
    reset to zero on the next invocation.
    
    Solve this by making the RX fragment counter a part of the
    port struct, and carry it over between invocations.
    
    Reset the fragment counter only right after calling
    napi_gro_frags(), on error (after calling napi_free_frags())
    or if stopping the port.
    
    Reset it in some place where not strictly necessary just to
    emphasize what is going on.
    
    This was found by Sashiko during normal patch review.
    
    Fixes: 4d5ae32f5e1e ("net: ethernet: Add a driver for Gemini gigabit ethernet")
    Link: https://sashiko.dev/#/patchset/20260505-gemini-ethernet-fix-v2-1-997c31d06079%40kernel.org
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Link: https://patch.msgid.link/20260509-gemini-ethernet-fixes-v1-3-6c5d20ddc35b@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ethernet: cortina: Drop half-assembled SKB [+ + +]

Author: Andreas Haarmann-Thiemann <eitschman@nebelreich.de>
Date:   Tue May 5 23:52:17 2026 +0200

    net: ethernet: cortina: Drop half-assembled SKB
    
    [ Upstream commit b266bacba796ff5c4dcd2ae2fc08aacf7ab39153 ]
    
    In gmac_rx() (drivers/net/ethernet/cortina/gemini.c), when
    gmac_get_queue_page() returns NULL for the second page of a multi-page
    fragment, the driver logs an error and continues — but does not free the
    partially assembled skb that was being assembled via napi_build_skb() /
    napi_get_frags().
    
    Free the in-progress partially assembled skb via napi_free_frags()
    and increase the number of dropped frames appropriately
    and assign the skb pointer NULL to make sure it is not lingering
    around, matching the pattern already used elsewhere in the driver.
    
    Fixes: 4d5ae32f5e1e ("net: ethernet: Add a driver for Gemini gigabit ethernet")
    Signed-off-by: Andreas Haarmann-Thiemann <eitschman@nebelreich.de>
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Reviewed-by: Alexander Lobakin <aleksander.lobakin@intel.com>
    Link: https://patch.msgid.link/20260505-gemini-ethernet-fix-v2-1-997c31d06079@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Stable-dep-of: ebd8ec2b309e ("net: ethernet: cortina: Carry over frag counter")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ethernet: cortina: Make RX SKB per-port [+ + +]

Author: Linus Walleij <linusw@kernel.org>
Date:   Sat May 9 00:13:37 2026 +0200

    net: ethernet: cortina: Make RX SKB per-port
    
    [ Upstream commit 06937db21ee311ed07eba47954447245041a982d ]
    
    The SKB used to assemble packets from fragments in gmac_rx()
    is static local, but the Gemini has two ethernet ports, meaning
    there can be races between the ports on a bad day if a device
    is using both.
    
    Make the RX SKB a per-port variable and carry it over between
    invocations in the port struct instead.
    
    Zero the pointer once we call napi_gro_frags(), on error (after
    calling napi_free_frags()) or if the port is stopped.
    
    Zero it in some place where not strictly necessary just to
    emphasize what is going on.
    
    This was found by Sashiko during normal patch review.
    
    Fixes: 4d5ae32f5e1e ("net: ethernet: Add a driver for Gemini gigabit ethernet")
    Link: https://sashiko.dev/#/patchset/20260505-gemini-ethernet-fix-v2-1-997c31d06079%40kernel.org
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Link: https://patch.msgid.link/20260509-gemini-ethernet-fixes-v1-2-6c5d20ddc35b@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ethernet: cs89x0: remove stale CONFIG_MACH_MX31ADS reference [+ + +]

Author: Ethan Nelson-Moore <enelsonmoore@gmail.com>
Date:   Fri May 8 19:37:28 2026 -0700

    net: ethernet: cs89x0: remove stale CONFIG_MACH_MX31ADS reference
    
    [ Upstream commit 36a8d04a8293afcb9304cf0cd3741f67698f2a1a ]
    
    The legacy ARM board file for MACH_MX31ADS was removed in commit
    c93197b0041d ("ARM: imx: Remove i.MX31 board files"), but a reference
    to it remained in the cs89x0 driver. Drop this unused code.
    
    Signed-off-by: Ethan Nelson-Moore <enelsonmoore@gmail.com>
    Fixes: c93197b0041d ("ARM: imx: Remove i.MX31 board files")
    Link: https://patch.msgid.link/20260509023732.42256-1-enelsonmoore@gmail.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ethtool: fix NULL pointer dereference in phy_reply_size [+ + +]

Author: Quan Sun <2022090917019@std.uestc.edu.cn>
Date:   Fri May 22 08:50:59 2026 -0400

    net: ethtool: fix NULL pointer dereference in phy_reply_size
    
    [ Upstream commit 4908f1395fb1b832ceec11584af649874a2732ea ]
    
    In phy_prepare_data(), several strings such as 'name', 'drvname',
    'upstream_sfp_name', and 'downstream_sfp_name' are allocated using
    kstrdup(). However, these allocations were not checked  for failure.
    
    If kstrdup() fails for 'name', it returns NULL while the function
    continues. This leads to a kernel NULL pointer dereference and panic
    later in phy_reply_size() when it unconditionally calls strlen() on
    the NULL pointer.
    
    While other strings like 'upstream_sfp_name' might be checked before
    access in certain code paths, failing to handle these allocations
    consistently can lead to incomplete data reporting or hidden bugs.
    
    Fix this by adding proper NULL checks for all kstrdup() calls in
    phy_prepare_data() and implement a centralized error handling path
    using goto labels to ensure all previously allocated resources are
    freed on failure.
    
    Fixes: 9dd2ad5e92b9 ("net: ethtool: phy: Convert the PHY_GET command to generic phy dump")
    Signed-off-by: Quan Sun <2022090917019@std.uestc.edu.cn>
    Reviewed-by: Maxime Chevallier <maxime.chevallier@bootlin.com>
    Link: https://patch.msgid.link/20260507131738.1173835-1-2022090917019@std.uestc.edu.cn
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Stable-dep-of: e3adf69f8eb1 ("net: ethtool: phy: avoid NULL deref when PHY driver is unbound")
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: ethtool: phy: avoid NULL deref when PHY driver is unbound [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Fri May 22 08:51:00 2026 -0400

    net: ethtool: phy: avoid NULL deref when PHY driver is unbound
    
    [ Upstream commit e3adf69f8eb121a9128c2b0029efd050d3649153 ]
    
    phydev->drv can become NULL while the phy_device is still attached to
    its net_device, namely after the PHY driver is unbound via sysfs:
    
            echo <mdio_id> > /sys/bus/mdio_bus/drivers/<phy_drv>/unbind
    
    phy_remove() clears phydev->drv but doesn't call phy_detach(), so the
    phy_device stays in the link topology xarray and ethnl_req_get_phydev()
    still hands it back. ETHTOOL_MSG_PHY_GET then oopses on:
    
            rep_data->drvname = kstrdup(phydev->drv->name, GFP_KERNEL);
    
    drvname is already treated as optional by phy_reply_size(),
    phy_fill_reply() and phy_cleanup_data(), so just skip the allocation
    when there is no driver bound.
    
    Fixes: 9dd2ad5e92b9 ("net: ethtool: phy: Convert the PHY_GET command to generic phy dump")
    Cc: stable@vger.kernel.org # 6.13.x
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Reviewed-by: Maxime Chevallier <maxime.chevallier@bootlin.com>
    Tested-by: Maxime Chevallier <maxime.chevallier@bootlin.com>
    Link: https://patch.msgid.link/20260509215046.107157-1-devnexen@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: gro: don't merge zcopy skbs [+ + +]

Author: Sabrina Dubroca <sd@queasysnail.net>
Date:   Wed May 20 22:44:42 2026 +0200

    net: gro: don't merge zcopy skbs
    
    [ Upstream commit 4db79a322db8c97f7b73b8a347395ef4d685eb40 ]
    
    skb_gro_receive() can currently copy frags between the source and GRO
    skb, without checking the zerocopy status, and in particular the
    SKBFL_MANAGED_FRAG_REFS flag.
    
    When SKBFL_MANAGED_FRAG_REFS is set, the skb doesn't hold a reference
    on the pages in shinfo->frags. Appending those frags to another skb's
    frags without fixing up the page refcount can lead to UAF.
    
    When either the last skb in the GRO chain (the one we would append
    frags to) or the source skb is zerocopy, don't merge the skbs.
    
    Fixes: 753f1ca4e1e5 ("net: introduce managed frags infrastructure")
    Reported-by: Huzaifa Sidhpurwala <huzaifas@redhat.com>
    Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>
    Reviewed-by: Willem de Bruijn <willemb@google.com>
    Link: https://patch.msgid.link/c3b7f906bbfcbdfd7b4fa9d6c18a438870df85be.1779307748.git.sd@queasysnail.net
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: hsr: defer node table free until after RCU readers [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Wed May 13 19:38:38 2026 -0400

    net: hsr: defer node table free until after RCU readers
    
    commit aaec7096f9961eb223b5b149abe9495525c205d9 upstream.
    
    HSR node-list and node-status generic-netlink operations run under
    rcu_read_lock(). They walk hsr->node_db through hsr_get_next_node() and
    hsr_get_node_data(), but RTM_DELLINK teardown removes the same node table
    with plain list_del() and frees each node immediately.
    
    That lets a generic-netlink reader hold a struct hsr_node pointer across
    hsr_dellink(). In a KASAN build, widening the reader window after
    hsr_get_next_node() obtains the node reproduces a slab-use-after-free
    when the reader copies node->macaddress_A; the freeing stack is
    hsr_del_nodes() from hsr_dellink().
    
    Use list_del_rcu() and defer the free through the existing
    hsr_free_node_rcu() callback. This matches the lifetime rule used by the
    HSR prune paths, which already delete nodes with list_del_rcu() and
    call_rcu().
    
    Fixes: b9a1e627405d ("hsr: implement dellink to clean up resources")
    Cc: stable@vger.kernel.org # v5.3+
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/20260513233838.3064715-2-michael.bommarito@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: ifb: report ethtool stats over num_tx_queues [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Wed May 13 21:37:39 2026 -0400

    net: ifb: report ethtool stats over num_tx_queues
    
    commit 5db89c99566fc4728cc92e941d8e1975711e24b5 upstream.
    
    ifb_dev_init() allocates dp->tx_private to dev->num_tx_queues
    entries via kzalloc_objs(*txp, dev->num_tx_queues). Both IFB
    per-queue RX and TX stats live in those entries: ifb_xmit() updates
    txp->rx_stats using the skb queue mapping, ifb_ri_tasklet() updates
    txp->tx_stats, and ifb_stats64() aggregates both over
    dev->num_tx_queues.
    
    The ethtool stats callbacks instead size and walk the per-queue
    stats with dev->real_num_rx_queues and dev->real_num_tx_queues. With
    an asymmetric device where the RX queue count exceeds the TX queue
    count, for example:
    
        ip link add name ifb10 numtxqueues 1 numrxqueues 8 type ifb
        ethtool -S ifb10
    
    ifb_get_ethtool_stats() indexes past the tx_private allocation and
    copies adjacent slab data through ETHTOOL_GSTATS.
    
    Use dev->num_tx_queues consistently for the stats strings, the
    stats count, and the stats data walks. This reports one RX stats
    group and one TX stats group for each backing ifb_q_private entry,
    which is the queue set IFB can actually populate.
    
    Reproduced under UML+KASAN at v7.1-rc2:
    
      BUG: KASAN: slab-out-of-bounds in ifb_fill_stats_data+0x3c/0xae
      Read of size 8 at addr 0000000062dbd228 by task ethtool/36
      ifb_fill_stats_data+0x3c/0xae
      ifb_get_ethtool_stats+0xc0/0x129
      __dev_ethtool+0x1ca5/0x363c
      dev_ethtool+0x123/0x1b3
      dev_ioctl+0x56c/0x744
      sock_do_ioctl+0x15f/0x1b2
      sock_ioctl+0x4d5/0x50a
      sys_ioctl+0xd8b/0xde9
    
    With the patch applied, the same UML+KASAN repro is silent and
    ethtool -S ifb10 reports only the stats backed by the single
    allocated tx_private entry.
    
    Fixes: a21ee5b2fcb8 ("net: ifb: support ethtools stats")
    Cc: stable@vger.kernel.org
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/20260514013739.3549624-1-michael.bommarito@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: lan966x: avoid unregistering netdev on register failure [+ + +]

Author: Myeonghun Pak <mhun512@gmail.com>
Date:   Wed May 6 21:43:11 2026 +0900

    net: lan966x: avoid unregistering netdev on register failure
    
    [ Upstream commit c4f3d6eb1fcf6cd9ce4644f604d5aad1ce594dfc ]
    
    lan966x_probe_port() stores the newly allocated net_device in the
    port before calling register_netdev(). If register_netdev() fails,
    the probe error path calls lan966x_cleanup_ports(), which sees
    port->dev and calls unregister_netdev() for a device that was never
    registered.
    
    Destroy the phylink instance created for this port and clear port->dev
    before returning the registration error. The common cleanup path now skips
    ports without port->dev before reaching the registered netdev cleanup, so
    it only handles ports that reached the registered-netdev lifetime.
    
    This also avoids treating an uninitialized FDMA netdev and the failed port
    as a NULL == NULL match in the common cleanup path.
    
    Fixes: d28d6d2e37d1 ("net: lan966x: add port module support")
    Co-developed-by: Ijae Kim <ae878000@gmail.com>
    Signed-off-by: Ijae Kim <ae878000@gmail.com>
    Signed-off-by: Myeonghun Pak <mhun512@gmail.com>
    Link: https://patch.msgid.link/20260506124331.31945-1-mhun512@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mana: Fix TOCTOU double-fetch of hwc_msg_id from DMA buffer [+ + +]

Author: Erni Sri Satya Vennela <ernis@linux.microsoft.com>
Date:   Thu May 14 12:41:51 2026 -0700

    net: mana: Fix TOCTOU double-fetch of hwc_msg_id from DMA buffer
    
    [ Upstream commit 35f0f0a2536a4d604b4dbad92c85c4a8fdebb870 ]
    
    In mana_hwc_rx_event_handler(), resp->response.hwc_msg_id is read from
    DMA-coherent memory and bounds-checked, then mana_hwc_handle_resp()
    re-reads the same field from the same DMA buffer for test_bit() and
    pointer arithmetic.
    
    DMA-coherent memory is mapped uncacheable on x86 and is shared,
    unencrypted, in Confidential VMs (SEV-SNP/TDX), so each load goes
    directly to host-visible memory. A H/W can modify the value
    between the check and the use, bypassing the bounds validation.
    
    Fix this by reading hwc_msg_id exactly once using READ_ONCE() into a
    stack-local variable in mana_hwc_rx_event_handler(), and passing the
    validated value as a parameter to mana_hwc_handle_resp().
    
    Fixes: ca9c54d2d6a5 ("net: mana: Add a driver for Microsoft Azure Network Adapter (MANA)")
    Signed-off-by: Erni Sri Satya Vennela <ernis@linux.microsoft.com>
    Link: https://patch.msgid.link/20260514194156.466823-1-ernis@linux.microsoft.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: mana: validate rx_req_idx to prevent out-of-bounds array access [+ + +]

Author: Aditya Garg <gargaditya@linux.microsoft.com>
Date:   Tue May 19 22:15:53 2026 -0700

    net: mana: validate rx_req_idx to prevent out-of-bounds array access
    
    [ Upstream commit b809d0409991b75a6cff846a5ac27c3062953f84 ]
    
    In mana_hwc_rx_event_handler(), rx_req_idx is derived from
    sge->address in DMA-coherent memory. In Confidential VMs
    (SEV-SNP/TDX), this memory is shared unencrypted and HW can modify
    WQE contents at any time. No bounds check exists on rx_req_idx,
    which can lead to an out-of-bounds access into reqs[].
    
    Add bounds check on rx_req_idx in mana_hwc_rx_event_handler() before
    using it to index the reqs[] array.
    
    Fixes: ca9c54d2d6a5 ("net: mana: Add a driver for Microsoft Azure Network Adapter (MANA)")
    Signed-off-by: Aditya Garg <gargaditya@linux.microsoft.com>
    Reviewed-by: Haiyang Zhang <haiyangz@microsoft.com>
    Link: https://patch.msgid.link/20260520051553.857120-1-gargaditya@linux.microsoft.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: napi: Avoid gro timer misfiring at end of busypoll [+ + +]

Author: Dragos Tatulea <dtatulea@nvidia.com>
Date:   Wed May 6 09:08:08 2026 +0000

    net: napi: Avoid gro timer misfiring at end of busypoll
    
    [ Upstream commit 58e2330bd45572a6e3d46ea94cf7a9641f43591a ]
    
    When in irq deferral mode (defer-hard-irqs > 0), a short enough
    gro-flush timeout can trigger before NAPI_STATE_SCHED is cleared if the
    last poll in busy_poll_stop() takes too long. This can have the effect
    of leaving the queue stuck with interrupts disabled and no timer armed
    which results in a tx timeout if there is no subsequent busypoll cycle.
    
    To prevent this, defer the gro-flush timer arm after the last poll.
    
    Fixes: 7fd3253a7de6 ("net: Introduce preferred busy-polling")
    Co-developed-by: Martin Karsten <mkarsten@uwaterloo.ca>
    Signed-off-by: Martin Karsten <mkarsten@uwaterloo.ca>
    Signed-off-by: Dragos Tatulea <dtatulea@nvidia.com>
    Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
    Reviewed-by: Cosmin Ratiu <cratiu@nvidia.com>
    Reviewed-by: Joe Damato <joe@dama.to>
    Link: https://patch.msgid.link/20260506090808.820559-2-dtatulea@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: DP83TC811: add reading of abilities [+ + +]

Author: Sven Schuchmann <schuchmann@schleissheimer.de>
Date:   Tue May 12 09:19:47 2026 +0200

    net: phy: DP83TC811: add reading of abilities
    
    [ Upstream commit c78bdba7b9666020c0832150a4fc4c0aebc7c6ac ]
    
    At this time the driver is not listing any speeds
    it supports. This should be ETHTOOL_LINK_MODE_100baseT1_Full_BIT
    for DP83TC811. Add the missing call for phylib to read the abilities.
    
    Fixes: b753a9faaf9a ("net: phy: DP83TC811: Introduce support for the DP83TC811 phy")
    Suggested-by: Andrew Lunn <andrew@lunn.ch>
    Signed-off-by: Sven Schuchmann <schuchmann@schleissheimer.de>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20260512071949.6218-1-schuchmann@schleissheimer.de
    [pabeni@redhat.com: dropped revision history]
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: honor eee_disabled_modes in phy_advertise_eee_all() [+ + +]

Author: Nicolai Buchwitz <nb@tipi-net.de>
Date:   Mon May 18 10:23:10 2026 +0200

    net: phy: honor eee_disabled_modes in phy_advertise_eee_all()
    
    [ Upstream commit 8baa7506d793f0636e3f6f01b01ef7be19674d06 ]
    
    phy_advertise_eee_all() copies supported_eee into advertising_eee
    unconditionally, overwriting any filtering applied during phy_probe()
    based on DT eee-broken-* properties or driver-populated
    eee_disabled_modes. genphy_c45_ethtool_set_eee() calls this helper
    when user space passes an empty advertisement, undoing the filtering.
    
    Apply the same eee_disabled_modes mask in phy_advertise_eee_all() so
    the filtering survives the copy, matching the pattern in phy_probe()
    and phy_support_eee().
    
    Fixes: b64691274f5d ("net: phy: add helper phy_advertise_eee_all")
    Signed-off-by: Nicolai Buchwitz <nb@tipi-net.de>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20260518-devel-phy-support-eee-fix-v2-2-05b52626fa68@tipi-net.de
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: honor eee_disabled_modes in phy_support_eee() [+ + +]

Author: Nicolai Buchwitz <nb@tipi-net.de>
Date:   Mon May 18 10:23:09 2026 +0200

    net: phy: honor eee_disabled_modes in phy_support_eee()
    
    [ Upstream commit 3655063e083889ed4b79b7dda9cec65478dce09a ]
    
    phy_support_eee() copies supported_eee into advertising_eee
    unconditionally, overwriting any filtering applied during phy_probe()
    based on DT eee-broken-* properties or driver-populated
    eee_disabled_modes. MAC drivers that call phy_support_eee() after
    probe (e.g. bcmgenet, fec, lan743x, lan78xx, r8169) then cause the PHY
    to advertise EEE for modes the user marked as broken.
    
    The symptom is that ethtool --show-eee on the local interface reports
    "not supported" (supported & ~eee_disabled_modes is empty) while the
    link partner sees EEE negotiated and active.
    
    phy_probe() already filters advertising_eee via eee_disabled_modes
    after calling of_set_phy_eee_broken(). Apply the same mask in
    phy_support_eee() so the filtering survives the copy.
    
    Fixes: 49168d1980e2 ("net: phy: Add phy_support_eee() indicating MAC support EEE")
    Signed-off-by: Nicolai Buchwitz <nb@tipi-net.de>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Link: https://patch.msgid.link/20260518-devel-phy-support-eee-fix-v2-1-05b52626fa68@tipi-net.de
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: phy: skip EEE advertisement write when autoneg is disabled [+ + +]

Author: Nerijus Bendžiūnas <nerijus.bendziunas@gmail.com>
Date:   Sat May 16 18:02:51 2026 +0300

    net: phy: skip EEE advertisement write when autoneg is disabled
    
    commit 960e77ce14a83ef7f226e8e4b4d75765633ba48b upstream.
    
    genphy_c45_an_config_eee_aneg() writes the EEE advertisement to the
    auto-negotiation device's MMD register space (MDIO_MMD_AN, register
    MDIO_AN_EEE_ADV).  These registers are read by the link partner only
    during auto-negotiation, so writing them while autoneg is disabled
    cannot influence the link.  On some PHYs (e.g. Broadcom BCM54213PE)
    the write nevertheless reaches the chip and disturbs the receive
    datapath.
    
    Concretely, running
    
        ethtool -s eth0 speed 100 duplex full autoneg off
        ethtool --set-eee eth0 eee off
    
    leaves eth0 with TX working and RX completely silent on a
    Raspberry Pi 4 / CM4 board (bcmgenet + BCM54213PE in rgmii-rxid).
    Switching back to autoneg recovers the link.
    
    Prior to commit f26a29a038ee ("net: phy: ensure that genphy_c45_an_config_eee_aneg() sees new value of phydev->eee_cfg.eee_enabled"),
    the disable path was effectively a no-op because the helper read
    the stale eee_cfg.eee_enabled, so the underlying PHY behavior never
    surfaced.
    
    Bisected on rpi-6.12.y between commits 83943264 (good) and
    effcbc88 (bad) to f26a29a038ee.
    
    Fixes: f26a29a038ee ("net: phy: ensure that genphy_c45_an_config_eee_aneg() sees new value of phydev->eee_cfg.eee_enabled")
    Cc: stable@vger.kernel.org
    Signed-off-by: Nerijus Bendžiūnas <nerijus.bendziunas@gmail.com>
    Reviewed-by: Nicolai Buchwitz <nb@tipi-net.de>
    Tested-by: Nicolai Buchwitz <nb@tipi-net.de>
    Link: https://patch.msgid.link/20260516150251.879680-1-nerijus.bendziunas@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: pse-pd: fix sign on -ENOENT check in of_load_pse_pis() [+ + +]

Author: Jonas Jelonek <jelonek.jonas@gmail.com>
Date:   Fri May 15 14:31:03 2026 +0000

    net: pse-pd: fix sign on -ENOENT check in of_load_pse_pis()
    
    commit 33d35975cbead3fa6b738ee57e5e45e14fbe0886 upstream.
    
    of_count_phandle_with_args() returns the count on success and a negative
    errno on failure, including -ENOENT when the "pairsets" property is
    absent. The existing comparison in of_load_pse_pis() checks against
    ENOENT (positive 2) instead of -ENOENT, so the branch is taken for any
    error return: legitimate DTs that omit "pairsets" trigger a spurious
    "wrong number of pairsets" error and probe fails with -EINVAL.
    
    Compare against -ENOENT so a missing "pairsets" property is correctly
    treated as "this PI has no pairsets, continue".
    
    Fixes: 9be9567a7c59 ("net: pse-pd: Add support for PSE PIs")
    Cc: stable@vger.kernel.org
    Signed-off-by: Jonas Jelonek <jelonek.jonas@gmail.com>
    Acked-by: Oleksij Rempel <o.rempel@pengutronix.de>
    Link: https://patch.msgid.link/20260515143103.1721888-1-jelonek.jonas@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

net: shaper: annotate the data races [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Fri May 15 15:13:24 2026 -0700

    net: shaper: annotate the data races
    
    [ Upstream commit a3442936dd0523277e20aaf86207c574e755c634 ]
    
    As previously discussed we don't care about making the shaper
    state fully RCU-compliant because the hierarchy itself can't
    be dumped in one go over Netlink. Let's annotate the reads
    and writes to make that clear.
    
    The field-by-field assignments will also be useful for the
    next commit which adds explicit "valid" field (which we don't
    want to override with the current full struct assignment).
    
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20260515221325.1685455-2-kuba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Stable-dep-of: b8d7519352ba ("net: shaper: rework the VALID marking (again)")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: enforce singleton NETDEV scope with id 0 [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:29:03 2026 -0700

    net: shaper: enforce singleton NETDEV scope with id 0
    
    [ Upstream commit b62b29e6de6711f5918940aa6ff2bbab6d6af502 ]
    
    The NETDEV scope represents a singleton root shaper in the per-device
    hierarchy.  All code assumes NETDEV shapers have id 0:
    net_shaper_default_parent() hardcodes parent->id = 0 when returning
    the NETDEV parent for QUEUE/NODE children, and the UAPI documentation
    describes NETDEV scope as "the main shaper" (singular, not plural).
    
    Make sure we reject non-0 IDs.
    
    Fixes: 4b623f9f0f59 ("net-shapers: implement NL get operation")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-10-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: fix trivial ordering issue in net_shaper_commit() [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:28:56 2026 -0700

    net: shaper: fix trivial ordering issue in net_shaper_commit()
    
    [ Upstream commit 235fb5376139c3419f2218349f1fa2f06f24f7ad ]
    
    We should update the entry before we mark it as valid.
    
    Fixes: 93954b40f6a4 ("net-shapers: implement NL set and delete operations")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-3-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: fix undersized reply skb allocation in GROUP command [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:29:00 2026 -0700

    net: shaper: fix undersized reply skb allocation in GROUP command
    
    [ Upstream commit 0f9a857e34d0f8c018a3e4435c6f0e92e8d2f38c ]
    
    net_shaper_group_send_reply() writes both the NET_SHAPER_A_IFINDEX
    attribute (via net_shaper_fill_binding()) and the nested
    NET_SHAPER_A_HANDLE attribute (via net_shaper_fill_handle()), but
    the reply skb at the call site in net_shaper_nl_group_doit() is
    allocated using net_shaper_handle_size(), which only accounts for
    the nested handle.
    
    The allocation is therefore short by nla_total_size(sizeof(u32))
    (8 bytes) for the IFINDEX attribute.  In practice the slab allocator
    rounds up the small allocation so the bug is latent, but the size
    accounting is wrong and could bite if the reply grew further.
    
    Introduce net_shaper_group_reply_size() that accounts for the full
    reply payload and use it both at the genlmsg_new() call site and in
    the defensive WARN_ONCE message.
    
    Fixes: 5d5d4700e75d ("net-shapers: implement NL group operation")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-7-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: flip the polarity of the valid flag [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:28:55 2026 -0700

    net: shaper: flip the polarity of the valid flag
    
    [ Upstream commit 7cee43fcb0c3f71441d2faaa8c2202b6a88b6bef ]
    
    The usual way of inserting entries which are not yet fully ready
    into XArray is to have a VALID flag. The shaper code has a NOT_VALID
    flag. Since XArray code does not let us create entries with marks
    already set - the creation of entries is currently not atomic.
    
    Flip the polarity of the VALID flag. This closes the tiny race
    in net_shaper_pre_insert() of entries being created without
    the NOT_VALID flag.
    
    Fixes: 93954b40f6a4 ("net-shapers: implement NL set and delete operations")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-2-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: reject duplicate leaves in GROUP request [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:28:57 2026 -0700

    net: shaper: reject duplicate leaves in GROUP request
    
    [ Upstream commit a9a2fa1da619f276580b0d4c5d12efac89e8642b ]
    
    net_shaper_nl_group_doit() does not deduplicate NET_SHAPER_A_LEAVES
    entries. When userspace supplies the same leaf handle twice, the same
    old-parent pointer lands twice in old_nodes[]. The cleanup loop double
    frees the parent. Of course the same parent may still be in old_nodes[]
    twice if we are moving multiple of its leaves.
    
    Note that this patch also implicitly fixes the fact that the
    i >= leaves_count path forgets to set ret.
    
    Fixes: 5d5d4700e75d ("net-shapers: implement NL group operation")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-4-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: reject QUEUE scope handle with missing id [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:29:04 2026 -0700

    net: shaper: reject QUEUE scope handle with missing id
    
    [ Upstream commit ce372e869f9f492f3d5aa9a0ae75ed52c61d2d6f ]
    
    net_shaper_parse_handle() does not enforce that the user provides
    the handle ID. For NODE the ID defaults to UNSPEC for all other
    cases it defaults to 0.
    
    For NETDEV 0 is the only option. For QUEUE defaulting to 0 makes
    less intuitive sense. Specifically because the behavior should
    (IMHO) be the same for all cases where there may be more than
    one ID (QUEUE and NODE).
    
    We should either document this as intentional or reject.
    I picked the latter with no strong conviction.
    
    Fixes: 4b623f9f0f59 ("net-shapers: implement NL get operation")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-11-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: Reject reparenting of existing nodes [+ + +]

Author: Mohsin Bashir <hmohsin@meta.com>
Date:   Wed May 6 16:37:45 2026 -0700

    net: shaper: Reject reparenting of existing nodes
    
    [ Upstream commit a77d5a069d959dc45f5f472d48cba37d8cba0f1c ]
    
    When an existing node-scope shaper is moved to a different parent
    via the group operation, the framework fails to update the leaves
    count on both the old and new parent shapers. Only newly created
    nodes (handle.id == NET_SHAPER_ID_UNSPEC) trigger the parent
    leaves increment at line 1039.
    
    This causes the parent's leaves counter to diverge from the
    actual number of children in the xarray. When the node is later
    deleted, pre_del_node() allocates an array sized by the stale
    leaves count, but the xarray iteration finds more children than
    expected, hitting the WARN_ON_ONCE guard and returning -EINVAL.
    
    Rather than adding reparenting support with complex leaves count
    bookkeeping, reject group calls that attempt to change an existing
    node's parent. Updates to an existing node's rate or leaves under
    the same parent remain permitted. We expect that for any modification
    of the topology user should always create new groups and let the
    kernel garbage collect the leaf-less nodes.
    
    Fixes: 5d5d4700e75d ("net-shapers: implement NL group operation")
    Signed-off-by: Mohsin Bashir <hmohsin@meta.com>
    Link: https://patch.msgid.link/20260506233745.111895-1-mohsin.bashr@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: rework the VALID marking (again) [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Fri May 15 15:13:25 2026 -0700

    net: shaper: rework the VALID marking (again)
    
    [ Upstream commit b8d7519352ba8c6df83259295d4a3bad093cae90 ]
    
    Recent commit changed the semantics from NOT_VALID to VALID.
    I didn't realize that the flags are not stored atomically
    with the entry in XArray. There's still a race of reader
    observing a VALID mark for a slot, getting interrupted,
    writer replacing the entry with a different one, reader
    continuing, fetching the entry which is now a different
    pointer than the pointer for which VALID was meant.
    
    The biggest consequence of this is that we may see a UAF
    since net_shaper_rollback() assumed that entries without
    VALID can be freed without observing RCU.
    
    Looks like the XArray marks are buying us nothing at this
    point. Let's convert the code to an explicit valid field.
    The smp_load_acquire() / smp_store_release() barriers are
    marginally cleaner.
    
    Reported-by: Sashiko <sashiko-bot@kernel.org>
    Fixes: 93954b40f6a4 ("net-shapers: implement NL set and delete operations")
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20260515221325.1685455-3-kuba@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: shaper: set ret to -ENOMEM when genlmsg_new() fails in group_doit [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Sun May 10 12:28:59 2026 -0700

    net: shaper: set ret to -ENOMEM when genlmsg_new() fails in group_doit
    
    [ Upstream commit 8054f85b83f42a37d482fc77ea7c9ff06a9407d9 ]
    
    genlmsg_new() alloc failure path in net_shaper_nl_group_doit() forgets
    to set ret before jumping to error handling.
    
    Fixes: 5d5d4700e75d ("net-shapers: implement NL group operation")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260510192904.3987113-6-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: stmmac: eswin: clear TXD and RXD delay registers during initialization [+ + +]

Author: Zhi Li <lizhi2@eswincomputing.com>
Date:   Mon May 18 10:21:37 2026 +0800

    net: stmmac: eswin: clear TXD and RXD delay registers during initialization
    
    [ Upstream commit 6872fb088edc1a3c36792b301f8e4a1c35dd7c35 ]
    
    Clear the TXD and RXD delay control registers during EIC7700 DWMAC
    initialization.
    
    These registers may retain values programmed by the bootloader. If left
    unchanged, residual delays can alter the effective RGMII timing seen by
    the MAC and override the configuration described by the device tree.
    
    This may violate the expected RGMII timing model and can cause link
    instability or prevent the Ethernet controller from operating correctly.
    
    Explicitly clearing these registers ensures that the MAC delay settings
    are determined solely by the kernel configuration.
    
    The corresponding register offsets are optional, and the registers are
    only cleared when the offsets are provided in the device tree.
    
    Fixes: ea77dbbdbc4e ("net: stmmac: add Eswin EIC7700 glue driver")
    Signed-off-by: Zhi Li <lizhi2@eswincomputing.com>
    Link: https://patch.msgid.link/20260518022137.464-1-lizhi2@eswincomputing.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: stmmac: eswin: correct RGMII delay granularity to 20 ps [+ + +]

Author: Zhi Li <lizhi2@eswincomputing.com>
Date:   Mon May 18 10:21:52 2026 +0800

    net: stmmac: eswin: correct RGMII delay granularity to 20 ps
    
    [ Upstream commit 6ffcef9bc1fc2ad8110777decd6d026e3cb468ce ]
    
    The EIC7700 MAC implements programmable RGMII delay adjustment with a
    granularity of 20 ps per hardware step.
    
    The driver previously converted rx-internal-delay-ps and
    tx-internal-delay-ps values using a 100 ps step size, resulting in
    incorrect delay programming.
    
    Update the conversion to use the correct 20 ps granularity so the
    programmed delay matches the values described in the device tree.
    
    Fixes: ea77dbbdbc4e ("net: stmmac: add Eswin EIC7700 glue driver")
    Signed-off-by: Zhi Li <lizhi2@eswincomputing.com>
    Link: https://patch.msgid.link/20260518022156.484-1-lizhi2@eswincomputing.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: stmmac: eswin: fix HSP CSR init ordering after clock enable [+ + +]

Author: Zhi Li <lizhi2@eswincomputing.com>
Date:   Mon May 18 10:20:55 2026 +0800

    net: stmmac: eswin: fix HSP CSR init ordering after clock enable
    
    [ Upstream commit 23386defe949c0db4f746bed7098fc5e06746083 ]
    
    Fix the initialization ordering of the HSP CSR configuration in the
    EIC7700 DWMAC glue driver.
    
    The HSP CSR registers control MAC-side RGMII delay behavior and must
    only be accessed after the corresponding clocks are enabled. The
    previous implementation could trigger register access before clock
    enablement, leading to undefined behavior depending on boot state.
    
    Move the HSP CSR configuration into the post-clock-enable initialization
    path to ensure all register accesses occur under valid clock domains.
    
    This change ensures deterministic initialization and prevents
    clock-dependent register access failures during probe or resume.
    
    Fixes: ea77dbbdbc4e ("net: stmmac: add Eswin EIC7700 glue driver")
    Signed-off-by: Zhi Li <lizhi2@eswincomputing.com>
    Link: https://patch.msgid.link/20260518022055.444-1-lizhi2@eswincomputing.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: stmmac: eswin: validate RGMII delay values [+ + +]

Author: Zhi Li <lizhi2@eswincomputing.com>
Date:   Mon May 18 10:22:13 2026 +0800

    net: stmmac: eswin: validate RGMII delay values
    
    [ Upstream commit c2e152f7ce3208b9333d212d41a87637ec1dd170 ]
    
    Validate rx-internal-delay-ps and tx-internal-delay-ps against the
    hardware capabilities of the EIC7700 MAC.
    
    The programmable RGMII delay supports 20 ps steps and a maximum value of
    2540 ps. The driver previously accepted arbitrary values and silently
    truncated unsupported settings when converting them to hardware units.
    
    As a result, invalid device tree values could lead to unexpected delay
    programming and incorrect RGMII timing.
    
    Reject delay values that are not multiples of 20 ps or exceed the
    supported hardware range.
    
    Fixes: ea77dbbdbc4e ("net: stmmac: add Eswin EIC7700 glue driver")
    Signed-off-by: Zhi Li <lizhi2@eswincomputing.com>
    Link: https://patch.msgid.link/20260518022214.507-1-lizhi2@eswincomputing.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: ti: icssm-prueth: fix eth_ports_node leak in probe [+ + +]

Author: Shitalkumar Gandhi <shital.gandhi45@gmail.com>
Date:   Thu May 7 01:28:13 2026 +0530

    net: ti: icssm-prueth: fix eth_ports_node leak in probe
    
    [ Upstream commit 6635fa84403c3a59455b66007c019a7cc632db30 ]
    
    The error path on of_property_read_u32() failure inside
    icssm_prueth_probe() returns without putting eth_ports_node,
    which was acquired before the for_each_child_of_node() loop.
    
    Drop it before returning.
    
    Fixes: 511f6c1ae093 ("net: ti: icssm-prueth: Adds ICSSM Ethernet driver")
    Signed-off-by: Shitalkumar Gandhi <shitalkumar.gandhi@cambiumnetworks.com>
    Link: https://patch.msgid.link/20260506195813.641610-1-shitalkumar.gandhi@cambiumnetworks.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: tls: fix off-by-one in sg_chain entry count for wrapped sk_msg ring [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Mon May 11 10:49:17 2026 -0700

    net: tls: fix off-by-one in sg_chain entry count for wrapped sk_msg ring
    
    [ Upstream commit 285943c6e7ca309bbea84b253745154241d9788a ]
    
    When an sk_msg scatterlist ring wraps (sg.end < sg.start),
    tls_push_record() chains the tail portion of the ring to the head
    using sg_chain(). An extra entry in the sg array is reserved for
    this:
    
      struct sk_msg_sg {
            [...]
            /* The extra two elements:
             * 1) used for chaining the front and sections when the list becomes
             *    partitioned (e.g. end < start). The crypto APIs require the
             *    chaining;
             * 2) to chain tailer SG entries after the message.
             */
            struct scatterlist              data[MAX_MSG_FRAGS + 2];
    
    The current code uses MAX_SKB_FRAGS + 1 as the ring size:
    
        sg_chain(&msg_pl->sg.data[msg_pl->sg.start],
                 MAX_SKB_FRAGS - msg_pl->sg.start + 1,
                 msg_pl->sg.data);
    
    This places the chain pointer at
    
      sg_chain(data[start], (MAX_SKB_FRAGS - msg_start + 1) .. =
      &data[start] + (MAX_SKB_FRAGS - msg_start + 1) - 1 =
      data[start + (MAX_SKB_FRAGS - start + 1) - 1] =
      data[MAX_SKB_FRAGS]
    
    instead of the true last entry. This is likely due to a "race" of
    the commit under Fixes landing close to
    commit 031097d9e079 ("bpf: sk_msg, zap ingress queue on psock down")
    
    Convert to ARRAY_SIZE and drop the data[start] / - start (as suggested
    by Sabrina).
    
    Reported-by: 钱一铭 <yimingqian591@gmail.com>
    Fixes: 9aaaa56845a0 ("bpf: Sockmap/tls, skmsg can have wrapped skmsg that needs extra chaining")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Reviewed-by: Sabrina Dubroca <sd@queasysnail.net>
    Link: https://patch.msgid.link/20260511174920.433155-2-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: tls: prevent chain-after-chain in plain text SG [+ + +]

Author: Jakub Kicinski <kuba@kernel.org>
Date:   Mon May 11 10:49:18 2026 -0700

    net: tls: prevent chain-after-chain in plain text SG
    
    [ Upstream commit ff26a0e8377dec07e4a7230db7675bed1b9a6d03 ]
    
    Sashiko points out that if end = 0 (start != 0) the current
    code will create a chain link to content type right after
    the wrap link:
    
      This would create a chain where the wrap link points directly
      to another chain link. The scatterlist API sg_next iterator
      does not recursively resolve consecutive chain links.
    
    meaning this is illegal input to crypto.
    
    The wrapping link is unnecessary if end = 0. end is the entry after
    the last one used so end = 0 means there's nothing pushed after
    the wrap:
    
       end         start            i
        v            v              v
      [   ]...[   ][ d ][ d ][ d ][ d ][rsv for wrap]
    
    Skip the wrapping in this case.
    
    TLS 1.3 can use the "wrapping slot" for it's chaining if end = 0.
    This avoids the chain-after-chain.
    
    Move the wrap chaining before marking END and chaining off content
    type, that feels like more logical ordering to me, but should not
    matter from functional perspective.
    
    Reported-by: Sashiko <sashiko-bot@kernel.org>
    Fixes: 9aaaa56845a0 ("bpf: Sockmap/tls, skmsg can have wrapped skmsg that needs extra chaining")
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://patch.msgid.link/20260511174920.433155-3-kuba@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

net: wwan: iosm: fix potential memory leaks in ipc_imem_init() [+ + +]

Author: Abdun Nihaal <nihaal@cse.iitm.ac.in>
Date:   Tue May 19 11:57:39 2026 +0530

    net: wwan: iosm: fix potential memory leaks in ipc_imem_init()
    
    commit c5d93b2c40355e999715262a824965aac025a427 upstream.
    
    The memory allocated in ipc_protocol_init() is not freed on the error
    paths that follow in ipc_imem_init(). Fix that by calling the
    corresponding release function ipc_protocol_deinit() in the error path.
    
    Fixes: 3670970dd8c6 ("net: iosm: shared memory IPC interface")
    Cc: stable@vger.kernel.org
    Signed-off-by: Abdun Nihaal <nihaal@cse.iitm.ac.in>
    Link: https://patch.msgid.link/20260519062815.55545-1-nihaal@cse.iitm.ac.in
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

netfilter: bridge: eb_tables: close module init race [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Thu May 7 11:19:22 2026 +0200

    netfilter: bridge: eb_tables: close module init race
    
    [ Upstream commit 27414ff1b287ea9a2a11675149ec28e05539f3cc ]
    
    sashiko reports for unrelated patch:
     Does the core ebtables initialization in ebtables.c suffer from a similar race?
     Once nf_register_sockopt() completes, the sockopts are exposed globally.
    
    sockopt has to be registered last, just like in ip/ip6/arptables.
    
    Fixes: 5b53951cfc85 ("netfilter: ebtables: use net_generic infra")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: ebtables: close dangling table module init race [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:19 2026 +0200

    netfilter: ebtables: close dangling table module init race
    
    [ Upstream commit 92c603fa07bc0d6a17345de3ad7954730b8de44b ]
    
    sashiko reported for a related patch:
     In modules like iptable_raw.c, [..], if register_pernet_subsys() fails,
     the rollback might call kfree(rawtable_ops) before [..]
     During this window, could a concurrent userspace process find the globally
     visible template, trigger table_init(), [..]
    
    The table init functions must always register the template last.
    
    Otherwise, set/getsockopt can instantiate a table in a namespace
    while the required pernet ops (contain the destructor) isn't available.
    This change is also required in x_tables, handled in followup change.
    
    Fixes: 87663c39f898 ("netfilter: ebtables: do not hook tables by default")
    Reviewed-by: Tristan Madani <tristan@talencesecurity.com>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: ebtables: move to two-stage removal scheme [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:18 2026 +0200

    netfilter: ebtables: move to two-stage removal scheme
    
    [ Upstream commit b7f0544d86d439cb946515d2ef6a0a75e8626710 ]
    
    Like previous patches for x_tables, follow same pattern in ebtables.
    We can't reuse xt helpers: ebt_table struct layout is incompatible.
    
    table->ops assignment is now done while still holding the ebt mutex
    to make sure we never expose partially-filled table struct.
    
    Fixes: 87663c39f898 ("netfilter: ebtables: do not hook tables by default")
    Reviewed-by: Tristan Madani <tristan@talencesecurity.com>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: ip6t_hbh: reject oversized option lists [+ + +]

Author: Zhengchuan Liang <zcliangcn@gmail.com>
Date:   Wed May 13 15:57:17 2026 +0800

    netfilter: ip6t_hbh: reject oversized option lists
    
    commit 4322dcde6b4173c2d8e8e6118ed290794263bcc8 upstream.
    
    struct ip6t_opts stores at most IP6T_OPTS_OPTSNR option descriptors,
    but hbh_mt6_check() does not reject larger optsnr values supplied from
    userspace.
    
    Validate optsnr in the rule setup path so only match data that fits the
    fixed-size opts array can be installed. This follows the existing xtables
    pattern of rejecting invalid user-provided counts in checkentry() and
    keeps the packet matching path unchanged.
    
    `struct ip6t_opts` has a fixed `opts[IP6T_OPTS_OPTSNR]` array,
    where `IP6T_OPTS_OPTSNR` is 16, then off-by-one array access is possible:
    
    [  137.924693][ T8692] UBSAN: array-index-out-of-bounds in ../net/ipv6/netfilter/ip6t_hbh.c:110:29
    [  137.926167][ T8692] index 16 is out of range for type '__u16 [16]'
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: stable@kernel.org
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Zhengchuan Liang <zcliangcn@gmail.com>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

netfilter: ipset: stop hash:* range iteration at end [+ + +]

Author: Nan Li <tonanli66@gmail.com>
Date:   Tue May 12 16:50:01 2026 +0800

    netfilter: ipset: stop hash:* range iteration at end
    
    commit 0d3a282ab5f165fc207ff49ea5b6ad8f54616bd6 upstream.
    
    The following hash set variants:
    
    hash:ip,mark
    hash:ip,port
    hash:ip,port,ip
    hash:ip,port,net
    
    iterate IPv4 ranges with a 32-bit iterator.
    
    The iterator must stop once the last address in the requested range has
    been processed. Advancing it once more can move the traversal state past
    the end of the request, so a later retry may continue from an unintended
    position.
    
    Handle the iterator increment explicitly at the end of the loop and stop
    once the upper bound has been processed. This keeps the existing retry
    behaviour intact for valid ranges while preventing traversal from
    continuing past the original boundary.
    
    Fixes: 48596a8ddc46 ("netfilter: ipset: Fix adding an IPv4 range containing more than 2^31 addresses")
    Cc: stable@kernel.org
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Nan Li <tonanli66@gmail.com>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

netfilter: nf_conntrack_expect: restore helper propagation via expectation [+ + +]

Author: Pablo Neira Ayuso <pablo@netfilter.org>
Date:   Thu May 7 13:00:28 2026 +0200

    netfilter: nf_conntrack_expect: restore helper propagation via expectation
    
    [ Upstream commit dcb0f9aefdd604d36710fda53c25bd7cf4a3e37a ]
    
    A recent series to fix expectations broke helper propagation via
    expectation, this mechanism is used by the sip and h323 helper. This
    also propagates the conntrack helper to expected connections. I changed
    semantics of exp->helper which now tells us the actual helper that
    created the expectation.
    
    Add an explicit assign_helper field to expectations for this purpose
    and update helpers to use it.
    
    Restore this feature for userspace conntrack helper via ctnetlink
    nfqueue integration so it is again possible to attach a helper to an
    expectation, where it makes sense. This is not restored via ctnetlink
    expectation creation as there is no client for such feature. Use the
    expectation layer 4 protocol number for the helper lookup for
    consistency.
    
    Make sure the expectation using this helper propagation mechanism also
    go away when the helper is unregistered.
    
    Fixes: 9c42bc9db90a ("netfilter: nf_conntrack_expect: honor expectation helper field")
    Fixes: 917b61fa2042 ("netfilter: ctnetlink: ignore explicit helper on new expectations")
    Reported-by: Ilya Maximets <i.maximets@ovn.org>
    Tested-by: Ilya Maximets <i.maximets@ovn.org>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: nf_queue: hold bridge skb->dev while queued [+ + +]

Author: Haoze Xie <royenheart@gmail.com>
Date:   Fri May 15 11:19:02 2026 +0800

    netfilter: nf_queue: hold bridge skb->dev while queued
    
    commit e196115ec330a18de415bdb9f5071aa9f08e53ce upstream.
    
    br_pass_frame_up() rewrites skb->dev from the ingress port to the bridge
    master before queueing bridge LOCAL_IN packets. NFQUEUE only holds
    references on state.in/out and bridge physdevs, so a queued bridge
    packet can retain a freed bridge master in skb->dev until reinjection.
    
    When the verdict is reinjected later, br_netif_receive_skb() re-enters
    the receive path with skb->dev still pointing at the freed bridge master,
    triggering a use-after-free.
    
    Store skb->dev in the queue entry, hold a reference on it for the queue
    lifetime, and use the saved device when dropping queued packets during
    NETDEV_DOWN handling.
    
    Fixes: ac2863445686 ("netfilter: bridge: add nf_afinfo to enable queuing to userspace")
    Cc: stable@kernel.org
    Reported-by: Yuan Tan <yuantan098@gmail.com>
    Reported-by: Yifan Wu <yifanwucs@gmail.com>
    Reported-by: Juefei Pu <tomapufckgml@gmail.com>
    Reported-by: Xin Liu <bird@lzu.edu.cn>
    Signed-off-by: Haoze Xie <royenheart@gmail.com>
    Signed-off-by: Ren Wei <n05ec@lzu.edu.cn>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

netfilter: nft_inner: Fix IPv6 inner_thoff desync [+ + +]

Author: Yizhou Zhao <zhaoyz24@mails.tsinghua.edu.cn>
Date:   Tue May 12 01:30:41 2026 +0800

    netfilter: nft_inner: Fix IPv6 inner_thoff desync
    
    commit b6a91f68ebfed9c38e0e9150f58a9b85da07181c upstream.
    
    In nft_inner_parse_l2l3(), when processing inner IPv6 packets,
    ipv6_find_hdr() correctly computes the transport header offset
    traversing all extension headers, but the result is immediately
    overwritten with nhoff + sizeof(_ip6h) (40 bytes), which only
    accounts for the IPv6 base header. This creates a desync between
    inner_thoff (wrong — points to extension header start) and l4proto
    (correct — e.g., IPPROTO_TCP), enabling transport header forgery
    and potential firewall bypass. This issue affects stable versions
    from Linux 6.2.
    
    For comparison, the normal (non-inner) IPv6 path correctly
    preserves ipv6_find_hdr()'s result. Removing the incorrect overwrite
    ensures that ipv6_find_hdr()'s calculated transport header offset is
    preserved, thereby fixing the desynchronization.
    
    Fixes: 3a07327d10a0 ("netfilter: nft_inner: support for inner tunnel header matching")
    Cc: stable@vger.kernel.org
    Reported-by: Yizhou Zhao <zhaoyz24@mails.tsinghua.edu.cn>
    Reported-by: Yuxiang Yang <yangyx22@mails.tsinghua.edu.cn>
    Reported-by: Xuewei Feng <fengxw06@126.com>
    Reported-by: Qi Li <qli01@tsinghua.edu.cn>
    Reported-by: Ke Xu <xuke@tsinghua.edu.cn>
    Assisted-by: GLM:5.1 Z.ai
    Signed-off-by: Yizhou Zhao <zhaoyz24@mails.tsinghua.edu.cn>
    Reviewed-by: Fernando Fernandez Mancera <fmancera@suse.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

netfilter: nft_inner: release local_lock before re-enabling softirqs [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Tue May 12 11:30:49 2026 +0200

    netfilter: nft_inner: release local_lock before re-enabling softirqs
    
    [ Upstream commit a6cb3ff979855f7f0ee9450a947fe8f96c2ba37a ]
    
    Quoting sashiko:
     In the error path, local_bh_enable() is called before
     local_unlock_nested_bh().
    
    Fixes: ba36fada9ab4 ("netfilter: nft_inner: Use nested-BH locking for nft_pcpu_tun_ctx")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Reviewed-by: Fernando Fernandez Mancera <fmancera@suse.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: x_tables: add and use xt_unregister_table_pre_exit [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:15 2026 +0200

    netfilter: x_tables: add and use xt_unregister_table_pre_exit
    
    [ Upstream commit 527d6931473b75d90e38942aae6537d1a527f1fd ]
    
    Remove the copypasted variants of _pre_exit and add one single
    function in the xtables core.  ebtables is not compatible with
    x_tables and therefore unchanged.
    
    This is a preparation patch to reduce noise in the followup
    bug fixes.
    
    Reviewed-by: Tristan Madani <tristan@talencesecurity.com>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Stable-dep-of: b4597d5fd7d2 ("netfilter: x_tables: add and use xtables_unregister_table_exit")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: x_tables: add and use xtables_unregister_table_exit [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:17 2026 +0200

    netfilter: x_tables: add and use xtables_unregister_table_exit
    
    [ Upstream commit b4597d5fd7d2f8cebfffd40dffb5e003cc78964c ]
    
    Previous change added xtables_unregister_table_pre_exit to detach the
    table from the packetpath and to unlink it from the active table list.
    In case of rmmod, userspace that is doing set/getsockopt for this table
    will not be able to re-instantiate the table:
     1. The larval table has been removed already
     2. existing instantiated table is no longer on the xt pernet table list.
    
    This adds the second stage helper:
    
    unlink the table from the dying list, free the hook ops (if any) and do
    the audit notification.  It replaces xt_unregister_table().
    
    Fixes: fdacd57c79b7 ("netfilter: x_tables: never register tables by default")
    Reported-by: Tristan Madani <tristan@talencesecurity.com>
    Reviewed-by: Tristan Madani <tristan@talencesecurity.com>
    Closes: https://lore.kernel.org/netfilter-devel/20260429175613.1459342-1-tristmd@gmail.com/
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: x_tables: allocate hook ops while under mutex [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:14 2026 +0200

    netfilter: x_tables: allocate hook ops while under mutex
    
    [ Upstream commit b62eb8dcf2c47d4d676a434efbd57c4f776f7829 ]
    
    arp/ip(6)t_register_table() add the table to the per-netns list via
    xt_register_table() before allocating the per-netns hook ops copy
    via kmemdup_array().  This leaves a window where the table is
    visible in the list with ops=NULL.
    
    If the pernet exit happens runs concurrently the pre_exit callback finds
    the table via xt_find_table() and passes the NULL ops pointer to
    nf_unregister_net_hooks(), causing a NULL dereference:
    
      general protection fault in nf_unregister_net_hooks+0xbc/0x150
      RIP: nf_unregister_net_hooks (net/netfilter/core.c:613)
      Call Trace:
        ipt_unregister_table_pre_exit
        iptable_mangle_net_pre_exit
        ops_pre_exit_list
        cleanup_net
    
    Fix by moving the ops allocation into the xtables core so the table is
    never in the list without valid ops.  Also ensure the table is no longer
    processing packets before its torn down on error unwind.
    nf_register_net_hooks might have published at least one hook; call
    synchronize_rcu() if there was an error.
    
    audit log register message gets deferred until all operations have
    passed, this avoids need to emit another ureg message in case of
    error unwinding.
    
    Based on earlier patch by Tristan Madani.
    
    Fixes: f9006acc8dfe5 ("netfilter: arp_tables: pass table pointer via nf_hook_ops")
    Fixes: ee177a54413a ("netfilter: ip6_tables: pass table pointer via nf_hook_ops")
    Fixes: ae689334225f ("netfilter: ip_tables: pass table pointer via nf_hook_ops")
    Link: https://lore.kernel.org/netfilter-devel/20260429175613.1459342-1-tristmd@gmail.com/
    Signed-off-by: Tristan Madani <tristan@talencesecurity.com>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: x_tables: allow initial table replace without emitting audit log message [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:13 2026 +0200

    netfilter: x_tables: allow initial table replace without emitting audit log message
    
    [ Upstream commit 8e72510db9fa2d41f2b06d5c01fe9020e076fee4 ]
    
    At the moment we emit the audit log a bit too early, which makes it
    necessary to also emit an unregister log in case we have to unwind
    errors after possible hook register failure.
    
    Followup patch will be slightly simpler if we can delay the
    register message until after the hooks have been wired up.
    
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Stable-dep-of: b62eb8dcf2c4 ("netfilter: x_tables: allocate hook ops while under mutex")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: x_tables: close dangling table module init race [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:20 2026 +0200

    netfilter: x_tables: close dangling table module init race
    
    [ Upstream commit 16bc4b6686b2c112c10e67d6b493adc3607256d3 ]
    
    Similar to the previous ebtables patch:
    template add exposes the table to userspace, we must do this last to
    rnsure the pernet ops are set up (contain the destructors).
    
    Fixes: fdacd57c79b7 ("netfilter: x_tables: never register tables by default")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfilter: x_tables: unregister the templates first [+ + +]

Author: Florian Westphal <fw@strlen.de>
Date:   Wed May 6 12:07:16 2026 +0200

    netfilter: x_tables: unregister the templates first
    
    [ Upstream commit d338693d778579b676a61346849bebd892427158 ]
    
    When the module is going away we need to zap the template
    first.  Else there is a small race window where userspace
    could instantiate a new table after the pernet exit function
    has removed the current table.
    
    Fixes: fdacd57c79b7 ("netfilter: x_tables: never register tables by default")
    Reported-by: Tristan Madani <tristan@talencesecurity.com>
    Reviewed-by: Tristan Madani <tristan@talencesecurity.com>
    Closes: https://lore.kernel.org/netfilter-devel/20260429175613.1459342-1-tristmd@gmail.com/
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs, afs: Fix write skipping in dir/link writepages [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:34:00 2026 +0100

    netfs, afs: Fix write skipping in dir/link writepages
    
    [ Upstream commit 9871938f99cc6cb266a77265491660e2375271f5 ]
    
    Fix netfs_write_single() and afs_single_writepages() to better handle a
    write that would be skipped due to lock contention and WB_SYNC_NONE by
    returning 1 from netfs_write_single() if it skipped and making
    afs_single_writepages() skip also.  If a skip occurs, the inode must be
    re-marked as the VFS may have cleared the mark.
    
    This is really only theoretical for directories in netfs_write_single() as
    the only path to that is through afs_single_writepages() that takes the
    ->validate_lock around it, thereby serialising it.
    
    Fixes: 6dd80936618c ("afs: Use netfslib for directories")
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-24-dhowells@redhat.com
    cc: Marc Dionne <marc.dionne@auristor.com>
    cc: linux-afs@lists.infradead.org
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Defer the emission of trace_netfs_folio() [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:49 2026 +0100

    netfs: Defer the emission of trace_netfs_folio()
    
    [ Upstream commit daeb443b92817021c1234e8eded219e164b7c35d ]
    
    Change netfs_perform_write() to keep the netfs_folio trace value in a
    variable and emit it later to make it easier to choose the value displayed.
    This is a prerequisite for a subsequent patch.
    
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-13-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Stable-dep-of: 7b4dcf1b9455 ("netfs: Fix streaming write being overwritten")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix cancellation of a DIO and single read subrequests [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:38 2026 +0100

    netfs: Fix cancellation of a DIO and single read subrequests
    
    [ Upstream commit 6f0f7ac1915abc0d202f0eb4b003a6548a5ba60d ]
    
    When the preparation of a new subrequest for a read fails, if the
    subrequest has already been added to the stream->subrequests list, it can't
    simply be put and abandoned as the collector may see it.  Also, if it
    hasn't been queued yet, it has two outstanding refs that both need to be
    put.  Both DIO read and single-read dispatch fail at this; further, both
    differ in the order they do things to the way buffered read works.
    
    Fix cancellation of both DIO-read and single-read subrequests that failed
    preparation by the following steps:
    
     (1) Harmonise all three reads (buffered, dio, single) to queue the subreq
         before prepping it.
    
     (2) Make all three call netfs_queue_read() to do the queuing.
    
     (3) Set NETFS_RREQ_ALL_QUEUED independently of the queuing as we don't
         know the length of the subreq at this point.
    
     (4) In all cases, set the error and NETFS_SREQ_FAILED flag on the subreq
         and then call netfs_read_subreq_terminated() to deal with it.  This
         will pass responsibility off to the collector for dealing with it.
    
    Fixes: e2d46f2ec332 ("netfs: Change the read result collector to only use one work item")
    Closes: https://sashiko.dev/#/patchset/20260425125426.3855807-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-2-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix early put of sink folio in netfs_read_gaps() [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:54 2026 +0100

    netfs: Fix early put of sink folio in netfs_read_gaps()
    
    [ Upstream commit 3e5dd91b87a8b1450217b56a336bee315f40da7d ]
    
    Fix netfs_read_gaps() to release the sink page it uses after waiting for
    the request to complete.  The way the sink page is used is that an
    ITER_BVEC-class iterator is created that has the gaps from the target folio
    at either end, but has the sink page tiled over the middle so that a single
    read op can fill in both gaps.
    
    The bug was found by KASAN detecting a UAF on the generic/075 xfstest in
    the cifsd kernel thread that handles reception of data from the TCP socket:
    
     BUG: KASAN: use-after-free in _copy_to_iter+0x48a/0xa20
     Write of size 885 at addr ffff888107f92000 by task cifsd/1285
     CPU: 2 UID: 0 PID: 1285 Comm: cifsd Not tainted 7.0.0 #6 PREEMPT(lazy)
     Call Trace:
      dump_stack_lvl+0x5d/0x80
      print_report+0x17f/0x4f1
      kasan_report+0x100/0x1e0
      kasan_check_range+0x10f/0x1e0
      __asan_memcpy+0x3c/0x60
      _copy_to_iter+0x48a/0xa20
      __skb_datagram_iter+0x2c9/0x430
      skb_copy_datagram_iter+0x6e/0x160
      tcp_recvmsg_locked+0xce0/0x1130
      tcp_recvmsg+0xeb/0x300
      inet_recvmsg+0xcf/0x3a0
      sock_recvmsg+0xea/0x100
      cifs_readv_from_socket+0x3a6/0x4d0 [cifs]
      cifs_read_iter_from_socket+0xdd/0x130 [cifs]
      cifs_readv_receive+0xaad/0xb10 [cifs]
      cifs_demultiplex_thread+0x1148/0x1740 [cifs]
      kthread+0x1cf/0x210
    
    Fixes: ee4cdf7ba857 ("netfs: Speed up buffered reading")
    Reported-by: Steve French <sfrench@samba.org>
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-18-dhowells@redhat.com
    Reviewed-by: Paulo Alcantara (Red Hat) <pc@manguebit.org>
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix folio->private handling in netfs_perform_write() [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:58 2026 +0100

    netfs: Fix folio->private handling in netfs_perform_write()
    
    [ Upstream commit ccde2ac757c713535b224233a296de40efe5212d ]
    
    Under some circumstances, netfs_perform_write() doesn't correctly
    manipulate folio->private between NULL, NETFS_FOLIO_COPY_TO_CACHE, pointing
    to a group and pointing to a netfs_folio struct, leading to potential
    multiple attachments of private data with associated folio ref leaks and
    also leaks of netfs_folio structs or netfs_group refs.
    
    Fix this by consolidating the place at which a folio is marked uptodate in
    one place and having that look at what's attached to folio->private and
    decide how to clean it up and then set the new group.  Also, the content
    shouldn't be flushed if group is NULL, even if a group is specified in the
    netfs_group parameter, as that would be the case for a new folio.  A
    filesystem should always specify netfs_group or never specify netfs_group.
    
    The Sashiko auto-review tool noted that it was theoretically possible that
    the fpos >= ctx->zero_point section might leak if it modified a streaming
    write folio.  This is unlikely, but with a network filesystem, third party
    changes can happen.  It also pointed out that __netfs_set_group() would
    leak if called multiple times on the same folio from the "whole folio
    modify section".
    
    Fixes: 8f52de0077ba ("netfs: Reduce number of conditional branches in netfs_perform_write()")
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-22-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix leak of request in netfs_write_begin() error handling [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:55 2026 +0100

    netfs: Fix leak of request in netfs_write_begin() error handling
    
    [ Upstream commit 5046a34f0643441f05b0253ea64e1a3af87efe14 ]
    
    Fix netfs_write_begin() to not leak our ref on the request in the event
    that we get an error from netfs_wait_for_read().
    
    Fixes: 4090b31422a6 ("netfs: Add a function to consolidate beginning a read")
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-19-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix missing barriers when accessing stream->subrequests locklessly [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:40 2026 +0100

    netfs: Fix missing barriers when accessing stream->subrequests locklessly
    
    [ Upstream commit b5782e2d462c028096f922abca46318cec890670 ]
    
    The list of subrequests attached to stream->subrequests is accessed without
    locks by netfs_collect_read_results() and netfs_collect_write_results(),
    and then they access subreq->flags without taking a barrier after getting
    the subreq pointer from the list.  Relatedly, the functions that build the
    list don't use any sort of write barrier when constructing the list to make
    sure that the NETFS_SREQ_IN_PROGRESS flag is perceived to be set first if
    no lock is taken.
    
    Fix this by:
    
     (1) Add a new list_add_tail_release() function that uses a release barrier
         to set the pointer to the new member of the list.
    
     (2) Add a new list_first_entry_or_null_acquire() function that uses an
         acquire barrier to read the pointer to the first member in a list (or
         return NULL).
    
     (3) Use list_add_tail_release() when adding a subreq to ->subrequests.
    
     (4) Use list_first_entry_or_null_acquire() when initially accessing the
         front of the list (when an item is removed, the pointer to the new
         front iterm is obtained under the same lock).
    
    Fixes: e2d46f2ec332 ("netfs: Change the read result collector to only use one work item")
    Fixes: 288ace2f57c9 ("netfs: New writeback implementation")
    Link: https://sashiko.dev/#/patchset/20260326104544.509518-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-4-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix missing locking around retry adding new subreqs [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:39 2026 +0100

    netfs: Fix missing locking around retry adding new subreqs
    
    [ Upstream commit cce18c263e9623872327ba3c956012f73c1179cc ]
    
    Fix netfs_retry_read_subrequests() and netfs_retry_write_stream() to take
    the appropriate lock when adding extra subrequests into
    stream->subrequests.
    
    Fixes: e2d46f2ec332 ("netfs: Change the read result collector to only use one work item")
    Fixes: 288ace2f57c9 ("netfs: New writeback implementation")
    Closes: https://sashiko.dev/#/patchset/20260425125426.3855807-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-3-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix netfs_invalidate_folio() to clear dirty bit if all changes gone [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:48 2026 +0100

    netfs: Fix netfs_invalidate_folio() to clear dirty bit if all changes gone
    
    [ Upstream commit 156ac2ec2ee77c44c4eb7439d6d165247ba12247 ]
    
    If a streaming write is made, this will leave the relevant modified folio
    in a not-uptodate, but dirty state with a netfs_folio struct hung off of
    folio->private indicating the dirty range.  Subsequently truncating the
    file such that the dirty data in the folio is removed, but the first part
    of the folio theoretically remains will cause the netfs_folio struct to be
    discarded... but will leave the dirty flag set.
    
    If the folio is then read via mmap(), netfs_read_folio() will see that the
    page is dirty and jump to netfs_read_gaps() to fill in the missing bits.
    netfs_read_gaps(), however, expects there to be a netfs_folio struct
    present and can oops because truncate removed it.
    
    Fix this by calling folio_cancel_dirty() in netfs_invalidate_folio() in the
    event that all the dirty data in the folio is erased (as nfs does).
    
    Also add some tracepoints to log modifications to a dirty page.
    
    This can be reproduced with something like:
    
        dd if=/dev/zero of=/xfstest.test/foo bs=1M count=1
        umount /xfstest.test
        mount /xfstest.test
        xfs_io -c "w 0xbbbf 0xf96c" \
               -c "truncate 0xbbbf" \
               -c "mmap -r 0xb000 0x11000" \
               -c "mr 0xb000 0x11000" \
               /xfstest.test/foo
    
    with fscaching disabled (otherwise streaming writes are suppressed) and a
    change to netfs_perform_write() to disallow streaming writes if the fd is
    open O_RDWR:
    
            if (//(file->f_mode & FMODE_READ) || <--- comment this out
                netfs_is_cache_enabled(ctx)) {
    
    It should be reproducible even without this change, but if prevents the
    above trivial xfs_io command from reproducing it.
    
    Note that the initial dd is important: the file must start out sufficiently
    large that the zero-point logic doesn't just clear the gaps because it
    knows there's nothing in the file to read yet.  Unmounting and mounting is
    needed to clear the pagecache (there are other ways to do that that may
    also work).
    
    This was initially reproduced with the generic/522 xfstest on some patches
    that remove the FMODE_READ restriction.
    
    Fixes: 9ebff83e6481 ("netfs: Prep to use folio->private for write grouping and streaming write")
    Reported-by: Marc Dionne <marc.dionne@auristor.com>
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-12-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix netfs_read_folio() to wait on writeback [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:59 2026 +0100

    netfs: Fix netfs_read_folio() to wait on writeback
    
    [ Upstream commit ded0c6f1606061148c202825f7e53d711f9f84cf ]
    
    Fix netfs_read_folio() to wait for an ongoing writeback to complete so that
    it can trust the dirty flag and whatever is attached to folio->private
    (folio->private may get cleaned up by the collector before it clears the
    writeback flag).
    
    Fixes: ee4cdf7ba857 ("netfs: Speed up buffered reading")
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-23-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix netfs_read_to_pagecache() to pause on subreq failure [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:41 2026 +0100

    netfs: Fix netfs_read_to_pagecache() to pause on subreq failure
    
    [ Upstream commit 8a8c0cfdf4658fc5b295b7fc87be56e0d76741f4 ]
    
    Fix netfs_read_to_pagecache() so that it pauses the generation of new
    subrequests if an already-issued subrequest fails.
    
    Fixes: ee4cdf7ba857 ("netfs: Speed up buffered reading")
    Closes: https://sashiko.dev/#/patchset/20260425125426.3855807-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-5-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix overrun check in netfs_extract_user_iter() [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:47 2026 +0100

    netfs: Fix overrun check in netfs_extract_user_iter()
    
    [ Upstream commit 0ef37eef83fad3542ee06db2940433ae1a92b39d ]
    
    Fix netfs_extract_user_iter() so that if iov_iter_extract_pages() overfills
    pages[], then those pages don't get included in the iterator constructed at
    the end of the function.  If there was an overfill, memory corruption has
    already happened.
    
    Fixes: 85dd2c8ff368 ("netfs: Add a function to extract a UBUF or IOVEC into a BVEC iterator")
    Closes: https://sashiko.dev/#/patchset/20260427154639.180684-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-11-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix partial invalidation of streaming-write folio [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:57 2026 +0100

    netfs: Fix partial invalidation of streaming-write folio
    
    [ Upstream commit 6d91acc7fb85d33ea58fca9b964a32a453937f4b ]
    
    In netfs_invalidate_folio(), if the region of a partial invalidation
    overlaps the front (but not all) of a dirty write cached in a streaming
    write page (dirty, but not uptodate, with the dirty region tracked by a
    netfs_folio struct), the function modifies the dirty region - but
    incorrectly as it moves the region forward by setting the start to the
    start, not the end, of the invalidation region.
    
    Fix this by setting finfo->dirty_offset to the end of the invalidation
    region (iend).
    
    Fixes: cce6bfa6ca0e ("netfs: Fix trimming of streaming-write folios in netfs_inval_folio()")
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-21-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix potential deadlock in write-through mode [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:51 2026 +0100

    netfs: Fix potential deadlock in write-through mode
    
    [ Upstream commit b6a4ae1634b3ad2aaa05222e53d36da532852faf ]
    
    Fix netfs_advance_writethrough() to always unlock the supplied folio and to
    mark it dirty if it isn't yet written to the end.  Unfortunately, it can't
    be marked for writeback until the folio is done with as that may cause a
    deadlock against mmapped reads and writes.
    
    Even though it has been marked dirty, premature writeback can't occur as
    the caller is holding both inode->i_rwsem (which will prevent concurrent
    truncation, fallocation, DIO and other writes) and ictx->wb_lock (which
    will cause flushing to wait and writeback to skip or wait).
    
    Note that this may be easier to deal with once the queuing of folios is
    split from the generation of subrequests.
    
    Fixes: 288ace2f57c9 ("netfs: New writeback implementation")
    Closes: https://sashiko.dev/#/patchset/20260427154639.180684-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-15-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix potential for tearing in ->remote_i_size and ->zero_point [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:42 2026 +0100

    netfs: Fix potential for tearing in ->remote_i_size and ->zero_point
    
    [ Upstream commit 2c8f4742bb76117d735f92a3932d85239b16c494 ]
    
    Fix potential tearing in using ->remote_i_size and ->zero_point by copying
    i_size_read() and i_size_write() and using the same seqcount as for i_size.
    
    We need to make sure that netfslib and the filesystems that use it always
    hold i_lock whilst updating any of the sizes to prevent i_size_seqcount
    from getting corrupted.
    
    Fixes: 4058f742105e ("netfs: Keep track of the actual remote file size")
    Fixes: 100ccd18bb41 ("netfs: Optimise away reads above the point at which there can be no data")
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-6-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix potential UAF in netfs_unlock_abandoned_read_pages() [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:56 2026 +0100

    netfs: Fix potential UAF in netfs_unlock_abandoned_read_pages()
    
    [ Upstream commit dbe556972100fabb8e5a1b3d2163831ff07b1e8e ]
    
    netfs_unlock_abandoned_read_pages(rreq) accesses the index of the folios it
    is wanting to unlock and compares that to rreq->no_unlock_folio so that it
    doesn't unlock a folio being read for netfs_perform_write() or
    netfs_write_begin().
    
    However, given that netfs_unlock_abandoned_read_pages() is called _after_
    NETFS_RREQ_IN_PROGRESS is cleared, the one folio that it's not allowed to
    dereference is the one specified by ->no_unlock_folio as ownership
    immediately reverts to the caller.
    
    Fix this by storing the folio pointer instead and using that rather than
    the index.  Also fix netfs_unlock_read_folio() where the same applies.
    
    Fixes: ee4cdf7ba857 ("netfs: Speed up buffered reading")
    Closes: https://sashiko.dev/#/patchset/20260414082004.3756080-1-dhowells%40redhat.com
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-20-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix read-gaps to remove netfs_folio from filled folio [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:52 2026 +0100

    netfs: Fix read-gaps to remove netfs_folio from filled folio
    
    [ Upstream commit a41168aef634356a9b87ec44349e3c82835700a5 ]
    
    Fix netfs_read_gaps() to remove the netfs_folio record from the folio
    record before marking the folio uptodate if it successfully fills the gaps
    around the dirty data in a streaming write folio (dirty, but not uptodate).
    
    Found with:
    
        fsx -q -N 1000000 -p 10000 -o 128000 -l 600000 \
            /xfstest.test/junk --replay-ops=junk.fsxops
    
    using the following as junk.fsxops:
    
        truncate 0x0 0x138b1 0x8b15d *
        write 0x507ee 0x10df7 0x927c0
        write 0x19993 0x10e04 0x927c0 *
        mapwrite 0x66214 0x1a253 0x927c0
        copy_range 0xb704 0x89b9 0x24429 0x79380
        write 0x2402b 0x144a2 0x90660 *
        mapwrite 0x204d5 0x140a0 0x927c0 *
        copy_range 0x1f72c 0x137d0 0x7a906 0x927c0 *
        read 0 0x9157c 0x9157c
    
    on cifs with the default cache option.
    
    It shows folio 0x24 misbehaving if the FMODE_READ check is commented out in
    netfs_perform_write():
    
                    if (//(file->f_mode & FMODE_READ) ||
                        netfs_is_cache_enabled(ctx)) {
    
    and no fscache.  This was initially found with the generic/522 xfstest.
    
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-16-dhowells@redhat.com
    Fixes: ee4cdf7ba857 ("netfs: Speed up buffered reading")
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix streaming write being overwritten [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:50 2026 +0100

    netfs: Fix streaming write being overwritten
    
    [ Upstream commit 7b4dcf1b9455a6e52ac7478b4057dbe10359576d ]
    
    In order to avoid reading whilst writing, netfslib will allow "streaming
    writes" in which dirty data is stored directly into folios without reading
    them first.  Such folios are marked dirty but may not be marked uptodate.
    If a folio is entirely written by a streaming write, uptodate will be set,
    otherwise it will have a netfs_folio struct attached to ->private recording
    the dirty region.
    
    In the event that a partially written streaming write page is to be
    overwritten entirely by a single write(), netfs_perform_write() will try to
    copy over it, but doesn't discard the netfs_folio if it succeeds; further,
    it doesn't correctly handle a partial copy that overwrites some of the
    dirty data.
    
    Fix this by the following:
    
     (1) If the folio is successfully overwritten, free the netfs_folio struct
         before marking the page uptodate.
    
     (2) If the copy to the folio partially fails, but short of the dirty data,
         just ignore the copy.
    
     (3) If the copy partially fails and overwrites some of the dirty data,
         accept the copy, update the netfs_folio struct to record the new data.
         If the folio is now filled, free the netfs_folio and set uptodate,
         otherwise return a partial write.
    
    Found with:
    
            fsx -q -N 1000000 -p 10000 -o 128000 -l 600000 \
              /xfstest.test/junk --replay-ops=junk.fsxops
    
    using the following as junk.fsxops:
    
            truncate 0x0 0 0x927c0
            write 0x63fb8 0x53c8 0
            copy_range 0xb704 0x19b9 0x24429 0x79380
            write 0x2402b 0x144a2 0x90660 *
            write 0x204d5 0x140a0 0x927c0 *
            copy_range 0x1f72c 0x137d0 0x7a906 0x927c0 *
            read 0x00000 0x20000 0x9157c
            read 0x20000 0x20000 0x9157c
            read 0x40000 0x20000 0x9157c
            read 0x60000 0x20000 0x9157c
            read 0x7e1a0 0xcfb9 0x9157c
    
    on cifs with the default cache option.
    
    It shows folio 0x24 misbehaving if the FMODE_READ check is commented out in
    netfs_perform_write():
    
                    if (//(file->f_mode & FMODE_READ) ||
                        netfs_is_cache_enabled(ctx)) {
    
    and no fscache.  This was initially found with the generic/522 xfstest.
    
    Fixes: 8f52de0077ba ("netfs: Reduce number of conditional branches in netfs_perform_write()")
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-14-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: fix VM_BUG_ON_FOLIO() issue in netfs_write_begin() call [+ + +]

Author: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>
Date:   Tue May 12 13:33:44 2026 +0100

    netfs: fix VM_BUG_ON_FOLIO() issue in netfs_write_begin() call
    
    [ Upstream commit dc7832d05deb4d632e8035e3299e31a3528fa0d0 ]
    
    The multiple runs of generic/013 test-case is capable
    to reproduce a kernel BUG at mm/filemap.c:1504 with
    probability of 30%.
    
    while true; do
      sudo ./check generic/013
    done
    
    [ 9849.452376] page: refcount:3 mapcount:0 mapping:00000000e58ff252 index:0x10781 pfn:0x1c322
    [ 9849.452412] memcg:ffff8881a1915800
    [ 9849.452417] aops:ceph_aops ino:1000058db9e dentry name(?):"f9XXXXXX"
    [ 9849.452432] flags: 0x17ffffc0000000(node=0|zone=2|lastcpupid=0x1fffff)
    [ 9849.452441] raw: 0017ffffc0000000 0000000000000000 dead000000000122 ffff88816110d248
    [ 9849.452445] raw: 0000000000010781 0000000000000000 00000003ffffffff ffff8881a1915800
    [ 9849.452447] page dumped because: VM_BUG_ON_FOLIO(!folio_test_locked(folio))
    [ 9849.452474] ------------[ cut here ]------------
    [ 9849.452476] kernel BUG at mm/filemap.c:1504!
    [ 9849.478635] Oops: invalid opcode: 0000 [#1] SMP KASAN NOPTI
    [ 9849.481772] CPU: 2 UID: 0 PID: 84223 Comm: fsstress Not tainted 7.0.0-rc1+ #18 PREEMPT(full)
    [ 9849.482881] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-9.fc43 06/1
    0/2025
    [ 9849.484539] RIP: 0010:folio_unlock+0x85/0xa0
    [ 9849.485076] Code: 89 df 31 f6 e8 1c f3 ff ff 48 8b 5d f8 c9 31 c0 31 d2 31 f6 31 ff c3 cc
    cc cc cc 48 c7 c6 80 6c d9 a7 48 89 df e8 4b b3 10 00 <0f> 0b 48 89 df e8 21 e6 2c 00 eb 9d 0f 1f 40 00 66 66 2e 0f 1f 84
    [ 9849.493818] RSP: 0018:ffff8881bb8076b0 EFLAGS: 00010246
    [ 9849.495740] RAX: 0000000000000000 RBX: ffffea00070c8980 RCX: 0000000000000000
    [ 9849.498678] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
    [ 9849.500559] RBP: ffff8881bb8076b8 R08: 0000000000000000 R09: 0000000000000000
    [ 9849.501097] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000010782000
    [ 9849.502108] R13: ffff8881935de738 R14: ffff88816110d010 R15: 0000000000001000
    [ 9849.502516] FS:  00007e36cbe94740(0000) GS:ffff88824a899000(0000) knlGS:0000000000000000
    [ 9849.502996] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [ 9849.503810] CR2: 000000c0002b0000 CR3: 000000011bbf6004 CR4: 0000000000772ef0
    [ 9849.504459] PKRU: 55555554
    [ 9849.504626] Call Trace:
    [ 9849.505242]  <TASK>
    [ 9849.505379]  netfs_write_begin+0x7c8/0x10a0
    [ 9849.505877]  ? __kasan_check_read+0x11/0x20
    [ 9849.506384]  ? __pfx_netfs_write_begin+0x10/0x10
    [ 9849.507178]  ceph_write_begin+0x8c/0x1c0
    [ 9849.507934]  generic_perform_write+0x391/0x8f0
    [ 9849.508503]  ? __pfx_generic_perform_write+0x10/0x10
    [ 9849.509062]  ? file_update_time_flags+0x19a/0x4b0
    [ 9849.509581]  ? ceph_get_caps+0x63/0xf0
    [ 9849.510259]  ? ceph_get_caps+0x63/0xf0
    [ 9849.510530]  ceph_write_iter+0xe79/0x1ae0
    [ 9849.511282]  ? __pfx_ceph_write_iter+0x10/0x10
    [ 9849.511839]  ? lock_acquire+0x1ad/0x310
    [ 9849.512334]  ? ksys_write+0xf9/0x230
    [ 9849.512582]  ? lock_is_held_type+0xaa/0x140
    [ 9849.513128]  vfs_write+0x512/0x1110
    [ 9849.513634]  ? __fget_files+0x33/0x350
    [ 9849.513893]  ? __pfx_vfs_write+0x10/0x10
    [ 9849.514143]  ? mutex_lock_nested+0x1b/0x30
    [ 9849.514394]  ksys_write+0xf9/0x230
    [ 9849.514621]  ? __pfx_ksys_write+0x10/0x10
    [ 9849.514887]  ? do_syscall_64+0x25e/0x1520
    [ 9849.515122]  ? __kasan_check_read+0x11/0x20
    [ 9849.515366]  ? trace_hardirqs_on_prepare+0x178/0x1c0
    [ 9849.515655]  __x64_sys_write+0x72/0xd0
    [ 9849.515885]  ? trace_hardirqs_on+0x24/0x1c0
    [ 9849.516130]  x64_sys_call+0x22f/0x2390
    [ 9849.516341]  do_syscall_64+0x12b/0x1520
    [ 9849.516545]  ? do_syscall_64+0x27c/0x1520
    [ 9849.516783]  ? do_syscall_64+0x27c/0x1520
    [ 9849.517003]  ? lock_release+0x318/0x480
    [ 9849.517220]  ? __x64_sys_io_getevents+0x143/0x2d0
    [ 9849.517479]  ? percpu_ref_put_many.constprop.0+0x8f/0x210
    [ 9849.517779]  ? entry_SYSCALL_64_after_hwframe+0x76/0x7e
    [ 9849.518073]  ? do_syscall_64+0x25e/0x1520
    [ 9849.518291]  ? __kasan_check_read+0x11/0x20
    [ 9849.518519]  ? trace_hardirqs_on_prepare+0x178/0x1c0
    [ 9849.518799]  ? do_syscall_64+0x27c/0x1520
    [ 9849.519024]  ? local_clock_noinstr+0xf/0x120
    [ 9849.519262]  ? entry_SYSCALL_64_after_hwframe+0x76/0x7e
    [ 9849.519544]  ? do_syscall_64+0x25e/0x1520
    [ 9849.519781]  ? __kasan_check_read+0x11/0x20
    [ 9849.520008]  ? trace_hardirqs_on_prepare+0x178/0x1c0
    [ 9849.520273]  ? do_syscall_64+0x27c/0x1520
    [ 9849.520491]  ? trace_hardirqs_on_prepare+0x178/0x1c0
    [ 9849.520767]  ? irqentry_exit+0x10c/0x6c0
    [ 9849.520984]  ? trace_hardirqs_off+0x86/0x1b0
    [ 9849.521224]  ? exc_page_fault+0xab/0x130
    [ 9849.521472]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
    [ 9849.521766] RIP: 0033:0x7e36cbd14907
    [ 9849.521989] Code: 10 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
    [ 9849.523057] RSP: 002b:00007ffff2d2a968 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
    [ 9849.523484] RAX: ffffffffffffffda RBX: 000000000000e549 RCX: 00007e36cbd14907
    [ 9849.523885] RDX: 000000000000e549 RSI: 00005bd797ec6370 RDI: 0000000000000004
    [ 9849.524277] RBP: 0000000000000004 R08: 0000000000000047 R09: 00005bd797ec6370
    [ 9849.524652] R10: 0000000000000078 R11: 0000000000000246 R12: 0000000000000049
    [ 9849.525062] R13: 0000000010781a37 R14: 00005bd797ec6370 R15: 0000000000000000
    [ 9849.525447]  </TASK>
    [ 9849.525574] Modules linked in: intel_rapl_msr intel_rapl_common intel_uncore_frequency_common intel_pmc_core pmt_telemetry pmt_discovery pmt_class intel_pmc_ssram_telemetry intel_vsec kvm_intel joydev kvm irqbypass ghash_clmulni_intel aesni_intel input_leds rapl mac_hid psmouse vga16fb serio_raw vgastate floppy i2c_piix4 bochs qemu_fw_cfg i2c_smbus pata_acpi sch_fq_codel rbd msr parport_pc ppdev lp parport efi_pstore
    [ 9849.529150] ---[ end trace 0000000000000000 ]---
    [ 9849.529502] RIP: 0010:folio_unlock+0x85/0xa0
    [ 9849.530813] Code: 89 df 31 f6 e8 1c f3 ff ff 48 8b 5d f8 c9 31 c0 31 d2 31 f6 31 ff c3 cc cc cc cc 48 c7 c6 80 6c d9 a7 48 89 df e8 4b b3 10 00 <0f> 0b 48 89 df e8 21 e6 2c 00 eb 9d 0f 1f 40 00 66 66 2e 0f 1f 84
    [ 9849.534986] RSP: 0018:ffff8881bb8076b0 EFLAGS: 00010246
    [ 9849.536198] RAX: 0000000000000000 RBX: ffffea00070c8980 RCX: 0000000000000000
    [ 9849.537718] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
    [ 9849.539321] RBP: ffff8881bb8076b8 R08: 0000000000000000 R09: 0000000000000000
    [ 9849.540862] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000010782000
    [ 9849.542438] R13: ffff8881935de738 R14: ffff88816110d010 R15: 0000000000001000
    [ 9849.543996] FS:  00007e36cbe94740(0000) GS:ffff88824b899000(0000) knlGS:0000000000000000
    [ 9849.545854] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [ 9849.547092] CR2: 00007e36cb3ff000 CR3: 000000011bbf6006 CR4: 0000000000772ef0
    [ 9849.548679] PKRU: 55555554
    
    The race sequence:
    1. Read completes -> netfs_read_collection() runs
    2. netfs_wake_rreq_flag(rreq, NETFS_RREQ_IN_PROGRESS, ...)
    3. netfs_wait_for_read() returns -EFAULT to netfs_write_begin()
    4. The netfs_unlock_abandoned_read_pages() unlocks the folio
    5. netfs_write_begin() calls folio_unlock(folio) -> VM_BUG_ON_FOLIO()
    
    The key reason of the issue that netfs_unlock_abandoned_read_pages()
    doesn't check the flag NETFS_RREQ_NO_UNLOCK_FOLIO and executes
    folio_unlock() unconditionally. This patch implements in
    netfs_unlock_abandoned_read_pages() logic similar to
    netfs_unlock_read_folio().
    
    Fixes: ee4cdf7ba857 ("netfs: Speed up buffered reading")
    Signed-off-by: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-8-dhowells@redhat.com
    Reviewed-by: Paulo Alcantara (Red Hat) <pc@manguebit.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    cc: Ceph Development <ceph-devel@vger.kernel.org>
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix write streaming disablement if fd open O_RDWR [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:53 2026 +0100

    netfs: Fix write streaming disablement if fd open O_RDWR
    
    [ Upstream commit 70a7b9193bbbfceaab5974de66834c64ccc875dd ]
    
    In netfs_perform_write(), "write streaming" (the caching of dirty data in
    dirty but !uptodate folios) is performed to avoid the need to read data
    that is just going to get immediately overwritten.  However, this is/will
    be disabled in three circumstances: if the fd is open O_RDWR, if fscache is
    in use (as we need to round out the blocks for DIO) or if content
    encryption is enabled (again for rounding out purposes).
    
    The idea behind disabling it if the fd is open O_RDWR is that we'd need to
    flush the write-streaming page before we could read the data, particularly
    through mmap.  But netfs now fills in the gaps if ->read_folio() is called
    on the page, so that is unnecessary.  Further, this doesn't actually work
    if a separate fd is open for reading.
    
    Fix this by removing the check for O_RDWR, thereby allowing streaming
    writes even when we might read.
    
    This caused a number of problems with the generic/522 xfstest, but those
    are now fixed.
    
    Fixes: c38f4e96e605 ("netfs: Provide func to copy data to pagecache for buffered write")
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-17-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

netfs: Fix zeropoint update where i_size > remote_i_size [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Tue May 12 13:33:43 2026 +0100

    netfs: Fix zeropoint update where i_size > remote_i_size
    
    [ Upstream commit 4543a4d737944134a1394afe797622546fbcc98a ]
    
    Fix the update of the zero point[*] by netfs_release_folio() when there is
    uncommitted data in the pagecache beyond the folio being released but the
    on-server EOF is in this folio (ie. i_size > remote_i_size).  The update
    needs to limit zero_point to remote_i_size, not i_size as i_size is a local
    phenomenon reflecting updates made locally to the pagecache, not stuff
    written to the server.  remote_i_size tracks the server's i_size.
    
    [*] The zero point is the file position from which we can assume that the
        server will just return zeros, so we can avoid generating reads.
    
    Note that netfs_invalidate_folio() probably doesn't need fixing as
    zero_point should be updated by setattr after truncation or fallocate.
    
    Found with:
    
        fsx -q -N 1000000 -p 10000 -o 128000 -l 600000 \
            /xfstest.test/junk --replay-ops=junk.fsxops
    
    using the following as junk.fsxops:
    
        truncate 0x0 0x1bbae 0x82864
        write 0x3ef2e 0xf9c8 0x1bbae
        write 0x67e05 0xcb5a 0x4e8f6
        mapread 0x57781 0x85b6 0x7495f
        copy_range 0x5d3d 0x10329 0x54fac 0x7495f
        write 0x64710 0x1c2b 0x7495f
        mapread 0x64000 0x1000 0x7495f
    
    on cifs with the default cache option.
    
    It shows read-gaps on folio 0x64 failing with a short read (ie. it hits
    EOF) if the FMODE_READ check is commented out in netfs_perform_write():
    
                    if (//(file->f_mode & FMODE_READ) ||
                        netfs_is_cache_enabled(ctx)) {
    
    and no fscache.  This was initially found with the generic/522 xfstest.
    
    Fixes: cce6bfa6ca0e ("netfs: Fix trimming of streaming-write folios in netfs_inval_folio()")
    Signed-off-by: David Howells <dhowells@redhat.com>
    Link: https://patch.msgid.link/20260512123404.719402-7-dhowells@redhat.com
    cc: Paulo Alcantara <pc@manguebit.org>
    cc: Matthew Wilcox <willy@infradead.org>
    cc: netfs@lists.linux.dev
    cc: linux-fsdevel@vger.kernel.org
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

NFSD: Fix infinite loop in layout state revocation [+ + +]

Author: Chuck Lever <chuck.lever@oracle.com>
Date:   Sun Apr 19 14:52:59 2026 -0400

    NFSD: Fix infinite loop in layout state revocation
    
    [ Upstream commit 4f8ef58c10bfe5f86a643c7c8331b37e69e3dae1 ]
    
    find_one_sb_stid() skips stids whose sc_status is non-zero, but the
    SC_TYPE_LAYOUT case in nfsd4_revoke_states() never sets sc_status
    before calling nfsd4_close_layout(). The retry loop therefore finds
    the same layout stid on every iteration, hanging the revoker
    indefinitely.
    
    Fixes: 1e33e1414bec ("nfsd: allow layout state to be admin-revoked.")
    Reported-by: Dai Ngo <dai.ngo@oracle.com>
    Reviewed-by: Jeff Layton <jlayton@kernel.org>
    Tested-by: Dai Ngo <dai.ngo@oracle.com>
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nsfs: fix wrong error code returned for pidns ioctls [+ + +]

Author: Zhihao Cheng <chengzhihao1@huawei.com>
Date:   Thu May 7 19:23:01 2026 +0800

    nsfs: fix wrong error code returned for pidns ioctls
    
    [ Upstream commit 725ecd80688bf3c57ca9205431f2c06174ff0756 ]
    
    When executing NS_GET_PID_FROM_PIDNS (or similar pidns ioctls), if the
    target task cannot be found in the corresponding pid_ns, the error code
    should be ESRCH instead of ENOTTY.
    
    This bug was introduced when the extensible ioctl handling was added.
    Without proper return, ret would be overwritten by the default case in
    the extensible ioctl switch statement.
    
    Fixes: a1d220d9dafa8 ("nsfs: iterate through mount namespaces")
    Signed-off-by: Zhihao Cheng <chengzhihao1@huawei.com>
    Link: https://patch.msgid.link/20260507112301.1042757-1-chengzhihao1@huawei.com
    Reviewed-by: Yang Erkun <yangerkun@huawei.com>
    Signed-off-by: Christian Brauner <brauner@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme-pci: fix dma mapping leak on data setup error [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Tue May 19 13:01:57 2026 -0700

    nvme-pci: fix dma mapping leak on data setup error
    
    [ Upstream commit 1bf86336e4b6cf40873fda47a7fe191446864937 ]
    
    We're leaking the initial DMA mapping during iteration if we fail to
    allocate the tracking descriptor for both PRP and SGL. Unmap the
    iterator directly; we can't use the existing unmap helper because it
    depends on the tracking descriptor being successfully allocated, so a
    new one for an in-use iterator is provided.
    
    The mappings were also leaking when the driver detects an invalid
    bio_vec when mapping PRPs, so fix that too.
    
    Fixes: b8b7570a7ec87 ("nvme-pci: fix dma unmapping when using PRPs and not using the IOVA mapping")
    Fixes: 7ce3c1dd78fca ("nvme-pci: convert the data mapping to blk_rq_dma_map")
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme-pci: fix dma_vecs leak on p2p memory [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Tue May 19 18:03:44 2026 -0700

    nvme-pci: fix dma_vecs leak on p2p memory
    
    [ Upstream commit 85686c72966c5ee637893f124ddb31a1cace7bee ]
    
    We don't unmap P2P memory, so we don't need to track it. The dma_vec
    allocation was getting leaked on the completion.
    
    Fixes: b8b7570a7ec87 ("nvme-pci: fix dma unmapping when using PRPs and not using the IOVA mapping")
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme-pci: fix use-after-free in nvme_free_host_mem() [+ + +]

Author: Chia-Lin Kao (AceLan) <acelan.kao@canonical.com>
Date:   Wed Apr 29 16:11:16 2026 +0800

    nvme-pci: fix use-after-free in nvme_free_host_mem()
    
    [ Upstream commit b35a13036755c5803168a7cb93bc66035c3e65b8 ]
    
    nvme_free_host_mem() frees dev->hmb_sgt via dma_free_noncontiguous()
    but never clears the pointer afterward.  This leads to a use-after-free
    if nvme_free_host_mem() is called twice in the same error path.
    
    This can happen during nvme_probe() when nvme_setup_host_mem() succeeds
    in allocating the HMB (setting dev->hmb_sgt) but nvme_set_host_mem()
    fails with an I/O error:
    
      nvme_setup_host_mem()
        nvme_alloc_host_mem_single()   -> sets dev->hmb_sgt
        nvme_set_host_mem()            -> fails with -EIO
        nvme_free_host_mem()           -> frees hmb_sgt, but does NOT NULL it
        return error
    
      nvme_probe() error path:
        nvme_free_host_mem()           -> dev->hmb_sgt is stale, use-after-free
    
    The second call dereferences the freed sgt, causing a NULL pointer
    dereference in iommu_dma_free_noncontiguous() when it accesses
    sgt->sgl->dma_address (the backing memory has been freed and zeroed).
    
    This is reproducible on Thunderbolt-attached NVMe devices (e.g., OWC
    Envoy Express behind a Dell WD22TB4 dock) where the device intermittently
    returns I/O errors during HMB setup due to PCIe link instability.
    
     BUG: kernel NULL pointer dereference, address: 0000000000000010
     RIP: 0010:iommu_dma_free_noncontiguous+0x22/0x80
     Call Trace:
      <TASK>
      dma_free_noncontiguous+0x3b/0x130
      nvme_free_host_mem+0x30/0xf0 [nvme]
      nvme_probe.cold+0xcc/0x275 [nvme]
      local_pci_probe+0x43/0xa0
      pci_device_probe+0xeea/0x290
      really_probe+0xf9/0x3b0
      __driver_probe_device+0x8b/0x170
      driver_probe_device+0x24/0xd0
      __driver_attach_async_helper+0x6b/0x110
      async_run_entry_fn+0x37/0x170
      process_one_work+0x1ac/0x3d0
      worker_thread+0x1b8/0x360
      kthread+0xf7/0x130
      ret_from_fork+0x2d8/0x3a0
      ret_from_fork_asm+0x1a/0x30
      </TASK>
    
    Fix this by setting dev->hmb_sgt to NULL after freeing it, so the
    second call takes the multi-descriptor path which safely handles the
    already-cleaned-up state.
    
    Fixes: 63a5c7a4b4c4 ("nvme-pci: use dma_alloc_noncontigous if possible")
    Signed-off-by: Chia-Lin Kao (AceLan) <acelan.kao@canonical.com>
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

nvme: fix bio leak on mapping failure [+ + +]

Author: Keith Busch <kbusch@kernel.org>
Date:   Wed May 6 06:16:02 2026 -0700

    nvme: fix bio leak on mapping failure
    
    [ Upstream commit 2279cd9c61a330e5de4d6eb0bc422820dd6fdf36 ]
    
    The local bio is always NULL, so we'd leak the bio if the integrity
    mapping failed. Just get it directly from the request.
    
    Fixes: d0d1d522316e91f ("blk-map: provide the bdev to bio if one exists")
    Reviewed-by: Sagi Grimberg <sagi@grimberg.me>
    Reviewed-by: John Garry <john.g.garry@oracle.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Keith Busch <kbusch@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

octeontx2-af: CGX: add bounds check to cgx_speed_mbps index [+ + +]

Author: Sam Daly <sam@samdaly.ie>
Date:   Wed May 13 18:42:53 2026 +0200

    octeontx2-af: CGX: add bounds check to cgx_speed_mbps index
    
    commit c0bf0a4f3f1f5f57aa83e1400ba4f56f0abfd542 upstream.
    
    cgx_speed_mbps has 13 elements but RESP_LINKSTAT_SPEED can yield values
    0-15. If it returns a value >= 13, this causes an out-of-bounds array
    access. Add a bounds check and default to speed 0 if the index is out of
    range.
    
    Fixes: 61071a871ea6 ("octeontx2-af: Forward CGX link notifications to PFs")
    Cc: Sunil Goutham <sgoutham@marvell.com>
    Cc: Linu Cherian <lcherian@marvell.com>
    Cc: Geetha sowjanya <gakula@marvell.com>
    Cc: hariprasad <hkelam@marvell.com>
    Cc: Subbaraya Sundeep <sbhatta@marvell.com>
    Cc: Andrew Lunn <andrew+netdev@lunn.ch>
    Cc: stable <stable@kernel.org>
    Signed-off-by: Sam Daly <sam@samdaly.ie>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
    Link: https://patch.msgid.link/2026051352-refined-demise-e88d@gregkh
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

octeontx2-af: npc: Fix allmulticast skip logic for LBK and SDP VFs [+ + +]

Author: Ratheesh Kannoth <rkannoth@marvell.com>
Date:   Wed May 20 10:00:36 2026 +0530

    octeontx2-af: npc: Fix allmulticast skip logic for LBK and SDP VFs
    
    [ Upstream commit 9eddc819f00b5b74bb4ac91396f80bd35f5f3561 ]
    
    When installing the allmulticast NPC rule, rvu_npc_install_allmulti_entry()
    should skip LBK and SDP VFs (only CGX PF/VF may add the entry).  The
    code combined is_lbk_vf() and is_sdp_vf() with logical AND, which is
    never true for a single pcifunc, so the intended early return never ran.
    
    Use logical OR instead.
    
    Cc: Geetha sowjanya <gakula@marvell.com>
    Fixes: ae703539f49d2 ("octeontx2-af: Cleanup loopback device checks")
    Signed-off-by: Ratheesh Kannoth <rkannoth@marvell.com>
    Link: https://patch.msgid.link/20260520043036.1523798-1-rkannoth@marvell.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

octeontx2-pf: avoid double free of pool->stack on AQ init failure [+ + +]

Author: Dawei Feng <dawei.feng@seu.edu.cn>
Date:   Fri May 15 23:18:26 2026 +0800

    octeontx2-pf: avoid double free of pool->stack on AQ init failure
    
    commit 9b244c242bec48b37e82b89787afd6a4c43457e1 upstream.
    
    otx2_pool_aq_init() frees pool->stack when mailbox sync or retry
    allocation fails, but leaves the pointer unchanged. Later,
    otx2_sq_aura_pool_init() unwinds the partial setup through
    otx2_aura_pool_free(), which frees pool->stack again. The CN20K-specific
    cn20k_pool_aq_init() implementation has the same bug in
    its corresponding error path.
    
    Set pool->stack to NULL immediately after the local free so the shared
    cleanup path does not free the same stack again while cleaning up
    partially initialized pool state.
    
    The bug was first flagged by an experimental analysis tool we are
    developing for kernel memory-management bugs while analyzing
    v6.13-rc1. The tool is still under development and is not yet publicly
    available. Manual inspection confirms that the bug is still present in
    v7.1-rc3.
    
    Runtime validation was not performed because reproducing this path
    requires OcteonTX2/CN20K hardware.
    
    Fixes: caa2da34fd25 ("octeontx2-pf: Initialize and config queues")
    Fixes: d322fbd17203 ("octeontx2-pf: Initialize cn20k specific aura and pool contexts")
    Cc: stable@vger.kernel.org
    Signed-off-by: Zilin Guan <zilin@seu.edu.cn>
    Signed-off-by: Dawei Feng <dawei.feng@seu.edu.cn>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20260515151826.1005397-1-dawei.feng@seu.edu.cn
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

octeontx2-pf: fix double free in rvu_rep_rsrc_init() [+ + +]

Author: Dawei Feng <dawei.feng@seu.edu.cn>
Date:   Wed May 13 23:13:20 2026 +0800

    octeontx2-pf: fix double free in rvu_rep_rsrc_init()
    
    commit e8fb3de2a8effcaf62bec2c56b93d8bb480371d1 upstream.
    
    rvu_rep_rsrc_init() allocates queue memory before calling
    otx2_init_hw_resources(). When hardware resource setup fails,
    otx2_init_hw_resources() already unwinds the partially initialized
    SQ, CQ, and aura state before returning an error. The representor
    error path then calls otx2_free_hw_resources() again and can free
    the same resources a second time.
    
    Fix this by splitting the cleanup labels so that a failure from
    otx2_init_hw_resources() only releases queue memory. Keep the
    otx2_free_hw_resources() call for failures that happen after
    hardware resource initialization completed successfully.
    
    The bug was first flagged by an experimental analysis tool we are
    developing for kernel memory-management bugs while analyzing
    v6.13-rc1. The tool is still under development and is not yet publicly
    available. Manual inspection confirms that the bug is still
    present in v7.1-rc3.
    
    Runtime validation was not performed because reproducing this path
    requires OcteonTX2 representor hardware.
    
    Fixes: 3937b7308d4f ("octeontx2-pf: Create representor netdev")
    Cc: stable@vger.kernel.org # v6.13+
    Signed-off-by: Zilin Guan <zilin@seu.edu.cn>
    Signed-off-by: Dawei Feng <dawei.feng@seu.edu.cn>
    Reviewed-by: Geetha sowjanya <gakula@marvell.com>
    Link: https://patch.msgid.link/20260513151320.213260-1-dawei.feng@seu.edu.cn
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ovpn: disable BHs when updating device stats [+ + +]

Author: Ralf Lici <ralf@mandelbit.com>
Date:   Wed May 13 15:26:10 2026 +0200

    ovpn: disable BHs when updating device stats
    
    [ Upstream commit 0c0dddc07d272a8d25922e48041e8e4d2434df7e ]
    
    ovpn updates dev->dstats from both process and softirq contexts. In
    particular, TCP paths may run from socket callbacks, workqueues or
    strparser work, while UDP receive and ovpn's ndo_start_xmit path may
    update the same per-device dstats from BH context.
    
    Add ovpn device drop-stat helpers that disable BHs around
    dev_dstats_rx_dropped() and dev_dstats_tx_dropped(), and use them for
    drop accounting.
    
    The successful RX dev_dstats_rx_add() update is already covered by the
    BH-disabled section around gro_cells_receive(). For the successful TCP
    TX dev_dstats_tx_add() update, replace the existing preempt-disabled
    section with a BH-disabled one.
    
    Fixes: 11851cbd60ea ("ovpn: implement TCP transport")
    Signed-off-by: Ralf Lici <ralf@mandelbit.com>
    Signed-off-by: Antonio Quartulli <antonio@openvpn.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ovpn: fix race between deleting interface and adding new peer [+ + +]

Author: Antonio Quartulli <antonio@openvpn.net>
Date:   Tue Mar 17 14:47:56 2026 +0100

    ovpn: fix race between deleting interface and adding new peer
    
    [ Upstream commit 982422b11e6f95f766a8cd2c2b1cbdb77e234a61 ]
    
    While deleting an existing ovpn interface, there is a very
    narrow window where adding a new peer via netlink may cause
    the netdevice to hang and prevent its unregistration.
    
    It may happen during ovpn_dellink(), when all existing peers are
    freed and the device is queued for deregistration, but a
    CMD_PEER_NEW message comes in adding a new peer that takes again
    a reference to the netdev.
    
    At this point there is no way to release the device because we are
    under the assumption that all peers were already released.
    
    Fix the race condition by releasing all peers in ndo_uninit(),
    when the netdevice has already been removed from the netdev
    list.
    
    Also ovpn_peer_add() has now an extra check that forces the
    function to bail out if the device reg_state is not REGISTERED.
    This way any incoming CMD_PEER_NEW racing with the interface
    deletion routine will simply stop before adding the peer.
    
    Note that the above check happens while holding the netdev_lock
    to prevent racing netdev state changes.
    
    ovpn_dellink() is now empty and can be removed.
    
    Reported-by: Hyunwoo Kim <imv4bel@gmail.com>
    Closes: https://lore.kernel.org/netdev/aaVgJ16edTfQkYbx@v4bel/
    Suggested-by: Sabrina Dubroca <sd@queasysnail.net>
    Fixes: 80747caef33d ("ovpn: introduce the ovpn_peer object")
    Reviewed-by: Sabrina Dubroca <sd@queasysnail.net>
    Signed-off-by: Antonio Quartulli <antonio@openvpn.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ovpn: respect peer refcount in CMD_NEW_PEER error path [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Wed May 13 11:55:21 2026 +0100

    ovpn: respect peer refcount in CMD_NEW_PEER error path
    
    [ Upstream commit 1fef6614673ff0846d30acdeeaf3cf98bb5f6116 ]
    
    ovpn_nl_peer_new_doit()'s error path calls ovpn_peer_release() directly
    rather than ovpn_peer_put(), bypassing the kref. The accompanying
    comment ("peer was not yet hashed, thus it is not used in any context")
    holds for UDP but not for TCP.
    
    For UDP, the ovpn_socket union uses the .ovpn arm and never points back
    at a peer; UDP encap_recv looks up peers via the not-yet-populated
    hashtables, so the new peer is unreachable until ovpn_peer_add()
    publishes it.
    
    For TCP, ovpn_socket_new() sets ovpn_sock->peer and
    ovpn_tcp_socket_attach() publishes ovpn_sock via rcu_assign_sk_user_data().
    From that moment until ovpn_socket_release() detaches in the error path,
    the TCP fd is fully wired: userspace recvmsg / sendmsg / close / poll
    on the fd, as well as the strparser-driven ovpn_tcp_rcv() path, can
    reach the peer through sk_user_data -> ovpn_sock->peer and bump its
    refcount via ovpn_peer_hold().
    
    ovpn_tcp_socket_wait_finish() (called inside ovpn_socket_release())
    drains strparser and the tx work, but does not synchronize with
    userspace syscall callers that already hold a peer reference. If
    ovpn_nl_peer_modify() or ovpn_peer_add() returns an error while such
    a caller is in flight - notably an ovpn_tcp_recvmsg() blocked in
    __skb_recv_datagram() on peer->tcp.user_queue - the direct
    ovpn_peer_release() destroys the peer while the caller still holds
    the reference, and the eventual ovpn_peer_put() from that caller
    operates on freed memory.
    
    Replace the direct destructor call with ovpn_peer_put() so the kref
    correctly defers destruction until the last reference is dropped.
    In the common case where no concurrent user is present, behaviour is
    unchanged: the kref hits zero immediately and ovpn_peer_release_kref()
    runs the same destructor.
    
    With this conversion ovpn_peer_release() has no callers outside peer.c
    - ovpn_peer_release_kref() in the same translation unit is the only
    remaining user - so make it static and drop its declaration from
    peer.h.
    
    Fixes: 11851cbd60ea ("ovpn: implement TCP transport")
    Reviewed-by: Sabrina Dubroca <sd@queasysnail.net>
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Signed-off-by: Antonio Quartulli <antonio@openvpn.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

ovpn: tcp - use cached peer pointer in ovpn_tcp_close() [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Wed May 13 11:55:20 2026 +0100

    ovpn: tcp - use cached peer pointer in ovpn_tcp_close()
    
    [ Upstream commit 775d8d7ad02aa345e1588424a6a8b9ae49fb9012 ]
    
    ovpn_tcp_close() loads the ovpn_socket via rcu_dereference_sk_user_data()
    under rcu_read_lock(), takes a reference on sock->peer, caches the peer
    pointer in a local, and drops the read lock. It then passes sock->peer
    (rather than the cached local) to ovpn_peer_del(), re-dereferencing the
    ovpn_socket after the RCU read section has ended.
    
    Unlike ovpn_tcp_sendmsg(), which uses the same "load under RCU, use
    after unlock" pattern but is protected by lock_sock() held across the
    function, ovpn_tcp_close() runs without the socket lock: inet_release()
    invokes sk_prot->close() without taking lock_sock first.
    
    ovpn_socket_release() can therefore complete its kref_put -> detach ->
    synchronize_rcu -> kfree(sock) sequence concurrently, in the window
    after ovpn_tcp_close() drops rcu_read_lock() but before it dereferences
    sock->peer. The synchronize_rcu() in ovpn_socket_release() protects
    readers that use the dereferenced pointer inside the RCU read section,
    not those that escape the pointer to a local and use it afterwards.
    
    A reproducer follows the pattern of commit 94560267d6c4 ("ovpn: tcp -
    don't deref NULL sk_socket member after tcp_close()"): trigger a peer
    removal (keepalive expiration or netlink OVPN_CMD_DEL_PEER) at the same
    moment userspace closes the TCP fd. That commit fixed the detach-side
    of the same race window; this one fixes the close-side at a different
    victim.
    
    Tighten the entry block to read sock->peer exactly once into the cached
    peer local, and route all subsequent uses (the hold check, the
    ovpn_peer_del() call, and the prot->close() invocation) through that
    local. sock->peer is only ever written once in ovpn_socket_new() under
    lock_sock(), before rcu_assign_sk_user_data() publishes the ovpn_socket,
    and is never reassigned afterwards - but the previous multi-read pattern
    made that invariant implicit rather than explicit. The same multi-read
    shape exists in ovpn_tcp_recvmsg(), ovpn_tcp_sendmsg(),
    ovpn_tcp_data_ready() and ovpn_tcp_write_space(); those will be cleaned
    up via a dedicated helper in a follow-up net-next series.
    
    Fixes: 11851cbd60ea ("ovpn: implement TCP transport")
    Reviewed-by: Sabrina Dubroca <sd@queasysnail.net>
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Signed-off-by: Antonio Quartulli <antonio@openvpn.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pds_core: ensure null-termination for firmware version strings [+ + +]

Author: Nikhil P. Rao <nikhil.rao@amd.com>
Date:   Wed May 20 20:58:42 2026 +0000

    pds_core: ensure null-termination for firmware version strings
    
    [ Upstream commit 3d4432d34c1992701289cbe12df9fd024f315998 ]
    
    The driver passes fw_version directly to devlink_info_version_stored_put()
    without ensuring null-termination. While current firmware null-terminates
    these strings, the driver should not rely on this behavior. Add explicit
    null-termination to prevent potential issues if firmware behavior changes.
    
    Fixes: 45d76f492938 ("pds_core: set up device and adminq")
    Signed-off-by: Nikhil P. Rao <nikhil.rao@amd.com>
    Link: https://patch.msgid.link/20260520205842.1486718-1-nikhil.rao@amd.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pds_core: fix debugfs_lookup dentry leak and error handling [+ + +]

Author: Nikhil P. Rao <nikhil.rao@amd.com>
Date:   Fri May 15 21:29:07 2026 +0000

    pds_core: fix debugfs_lookup dentry leak and error handling
    
    [ Upstream commit dc416e32baaeb620b9809e9e25fc7b30889686e9 ]
    
    debugfs_lookup() returns a dentry with an elevated reference count that
    must be released with dput(). The current code discards the returned
    dentry without calling dput(), causing a reference leak on every
    firmware reset recovery.
    
    Additionally, when CONFIG_DEBUG_FS is disabled, debugfs_lookup()
    returns ERR_PTR(-ENODEV), not NULL. The current check passes for error
    pointers and would call dput() on an invalid pointer, causing a crash.
    
    Fixes: bc90fbe0c318 ("pds_core: Rework teardown/setup flow to be more common")
    Signed-off-by: Nikhil P. Rao <nikhil.rao@amd.com>
    Link: https://patch.msgid.link/20260515212907.998028-3-nikhil.rao@amd.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pds_core: fix error handling in pdsc_devcmd_wait [+ + +]

Author: Nikhil P. Rao <nikhil.rao@amd.com>
Date:   Fri May 15 21:29:05 2026 +0000

    pds_core: fix error handling in pdsc_devcmd_wait
    
    [ Upstream commit 0e46b6635b03d29807f810c3b415c4755a3f958d ]
    
    Fix two cases where pdsc_devcmd_wait() returns stale success from
    the completion register instead of an error:
    
    1. FW crash: If firmware stops running, the wait loop breaks early with
       running=false. The condition "if ((!done || timeout) && running)" is
       false, so error handling is bypassed and stale status is returned.
       Check !running first and return -ENXIO.
    
    2. Timeout: If a command times out, err is set to -ETIMEDOUT but then
       overwritten by pdsc_err_to_errno(status) which reads stale status.
       Return -ETIMEDOUT immediately after cleaning up.
    
    Both errors now propagate to pdsc_devcmd_locked() which queues
    health_work for recovery.
    
    Fixes: 45d76f492938 ("pds_core: set up device and adminq")
    Signed-off-by: Nikhil P. Rao <nikhil.rao@amd.com>
    Link: https://patch.msgid.link/20260515212907.998028-1-nikhil.rao@amd.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phonet/pep: disable BH around forwarded sk_receive_skb() [+ + +]

Author: Zijing Yin <yzjaurora@gmail.com>
Date:   Tue May 19 10:26:33 2026 -0700

    phonet/pep: disable BH around forwarded sk_receive_skb()
    
    commit dbc81608e3a653dea6cf403f20cae35468b8ab9c upstream.
    
    The networking receive path is usually run from softirq context, but
    protocols that take the socket lock may have packets stored in the
    backlog and processed later from process context. In that case
    release_sock() -> __release_sock() drops the slock with spin_unlock_bh()
    and then calls sk->sk_backlog_rcv() with bottom halves enabled.
    
    Typical sk_backlog_rcv handlers process the socket whose backlog is
    being drained, so the BH state at entry is irrelevant for the slocks
    they touch. pep_do_rcv() is different: when the inbound skb targets an
    existing PEP pipe, it forwards the skb to a different *child* socket
    via sk_receive_skb(). That helper takes the child slock with
    bh_lock_sock_nested(), which is just spin_lock_nested() and assumes BH
    is already off. The same child slock therefore ends up acquired with
    BH on (process path) and with BH off (softirq path):
    
      process context                   softirq context
      ---------------                   ---------------
      release_sock(listener)            __netif_receive_skb()
       __release_sock()                  phonet_rcv()
        spin_unlock_bh()                  __sk_receive_skb(listener)
        [BH now ENABLED]                  [BH already disabled]
        sk_backlog_rcv:                   sk_backlog_rcv:
         pep_do_rcv()                      pep_do_rcv()
          sk_receive_skb(child)             sk_receive_skb(child)
           bh_lock_sock_nested(child)        bh_lock_sock_nested(child)
           => SOFTIRQ-ON-W                   => IN-SOFTIRQ-W
    
    Lockdep flags this as inconsistent lock state, and it can become a real
    self-deadlock if a softirq on the same CPU tries to receive to the same
    child socket while its slock is held in the BH-enabled path:
    
      WARNING: inconsistent lock state
      inconsistent {SOFTIRQ-ON-W} -> {IN-SOFTIRQ-W} usage.
       (slock-AF_PHONET/1){+.?.}-{3:3}, at: __sk_receive_skb+0x1cf/0x900
        __sk_receive_skb              net/core/sock.c:563
        sk_receive_skb                include/net/sock.h:2022 [inline]
        pep_do_rcv                    net/phonet/pep.c:675
        sk_backlog_rcv                include/net/sock.h:1190
        __release_sock                net/core/sock.c:3216
        release_sock                  net/core/sock.c:3815
        pep_sock_accept               net/phonet/pep.c:879
    
    Wrap the forwarded sk_receive_skb() in local_bh_disable() /
    local_bh_enable() so the child slock is always acquired with BH off.
    local_bh_disable() nests safely on the softirq path.
    
    Discovered via in-house syzkaller fuzzing; the same root cause also
    on the linux-6.1.y syzbot dashboard as extid 44f0626dd6284f02663c.
    Reproduced under KASAN + LOCKDEP + PROVE_LOCKING, reproducer:
    https://pastebin.com/A3t8xzCR
    
    Fixes: 9641458d3ec4 ("Phonet: Pipe End Point for Phonet Pipes protocol")
    Link: https://syzkaller.appspot.com/bug?extid=44f0626dd6284f02663c
    Cc: stable@vger.kernel.org
    Signed-off-by: Zijing Yin <yzjaurora@gmail.com>
    Acked-by: Rémi Denis-Courmont <remi@remlab.net>
    Reported-by: syzbot+9f4a135646b66c509935@syzkaller.appspotmail.com
    Reviewed-by: Eric Dumazet <edumazet@google.com>
    Link: https://patch.msgid.link/20260519172635.86304-1-yzjaurora@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: apple: atc: Fix typec switch/mux leak on unbind [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Fri May 8 21:19:58 2026 +0100

    phy: apple: atc: Fix typec switch/mux leak on unbind
    
    [ Upstream commit 1854082fe0ddb81bc93d1f8e8a00554217fd09d1 ]
    
    atcphy_probe_switch() and atcphy_probe_mux() discard the pointers
    returned by typec_switch_register() and typec_mux_register(). The
    platform driver has no .remove callback, so when the driver unbinds
    (e.g. via sysfs unbind) neither typec_switch_unregister() nor
    typec_mux_unregister() is called. The framework reference taken in
    typec_switch_register() (device_initialize() + device_add() in
    drivers/usb/typec/mux.c) is therefore never dropped and the
    typec_switch_dev / typec_mux_dev objects stay live forever, with
    their sysfs entries under the typec_mux class also left behind. A
    subsequent rebind cannot recreate them with the same fwnode-derived
    name.
    
    Save the registered handles and unregister them through
    devm_add_action_or_reset() so framework registration is torn down
    in step with the driver's other devm-managed state. While here,
    drop struct apple_atcphy::sw and ::mux: they were declared with the
    consumer-side types (typec_switch *, typec_mux *) instead of the
    provider-side types and were never assigned.
    
    Scope of the fix
    ================
    This patch fixes the registration leak only. It does not close the
    use-after-free window that arises when a consumer that obtained a
    reference via fwnode_typec_switch_get() / fwnode_typec_mux_get()
    outlives the provider unbind: such consumers keep the underlying
    typec_switch_dev / typec_mux_dev alive past device_unregister(),
    and a later typec_switch_set() / typec_mux_set() still invokes the
    registered atcphy_sw_set() / atcphy_mux_set(), which dereferences
    the freed apple_atcphy through typec_{switch,mux}_get_drvdata().
    
    On Apple Silicon the relevant consumers are the typec port and the
    cd321x controller registered by drivers/usb/typec/tipd/core.c.
    Cable plug / orientation events and alt-mode transitions trigger
    the .set callbacks via:
    
      tps6598x_interrupt()                 drivers/usb/typec/tipd/core.c
        tps6598x_handle_plug_event()
          tps6598x_connect()/_disconnect()
            typec_set_orientation()        drivers/usb/typec/class.c
              typec_switch_set(port->sw)   drivers/usb/typec/mux.c
                atcphy_sw_set()            drivers/phy/apple/atc.c
    
      cd321x_update_work()                 drivers/usb/typec/tipd/core.c
        cd321x_typec_update_mode()
          typec_mux_set(cd321x->mux)       drivers/usb/typec/mux.c
            atcphy_mux_set()               drivers/phy/apple/atc.c
    
    Closing that window requires framework support for invalidating
    consumer-held references on provider unbind. The same
    consumer-survives-provider pattern has been discussed for the PHY
    framework [1] and is out of scope here.
    
    [1] https://lore.kernel.org/linux-phy/aZejMSJ9qqRWb2pX@google.com/
    
    Fixes: 8e98ca1e74db ("phy: apple: Add Apple Type-C PHY")
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Reviewed-by: Vladimir Oltean <olteanv@gmail.com>
    Tested-by: Joshua Peisach <jpeisach@ubuntu.com>
    Link: https://lkml.kernel.org/r/6ec1ed08328340db42655287afd5fa4067316b11.camel@perches.com
    Link: https://patch.msgid.link/20260508201958.30060-1-devnexen@gmail.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: exynos5-usbdrd: fix USB 2.0 HS PHY tuning values for Exynos7870 [+ + +]

Author: Łukasz Lebiedziński <kernel@lvkasz.us>
Date:   Mon Apr 6 15:56:27 2026 +0200

    phy: exynos5-usbdrd: fix USB 2.0 HS PHY tuning values for Exynos7870
    
    commit 5a759b120e31aa3ed914d98b51eb1755235250f2 upstream.
    
    The existing PHYPARAM0 tuning values for Exynos7870 are incorrect,
    causing the USB 2.0 PHY to fail high-speed negotiation and fall back
    to full-speed (12Mbps) operation.
    
    Fix TXVREFTUNE (transmitter voltage reference) from 14 to 3,
    TXRESTUNE (transmitter impedance) from 3 to 2, and SQRXTUNE
    (squelch threshold) from 6 to 5. Also explicitly set
    TXPREEMPPULSETUNE to 0, which was previously missing from the
    tuning table despite being included in the register mask.
    
    All values are derived from the vendor kernel for the Samsung
    Galaxy A6 (SM-A600FN), as no public hardware documentation is
    available for the Exynos7870 USB DRD PHY. With these corrections,
    the PHY successfully negotiates high-speed (480Mbps) operation.
    
    Fixes: 588d5d20ca8d ("phy: exynos5-usbdrd: add exynos7870 USBDRD support")
    Cc: stable@vger.kernel.org
    Tested-by: Kaustabh Chakraborty <kauschluss@disroot.org>
    Reviewed-by: Krzysztof Kozlowski <krzysztof.kozlowski@oss.qualcomm.com>
    Signed-off-by: Łukasz Lebiedziński <kernel@lvkasz.us>
    Link: https://patch.msgid.link/20260406135627.234835-1-kernel@lvkasz.us
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: marvell: mvebu-a3700-utmi: fix incorrect USB2_PHY_CTRL register access [+ + +]

Author: Gabor Juhos <j4g8y7@gmail.com>
Date:   Sat Mar 21 15:42:32 2026 +0100

    phy: marvell: mvebu-a3700-utmi: fix incorrect USB2_PHY_CTRL register access
    
    [ Upstream commit 91ddf6f722084383fb05be731c0107814b055c0c ]
    
    The mvebu_a3700_utmi_phy_power_off() function tries to modify the
    USB2_PHY_CTRL register by using the IO address of the PHY IP block along
    with the readl/writel IO accessors. However, the register exist in the
    USB miscellaneous register space, and as such it must be accessed via
    regmap like it is done in the mvebu_a3700_utmi_phy_power_on() function.
    
    Change the code to use regmap_update_bits() for modífying the register
    to fix this.
    
    Fixes: cc8b7a0ae866 ("phy: add A3700 UTMI PHY driver")
    Signed-off-by: Gabor Juhos <j4g8y7@gmail.com>
    Reviewed-by: Miquel Raynal <miquel.raynal@bootlin.com>
    Link: https://patch.msgid.link/20260321-a3700-utmi-fix-usb2_phy_ctrl-access-v1-1-6005ff4b5058@gmail.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: qcom-qmp-ufs: Fix kaanapali PHY PLL lock failure after SM8650 G4 fix [+ + +]

Author: Nitin Rawat <nitin.rawat@oss.qualcomm.com>
Date:   Wed Apr 15 16:18:51 2026 +0530

    phy: qcom-qmp-ufs: Fix kaanapali PHY PLL lock failure after SM8650 G4 fix
    
    commit 80305760d7a55b884fb9023c490b75568d1ea0b1 upstream.
    
    Commit 81af9e40e2e4 ("phy: qcom: qmp-ufs: Fix SM8650 PCS table for Gear 4")
    moved QPHY_V6_PCS_UFS_PLL_CNTL register configuration from the shared
    sm8650_ufsphy_g5_pcs table to the SM8650-specific sm8650_ufsphy_pcs base
    table to fix Gear 4 operation on SM8650.
    
    However, this change inadvertently broke kaanapali and SM8750 SoCs
    which also rely on the shared sm8650_ufsphy_g5_pcs table for Gear 5
    configuration but use their own sm8750_ufsphy_pcs base table. After the
    change, kaanapali PHYs are left without the required PLL_CNTL = 0x33
    setting, causing the PHY PLL to remain at its hardware reset default
    value, preventing PLL lock and resulting in DME_LINKSTARTUP timeouts.
    
    Fix this by adding the missing QPHY_V6_PCS_UFS_PLL_CNTL = 0x33 entry
    to the sm8750_ufsphy_pcs table, mirroring what the original commit
    already did for sm8650_ufsphy_pcs.
    
    Cc: stable@vger.kernel.org # v6.19.12
    Fixes: 81af9e40e2e4 ("phy: qcom: qmp-ufs: Fix SM8650 PCS table for Gear 4")
    Signed-off-by: Nitin Rawat <nitin.rawat@oss.qualcomm.com>
    Reviewed-by: Abel Vesa <abel.vesa@oss.qualcomm.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Reviewed-by: Manivannan Sadhasivam <mani@kernel.org>
    Link: https://patch.msgid.link/20260415104851.2763238-1-nitin.rawat@oss.qualcomm.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: qcom: edp: Add eDP/DP mode switch support [+ + +]

Author: Yongxing Mou <yongxing.mou@oss.qualcomm.com>
Date:   Mon Apr 27 14:35:20 2026 +0800

    phy: qcom: edp: Add eDP/DP mode switch support
    
    commit 3011c365a329cf2db6d55e8d684550dc88350436 upstream.
    
    The eDP PHY supports both eDP/DP modes, each requiring a different
    swing/pre-emphasis table. However, the driver currently uses a fixed
    static table for eDP programming rather than selecting the appropriate
    table based on the current mode. Add separate tables for eDP and DP
    modes, and select the appropriate table dynamically based on the
    current mode.
    
    Glymur's DP mode table differs from the other platforms, add a
    dedicated table for it.
    
    This also fixes the table mismatch for X1E80100 (eDP) and SA8775P (DP).
    
    Cc: stable@vger.kernel.org
    Fixes: 3f12bf16213c ("phy: qcom: edp: Add support for eDP PHY on SA8775P")
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Signed-off-by: Yongxing Mou <yongxing.mou@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260427-edp_phy-v5-2-3bb876824475@oss.qualcomm.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: qcom: edp: Fix AUX_CFG8 programming for DP mode [+ + +]

Author: Yongxing Mou <yongxing.mou@oss.qualcomm.com>
Date:   Mon Apr 27 14:35:22 2026 +0800

    phy: qcom: edp: Fix AUX_CFG8 programming for DP mode
    
    commit bf237a9fcbbf9d658522f7315ffc04bf2d49be42 upstream.
    
    AUX_CFG8 depends on whether the PHY is operating in eDP or DP mode, not
    the selected swing/pre-emphasis table. All supported platforms already
    have the proper tables, so remove the unnecessary check.
    
    Cc: stable@vger.kernel.org
    Fixes: 6078b8ce070c ("phy: qcom: edp: Add set_mode op for configuring eDP/DP submode")
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Yongxing Mou <yongxing.mou@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260427-edp_phy-v5-4-3bb876824475@oss.qualcomm.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: qcom: edp: Unify generic DP/eDP swing and pre-emphasis tables [+ + +]

Author: Yongxing Mou <yongxing.mou@oss.qualcomm.com>
Date:   Mon Apr 27 14:35:19 2026 +0800

    phy: qcom: edp: Unify generic DP/eDP swing and pre-emphasis tables
    
    commit fd672888cccd6b855154efe0ac78e7ce3e8ab088 upstream.
    
    The current eDP and DP swing/pre-emphasis tables do not match the HPG
    requirements for the supported platforms, correct the table accordingly.
    
    The generic tables which can be shared as follows:
    
    DP mode：
            -sa8775p/sc7280/sc8280xp/x1e80100
            -glymur
            -sc8180x
    eDP mode(low vdiff):
            -glymur/sa8775p/sc8280xp/x1e80100
            -sc7280
            -sc8180x
    
    The proper tables for SC8180X and SC7280 will be added in a later patch,
    since they need separate table.
    
    Cc: stable@vger.kernel.org
    Fixes: f199223cb490 ("phy: qcom: Introduce new eDP PHY driver")
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Yongxing Mou <yongxing.mou@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260427-edp_phy-v5-1-3bb876824475@oss.qualcomm.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

phy: qcom: qmp-usbc: Fix out-of-bounds array access in dp swing config [+ + +]

Author: Xiangxu Yin <xiangxu.yin@oss.qualcomm.com>
Date:   Fri Feb 27 20:15:01 2026 +0800

    phy: qcom: qmp-usbc: Fix out-of-bounds array access in dp swing config
    
    [ Upstream commit ea17fc4d7dc2ba6459b1a318962960520201baf1 ]
    
    swing_tbl and pre_emphasis_tbl are 4x4 arrays (valid indices 0-3), but
    the boundary check uses "> 4" instead of ">= 4", allowing index 4 to
    cause an out-of-bounds access.
    
    Reported-by: Dan Carpenter <dan.carpenter@linaro.org>
    Fixes: 81791c45c8e0 ("phy: qcom: qmp-usbc: Add QCS615 USB/DP PHY config and DP mode support")
    Signed-off-by: Xiangxu Yin <xiangxu.yin@oss.qualcomm.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260227-master-v1-1-8d91b9407fdb@oss.qualcomm.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: spacemit: Remove incorrect clk_disable() in spacemit_usb2phy_init() [+ + +]

Author: Felix Gu <ustc.gu@gmail.com>
Date:   Thu Mar 26 00:23:58 2026 +0800

    phy: spacemit: Remove incorrect clk_disable() in spacemit_usb2phy_init()
    
    [ Upstream commit a4058c09dd6e28ec33316fd6eb45ddae4cab1f31 ]
    
    When clk_enable() fails, the clock was never enabled. Calling
    clk_disable() in this error path is incorrect.
    
    Remove the spurious clk_disable() call from the error handling
    in spacemit_usb2phy_init().
    
    Fixes: fe4bc1a08638 ("phy: spacemit: support K1 USB2.0 PHY controller")
    Signed-off-by: Felix Gu <ustc.gu@gmail.com>
    Reviewed-by: Ze Huang <huang.ze@linux.dev>
    Link: https://patch.msgid.link/20260326-k1-usb3-v1-1-0c2b6adf5185@gmail.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

phy: tegra: xusb: Fix per-pad high-speed termination calibration [+ + +]

Author: Wayne Chang <waynec@nvidia.com>
Date:   Mon May 4 11:33:05 2026 +0800

    phy: tegra: xusb: Fix per-pad high-speed termination calibration
    
    commit da110228b54f2e2143d97ea7151e0dc22e539d67 upstream.
    
    The existing code reads a single hs_term_range_adj value from bit field
    [10:7] of FUSE_SKU_CALIB_0 and applies it to all USB2 pads uniformly.
    However, on SoCs that support per-pad termination, each pad has its own
    hs_term_range_adj field: pad 0 in FUSE_SKU_CALIB_0[10:7], and pads 1-3
    in FUSE_USB_CALIB_EXT_0 at bit offsets [8:5], [12:9], and [16:13]
    respectively.
    
    Fix the calibration by reading per-pad values from the appropriate fuse
    registers. For SoCs that do not support per-pad termination, replicate
    pad 0's value to all pads to maintain existing behavior.
    
    Add a has_per_pad_term flag to the SoC data to indicate whether per-pad
    termination values are available in FUSE_USB_CALIB_EXT_0.
    
    Fixes: 1ef535c6ba8e ("phy: tegra: xusb: Add Tegra194 support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Wayne Chang <waynec@nvidia.com>
    Signed-off-by: Wei-Cheng Chen <weichengc@nvidia.com>
    Reviewed-by: Jon Hunter <jonathanh@nvidia.com>
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Link: https://patch.msgid.link/20260504033305.2283145-1-weichengc@nvidia.com
    Signed-off-by: Vinod Koul <vkoul@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

pinctrl: mediatek: moore: implement gpio_chip::get_direction() [+ + +]

Author: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
Date:   Fri Apr 10 09:09:35 2026 +0200

    pinctrl: mediatek: moore: implement gpio_chip::get_direction()
    
    [ Upstream commit b560d414239232c6ed7205d3795d3f588034d69b ]
    
    If the gpio_chip::get_direction() callback is not implemented by the GPIO
    controller driver, GPIOLIB emits a warning.
    
    Implement get_direction() for the GPIO part of pinctrl-moore.
    
    Fixes: 471e998c0e31 ("gpiolib: remove redundant callback check")
    Fixes: e623c4303ed1 ("gpiolib: sanitize the return value of gpio_chip::get_direction()")
    Reported-by: Frank Wunderlich <frank-w@public-files.de>
    Closes: https://lore.kernel.org/all/20260409132724.126258-1-linux@fw-web.de/
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@oss.qualcomm.com>
    Tested-By: Frank Wunderlich <frank-w@public-files.de>
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: meson: amlogic-a4: fix deadlock issue [+ + +]

Author: Xianwei Zhao <xianwei.zhao@amlogic.com>
Date:   Wed Apr 22 11:44:13 2026 +0000

    pinctrl: meson: amlogic-a4: fix deadlock issue
    
    [ Upstream commit e72ce029810390eb987a036fb2c8a5da9a23b685 ]
    
    Accessing the pinconf-pins sysfs node may deadlock.
    
    pinconf_pins_show() holds pctldev->mutex, and the platform driver
    calls pinctrl_find_gpio_range_from_pin(), which tries to acquire
    the same mutex again, leading to a deadlock.
    
    Use pinctrl_find_gpio_range_from_pin_nolock() to fix this issue.
    
    Fixes: 6e9be3abb78c ("pinctrl: Add driver support for Amlogic SoCs")
    Signed-off-by: Xianwei Zhao <xianwei.zhao@amlogic.com>
    Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org>
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: qcom: Fix GPIO to PDC wake irq map for qcs615 [+ + +]

Author: Maulik Shah <maulik.shah@oss.qualcomm.com>
Date:   Thu Apr 23 16:55:24 2026 +0530

    pinctrl: qcom: Fix GPIO to PDC wake irq map for qcs615
    
    [ Upstream commit 9d69033ad967b6e09b1e5b30d1a32c6c4876465d ]
    
    PDC interrupts 122-125 were meant for ibi_i3c wakeup but qcs615 do not
    support i3c. GPIOs 39,51,88 and 89 are also connected to different PDC
    pin to support non-ibi wakeup. Update the wakeirq map to reflect same.
    
    Fixes: b698f36a9d40 ("pinctrl: qcom: add the tlmm driver for QCS615 platform")
    Signed-off-by: Maulik Shah <maulik.shah@oss.qualcomm.com>
    Signed-off-by: Navya Malempati <navya.malempati@oss.qualcomm.com>
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: qcom: Fix wakeirq map by removing disconnected irqs for sm8150 [+ + +]

Author: Maulik Shah <maulik.shah@oss.qualcomm.com>
Date:   Tue Apr 28 17:44:58 2026 +0530

    pinctrl: qcom: Fix wakeirq map by removing disconnected irqs for sm8150
    
    [ Upstream commit 52ac35b8a151446481496404af3a8e5e889b3c5a ]
    
    PDC interrupts 122-125 were meant for ibi_i3c wakeup but sm8150 do not
    support i3c. GPIOs 39,51,88 and 144 are also connected to different PDC
    pin and already reflected in the wake irq map.
    
    Remove the unsupported wakeup interrupts from the map.
    
    Fixes: 90337380c809 ("pinctrl: qcom: sm8150: Specify PDC map")
    Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com>
    Signed-off-by: Maulik Shah <maulik.shah@oss.qualcomm.com>
    Signed-off-by: Navya Malempati <navya.malempati@oss.qualcomm.com>
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: qcom: ipq4019: mark gpio as a GPIO pin function [+ + +]

Author: Til Kaiser <mail@tk154.de>
Date:   Mon Apr 13 15:52:34 2026 +0200

    pinctrl: qcom: ipq4019: mark gpio as a GPIO pin function
    
    [ Upstream commit b51d33ea8a164bb5f0eec8ad817fa9730ac2b577 ]
    
    The qcom pinctrl core supports marking functions that represent GPIO mode
    via PINCTRL_GPIO_PINFUNCTION(), so that strict pinmuxing does not reject
    GPIO requests for pins that are muxed to the GPIO function.
    
    ipq4019 still describes its gpio function with QCA_PIN_FUNCTION(gpio),
    so it is not treated as a GPIO pin function. As a result, GPIO consumers
    can still conflict with pinctrl states that select the "gpio" function.
    
    Add a QCA_GPIO_PIN_FUNCTION() helper and use it for the ipq4019 gpio
    function, matching how the msm-based qcom drivers handle this.
    
    This allows ipq4019 to keep the GPIO-related pin configuration in DTS
    without tripping over strict pinmux ownership checks.
    
    Fixes: cc85cb96e2e4 ("pinctrl: qcom: make the pinmuxing strict")
    Signed-off-by: Til Kaiser <mail@tk154.de>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@oss.qualcomm.com>
    Signed-off-by: Linus Walleij <linusw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: renesas: rzg2l: Fix incorrect PUPD register offset for high pins during suspend/resume [+ + +]

Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Sat Mar 28 09:05:45 2026 +0000

    pinctrl: renesas: rzg2l: Fix incorrect PUPD register offset for high pins during suspend/resume
    
    [ Upstream commit 6dba9b7268cc50166bce47608670192fd874e363 ]
    
    When saving/restoring pull-up/down register state during suspend/resume,
    the second PUPD register access was incorrectly using the same base offset
    as the first, effectively reading/writing the same register twice instead
    of the adjacent one.
    
    Add the correct + 4 byte offset to the second RZG2L_PCTRL_REG_ACCESS32
    call so that pupd[1][port] is properly saved and restored from the next
    32-bit register in the PUPD register pair, covering pins 4–7 of ports
    with 4 or more pins.
    
    Fixes: b2bd65fbb617 ("pinctrl: renesas: rzg2l: Add suspend/resume support for pull up/down")
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/20260328090548.84124-1-biju.das.jz@bp.renesas.com
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

pinctrl: renesas: rzg2l: Fix SMT register cache handling [+ + +]

Author: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com>
Date:   Mon Apr 13 19:24:51 2026 +0100

    pinctrl: renesas: rzg2l: Fix SMT register cache handling
    
    [ Upstream commit c88ab9407986836820848128ce1f90f2fa49da95 ]
    
    Store SMT register cache per bank instead of using a single array.
    
    On RZ/V2H(P), RZ/V2N, and RZ/G3E, the SMT register is split across two
    32-bit registers: bits 0/8/16/24 control pins 0-3, while pins 4-7 are
    controlled by the corresponding bits in the next register.  The previous
    implementation cached only a single SMT register, leading to incomplete
    save/restore of SMT state.
    
    Convert cache->smt to a per-bank array and allocate storage for both
    halves.  Update suspend/resume handling to save and restore both SMT
    registers when present.
    
    Fixes: 837afa592c623 ("pinctrl: renesas: rzg2l: Add suspend/resume support for Schmitt control registers")
    Signed-off-by: Lad Prabhakar <prabhakar.mahadev-lad.rj@bp.renesas.com>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://patch.msgid.link/20260413182456.811543-2-prabhakar.mahadev-lad.rj@bp.renesas.com
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/surface: aggregator_registry: omit battery & AC nodes on Surface Laptop 7 [+ + +]

Author: Oliver White <oliverjwhite07@gmail.com>
Date:   Thu Apr 9 15:43:47 2026 +1200

    platform/surface: aggregator_registry: omit battery & AC nodes on Surface Laptop 7
    
    [ Upstream commit 0488073a6c84571dd3cffe581a4a73a5fceb099d ]
    
    Surface Laptop 7 exposes battery and AC status via Qualcomm PMIC GLINK
    qcom_battmgr. Registering the standard SSAM battery and AC client
    devices on this platform causes duplicate power-supply devices to
    appear.
    
    Drop the SSAM battery and AC nodes from the Surface Laptop 7 registry
    group so that only the qcom_battmgr power supplies are instantiated.
    
    Fixes: b27622f13172 ("platform/surface: Add OF support")
    Signed-off-by: Oliver White <oliverjwhite07@gmail.com>
    Link: https://patch.msgid.link/20260409034347.17381-1-oliverjwhite07@gmail.com
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: adv_swbutton: Check ACPI_HANDLE() against NULL [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue May 12 17:11:49 2026 +0200

    platform/x86: adv_swbutton: Check ACPI_HANDLE() against NULL
    
    [ Upstream commit e7a9a6ea40e352cd7977f6a8c80bdeadf65ad838 ]
    
    Every platform driver can be forced to match a device that doesn't match
    its list of device IDs because of device_match_driver_override(), so
    platform drivers that rely on the existence of a device's ACPI companion
    object need to verify its presence.
    
    Accordingly, add a requisite ACPI_HANDLE() check against NULL to the
    platform/x86 adv_swbutton driver.
    
    Fixes: 3d904005f686 ("platform/x86: add support for Advantech software defined button")
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/5115425.31r3eYUQgx@rafael.j.wysocki
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: asus-armoury: fix mini-LED mode get/set on MODE2 devices [+ + +]

Author: Ahmed Yaseen <yaseen@ghoul.dev>
Date:   Sun May 17 18:30:11 2026 +0000

    platform/x86: asus-armoury: fix mini-LED mode get/set on MODE2 devices
    
    [ Upstream commit d2d2e7c8fb37b27301ee5c8343b2f7037efc6ea6 ]
    
    The mini-LED current_value attribute does not work on devices that use
    ASUS_WMI_DEVID_MINI_LED_MODE2 (2024 and newer models).
    
    Reading is broken: mini_led_mode_current_value_show() fetches the mode
    from the device but then decodes a literal 0 instead of the value it
    just read:
    
        mode = FIELD_GET(ASUS_MINI_LED_MODE_MASK, 0);
    
    So mode is always 0, and the attribute always reports the same thing
    regardless of the real hardware state.
    
    Writing is broken too. The number a user writes is an index; the value
    the firmware actually wants is looked up from that index in
    mini_led_mode_map[]. mini_led_mode_current_value_store() skips that
    lookup and passes the raw index straight to armoury_attr_uint_store().
    On 2024 devices the firmware numbers its modes differently from the
    index, so some writes are rejected with -EINVAL and the rest send the
    wrong mode to the hardware.
    
    Fix both paths: decode the value actually read from the device when
    reading, and look up the firmware value before sending it when
    writing. Older (MODE1) devices were unaffected because there the index
    and the firmware value are the same.
    
    Fixes: f99eb098090e ("platform/x86: asus-armoury: move existing tunings to asus-armoury module")
    Signed-off-by: Ahmed Yaseen <yaseen@ghoul.dev>
    Reviewed-by: Denis Benato <denis.benato@linux.dev>
    Link: https://patch.msgid.link/20260517182957.11069-1-yaseen@ghoul.dev
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: hp_accel: Check ACPI_COMPANION() against NULL [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue May 12 17:12:40 2026 +0200

    platform/x86: hp_accel: Check ACPI_COMPANION() against NULL
    
    [ Upstream commit abfbe5ee8ae89f1f5449790423d5dd3e423545bd ]
    
    Every platform driver can be forced to match a device that doesn't match
    its list of device IDs because of device_match_driver_override(), so
    platform drivers that rely on the existence of a device's ACPI companion
    object need to verify its presence.
    
    Accordingly, add a requisite ACPI_COMPANION() check against NULL to the
    platform/x86 hp_accel driver.
    
    Fixes: 8ebcb6c94c71 ("platform/x86: hp_accel: Convert to be a platform driver")
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/2425918.ElGaqSPkdT@rafael.j.wysocki
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: intel-hid: Check ACPI_HANDLE() against NULL [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue May 12 17:13:28 2026 +0200

    platform/x86: intel-hid: Check ACPI_HANDLE() against NULL
    
    [ Upstream commit 5c69e090ae5dd93d910f70db0796357080707d26 ]
    
    Every platform driver can be forced to match a device that doesn't match
    its list of device IDs because of device_match_driver_override(), so
    platform drivers that rely on the existence of a device's ACPI companion
    object need to verify its presence.
    
    Accordingly, add a requisite ACPI_HANDLE() check against NULL to the
    platform/x86 intel-hid driver.
    
    Fixes: ecc83e52b28c ("intel-hid: new hid event driver for hotkeys")
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/1971512.tdWV9SEqCh@rafael.j.wysocki
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: intel-vbtn: Check ACPI_HANDLE() against NULL [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue May 12 17:16:22 2026 +0200

    platform/x86: intel-vbtn: Check ACPI_HANDLE() against NULL
    
    [ Upstream commit a9f305c5a355efeb240d406d378491d9eec02d07 ]
    
    Every platform driver can be forced to match a device that doesn't match
    its list of device IDs because of device_match_driver_override(), so
    platform drivers that rely on the existence of a device's ACPI companion
    object need to verify its presence.
    
    Accordingly, add a requisite ACPI_HANDLE() check against NULL to the
    platform/x86 intel-vbtn driver.
    
    Fixes: 26173179fae1 ("platform/x86: intel-vbtn: Eval VBDL after registering our notifier")
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/3426431.aeNJFYEL58@rafael.j.wysocki
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: intel_sar: Check ACPI_HANDLE() against NULL [+ + +]

Author: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Date:   Tue May 12 17:15:32 2026 +0200

    platform/x86: intel_sar: Check ACPI_HANDLE() against NULL
    
    [ Upstream commit 2765f16c12af7c2533763e46b8113b727354012d ]
    
    Every platform driver can be forced to match a device that doesn't match
    its list of device IDs because of device_match_driver_override(), so
    platform drivers that rely on the existence of a device's ACPI companion
    object need to verify its presence.
    
    Accordingly, add a requisite ACPI_HANDLE() check against NULL to the
    platform/x86 intel_sar driver.
    
    Fixes: dcfbd31ef4bc ("platform/x86: BIOS SAR driver for Intel M.2 Modem")
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Link: https://patch.msgid.link/14023870.uLZWGnKmhe@rafael.j.wysocki
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: uniwill-laptop: Accept charging threshold of 0 [+ + +]

Author: Armin Wolf <W_Armin@gmx.de>
Date:   Wed May 13 01:21:39 2026 +0200

    platform/x86: uniwill-laptop: Accept charging threshold of 0
    
    [ Upstream commit c16a4823cc60a32b891f7a148bb30c0f51d12cf4 ]
    
    The power supply sysfs ABI states that:
    
            Not all hardware is capable of setting this to an arbitrary
            percentage. Drivers will round written values to the nearest
            supported value. Reading back the value will show the actual
            threshold set by the driver.
    
    The driver currently violates this ABI by rejecting a charging
    threshold of 0. Fix this by clamping this value to 1.
    
    Fixes: d050479693bb ("platform/x86: Add Uniwill laptop driver")
    Reviewed-by: Werner Sembach <wse@tuxedocomputers.com>
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Armin Wolf <W_Armin@gmx.de>
    Link: https://patch.msgid.link/20260512232145.329260-3-W_Armin@gmx.de
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: uniwill-laptop: Do not enable the charging limit even when forced [+ + +]

Author: Armin Wolf <W_Armin@gmx.de>
Date:   Wed May 13 01:21:41 2026 +0200

    platform/x86: uniwill-laptop: Do not enable the charging limit even when forced
    
    [ Upstream commit 26cbe119f99c86dcb4a0136d2bc73c0c716d80e4 ]
    
    It seems that on some older models (~2020) the battery charging limit
    can permanently damage the battery. Prevent users from enabling this
    feature thru the "force" module parameter to avoid causing permanent
    hardware damage on such devices.
    
    Fixes: d050479693bb ("platform/x86: Add Uniwill laptop driver")
    Link: https://www.reddit.com/r/XMG_gg/comments/ld9yyf/battery_limit_hidden_function_discovered_on/
    Reviewed-by: Werner Sembach <wse@tuxedocomputers.com>
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Armin Wolf <W_Armin@gmx.de>
    Link: https://patch.msgid.link/20260512232145.329260-5-W_Armin@gmx.de
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: uniwill-laptop: Fix behavior of "force" module param [+ + +]

Author: Armin Wolf <W_Armin@gmx.de>
Date:   Wed May 13 01:21:40 2026 +0200

    platform/x86: uniwill-laptop: Fix behavior of "force" module param
    
    [ Upstream commit fb4b67c44557cb4cbb15900083d4e1af22320339 ]
    
    Users might want to force-enable all possible features even on
    machines with a valid device descriptor. Until now the "force"
    module param was ignored on such machines. Fix this to make
    it easier to test for support of new features.
    
    Fixes: d050479693bb ("platform/x86: Add Uniwill laptop driver")
    Reviewed-by: Werner Sembach <wse@tuxedocomputers.com>
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Armin Wolf <W_Armin@gmx.de>
    Link: https://patch.msgid.link/20260512232145.329260-4-W_Armin@gmx.de
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

platform/x86: uniwill-laptop: Properly initialize charging threshold [+ + +]

Author: Armin Wolf <W_Armin@gmx.de>
Date:   Wed May 13 01:21:38 2026 +0200

    platform/x86: uniwill-laptop: Properly initialize charging threshold
    
    [ Upstream commit c12cc42dadd85dea210d5699d4f21def827382eb ]
    
    The EC might initialize the charge threshold with 0 to signal that
    said threshold is uninitialized. Detect this and replace said value
    with 100 to signal the EC that we want to take control of battery
    charging. Also set the threshold to 100 if the EC-provided value
    is invalid.
    
    Fixes: d050479693bb ("platform/x86: Add Uniwill laptop driver")
    Reviewed-by: Werner Sembach <wse@tuxedocomputers.com>
    Signed-off-by: Armin Wolf <W_Armin@gmx.de>
    Link: https://patch.msgid.link/20260512232145.329260-2-W_Armin@gmx.de
    Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

powerpc/hv-gpci: fix preempt count leak in sysfs show paths [+ + +]

Author: Aboorva Devarajan <aboorvad@linux.ibm.com>
Date:   Fri May 8 09:42:56 2026 +0530

    powerpc/hv-gpci: fix preempt count leak in sysfs show paths
    
    [ Upstream commit dbc30a57bd8e026995e9fa8e8c31cffd18542c01 ]
    
    Four sysfs show() callbacks in hv-gpci take get_cpu_var(hv_gpci_reqb)
    (which calls preempt_disable()) but only call the matching put_cpu_var()
    on the error path under the 'out:' label. Every successful read leaks
    one preempt_disable():
    
      processor_bus_topology_show()
      processor_config_show()
      affinity_domain_via_virtual_processor_show()
      affinity_domain_via_domain_show()
    
    (affinity_domain_via_partition_show() was already correct.)
    
    On a CONFIG_PREEMPT=y kernel, repeated reads raise preempt_count and
    eventually return to userspace with preemption still disabled. The
    next user-mode page fault then hits faulthandler_disabled() == 1,
    gets forced to SIGSEGV, and the resulting coredump trips
    'BUG: scheduling while atomic' in call_usermodehelper_exec ->
    wait_for_completion_state -> schedule:
    
      BUG: scheduling while atomic: <task>/<pid>/0x00000004
      ...
      __schedule_bug+0x6c/0x90
      __schedule+0x58c/0x13a0
      schedule+0x48/0x1a0
      schedule_timeout+0x104/0x170
      wait_for_completion_state+0x16c/0x330
      call_usermodehelper_exec+0x254/0x2d0
      vfs_coredump+0x1050/0x2590
      get_signal+0xb9c/0xc80
      do_notify_resume+0xf8/0x470
    
    Add an out_success label that calls put_cpu_var() before returning
    the byte count, mirroring affinity_domain_via_partition_show().
    
    Fixes: 71f1c39647d8 ("powerpc/hv_gpci: Add sysfs file inside hv_gpci device to show processor bus topology information")
    Fixes: 1a160c2a13c6 ("powerpc/hv_gpci: Add sysfs file inside hv_gpci device to show processor config information")
    Fixes: 71a7ccb478fc ("powerpc/hv_gpci: Add sysfs file inside hv_gpci device to show affinity domain via virtual processor information")
    Fixes: a69a57cac1ec ("powerpc/hv_gpci: Add sysfs file inside hv_gpci device to show affinity domain via domain information")
    Signed-off-by: Aboorva Devarajan <aboorvad@linux.ibm.com>
    Reviewed-by: Ritesh Harjani (IBM) <ritesh.list@gmail.com>
    Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Link: https://patch.msgid.link/20260508041256.3447113-1-aboorvad@linux.ibm.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

powerpc/time: Remove redundant preempt_disable|enable() calls from arch_irq_work_raise() [+ + +]

Author: Sayali Patil <sayalip@linux.ibm.com>
Date:   Wed May 13 13:44:13 2026 +0530

    powerpc/time: Remove redundant preempt_disable|enable() calls from arch_irq_work_raise()
    
    [ Upstream commit 31467b23823ffec1f6fff407f8e3ca9af8b7491a ]
    
    A kernel panic is observed when handling machine check exceptions from
    real mode.
    
      BUG: Unable to handle kernel data access on read at 0xc00000006be21300
      Oops: Kernel access of bad area, sig: 11 [#1]
      MSR:  8000000000001003 <SF,ME,RI,LE>  CR: 88222248  XER: 00000005
      CFAR: c00000000003ffc4 DAR: c00000006be21300 DSISR: 40000000 IRQMASK: 0
      NIP [c000000000029e40] arch_irq_work_raise+0x10/0x70
      LR [c00000000003ffc8] machine_check_queue_event+0xa8/0x150
      Call Trace:
      [c0000000179d3c70] [c00000000003ff64] machine_check_queue_event+0x44/0x150
      [c0000000179d3d30] [c0000000000084e0] machine_check_early_common+0x1f0/0x2c0
    
    The crash occurs because arch_irq_work_raise() calls preempt_disable()
    from machine check exception (MCE) handlers running in real mode. In
    this context, accessing the preempt_count can fault, leading to the panic.
    
    The preempt_disable()/preempt_enable() pair in arch_irq_work_raise()
    was originally added by commit 0fe1ac48bef0 ("powerpc/perf_event: Fix
    oops due to perf_event_do_pending call") to avoid races while raising
    irq work from exception context.
    
    Later, commit 471ba0e686cb ("irq_work: Do not raise an IPI when
    queueing work on the local CPU") added preemption protection in
    irq_work_queue() path, while commit 20b876918c06 ("irq_work: Use per
    cpu atomics instead of regular atomics") added equivalent
    protection in irq_work_queue_on() before reaching arch_irq_work_raise():
    
      irq_work_queue() / irq_work_queue_on()
        -> preempt_disable()
          -> __irq_work_queue_local()
            -> irq_work_raise()
              -> arch_irq_work_raise()
    
    As a result, callers other than mce_irq_work_raise() already execute
    with preemption disabled, making the additional
    preempt_disable()/preempt_enable() pair in arch_irq_work_raise()
    redundant.
    
    The arch_irq_work_raise() function executes in NMI context when called
    from MCE handler. Hence we will not be preempted or scheduled out since
    we are in NMI context with MSR[EE]=0. Therefore, it is safe to remove
    the preempt_disable()/preempt_enable() calls from here.
    
    Remove it to avoid accessing preempt_count from real mode context.
    
    Fixes: cc15ff327569 ("powerpc/mce: Avoid using irq_work_queue() in realmode")
    Suggested-by: Mahesh Salgaonkar <mahesh@linux.ibm.com>
    Acked-by: Shrikanth Hegde <sshegde@linux.ibm.com>
    Reviewed-by: Ritesh Harjani (IBM) <ritesh.list@gmail.com>
    Signed-off-by: Sayali Patil <sayalip@linux.ibm.com>
    [Maddy: Fixed the commit title]
    Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Link: https://patch.msgid.link/20260513081413.222490-1-sayalip@linux.ibm.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

powerpc: 82xx: fix uninitialized pointers with free attribute [+ + +]

Author: Ally Heev <allyheev@gmail.com>
Date:   Sun Nov 16 19:55:44 2025 +0530

    powerpc: 82xx: fix uninitialized pointers with free attribute
    
    [ Upstream commit acd1e47db03d4b528fd5efb8565dd0de1c79f62a ]
    
    Uninitialized pointers with `__free` attribute can cause undefined
    behavior as the memory allocated to the pointer is freed automatically
    when the pointer goes out of scope.
    
    powerpc/km82xx doesn't have any bugs related to this as of now, but,
    it is better to initialize and assign pointers with `__free` attribute
    in one statement to ensure proper scope-based cleanup
    
    Reported-by: Dan Carpenter <dan.carpenter@linaro.org>
    Closes: https://lore.kernel.org/all/aPiG_F5EBQUjZqsl@stanley.mountain/
    Signed-off-by: Ally Heev <allyheev@gmail.com>
    Fixes: 4aa5cc1e0012 ("powerpc-km82xx.c: replace of_node_put() with __free")
    Reviewed-by: Christophe Leroy (CS GROUP) <chleroy@kernel.org>
    Reviewed-by: Krzysztof Kozlowski <krzk@kernel.org>
    Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Link: https://patch.msgid.link/20251116-aheev-uninitialized-free-attr-km82xx-v2-1-4307e2b5300d@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

powerpc: fix dead default for GUEST_STATE_BUFFER_TEST [+ + +]

Author: Julian Braha <julianbraha@gmail.com>
Date:   Sun Apr 5 17:15:45 2026 +0100

    powerpc: fix dead default for GUEST_STATE_BUFFER_TEST
    
    [ Upstream commit aef656a0e6c01796190bb5bd2bdba1c644ed7811 ]
    
    The GUEST_STATE_BUFFER_TEST config option should default
    to KUNIT_ALL_TESTS so that if all tests are enabled then
    it is included, but currently the 'default KUNIT_ALL_TESTS'
    statement is shadowed by 'def_tristate n',
    meaning that this second default statement is currently dead code.
    
    It looks to me like the commit
    6ccbbc33f06a ("KVM: PPC: Add helper library for Guest State Buffers")
    intended to set the default to KUNIT_ALL_TESTS, but mistakenly
    missed the def_tristate.
    
    This dead code was found by kconfirm, a static analysis tool for Kconfig.
    
    Fixes: 6ccbbc33f06a ("KVM: PPC: Add helper library for Guest State Buffers")
    Signed-off-by: Julian Braha <julianbraha@gmail.com>
    Tested-by: Gautam Menghani <gautam@linux.ibm.com>
    Reviewed-by: Amit Machhiwal <amachhiw@linux.ibm.com>
    Reviewed-by: Harsh Prateek Bora <harshpb@linux.ibm.com>
    Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com>
    Link: https://patch.msgid.link/20260405161545.161006-1-julianbraha@gmail.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

qed: fix double free in qed_cxt_tables_alloc() [+ + +]

Author: Dawei Feng <dawei.feng@seu.edu.cn>
Date:   Wed May 20 15:03:23 2026 +0800

    qed: fix double free in qed_cxt_tables_alloc()
    
    commit 2bccfb8476ca5f3548afbd623dc7a6980d4e77de upstream.
    
    If one of the later PF or VF CID bitmap allocations fails,
    qed_cid_map_alloc() jumps to cid_map_fail and frees the previously
    allocated CID bitmaps before returning an error. qed_cxt_tables_alloc()
    then calls qed_cxt_mngr_free(), which invokes qed_cid_map_free()
    again.
    
    Fix this by setting each CID bitmap pointer to NULL after bitmap_free()
    to avoid double free.
    
    The bug was first flagged by an experimental analysis tool we are
    developing for kernel memory-management bugs while analyzing
    v6.13-rc1. The tool is still under development and is not yet publicly
    available. Manual inspection confirms that the bug is still
    present in v7.1-rc3.
    
    Runtime reproduction was not attempted because exercising the failing
    allocation path requires device-specific setup.
    
    Fixes: fe56b9e6a8d9 ("qed: Add module with basic common support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Zilin Guan <zilin@seu.edu.cn>
    Signed-off-by: Dawei Feng <dawei.feng@seu.edu.cn>
    Link: https://patch.msgid.link/20260520070323.2762379-1-dawei.feng@seu.edu.cn
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

rbd: eliminate a race in lock_dwork draining on unmap [+ + +]

Author: Ilya Dryomov <idryomov@gmail.com>
Date:   Tue May 19 23:07:26 2026 +0200

    rbd: eliminate a race in lock_dwork draining on unmap
    
    commit 9fc75b71fdd38465c76c6f6a884cdd4ae3c72d90 upstream.
    
    Given how rbd_lock_add_request() and rbd_img_exclusive_lock() are
    written, lock_dwork may be (re)queued more than it's actually needed:
    for example in case a new I/O request comes in while we are in the
    middle of rbd_acquire_lock() on behalf of another I/O request.  This is
    expected and with rbd_release_lock() preemptively canceling lock_dwork
    is benign under normal operation.
    
    A more problematic example is maybe_kick_acquire():
    
        if (have_requests || delayed_work_pending(&rbd_dev->lock_dwork)) {
                dout("%s rbd_dev %p kicking lock_dwork\n", __func__, rbd_dev);
                mod_delayed_work(rbd_dev->task_wq, &rbd_dev->lock_dwork, 0);
        }
    
    It's not unrealistic for lock_dwork to get canceled right after
    delayed_work_pending() returns true and for mod_delayed_work() to
    requeue it right there anyway.  This is a classic TOCTOU race.
    
    When it comes to unmapping the image, there is an implicit assumption
    of no self-initiated exclusive lock activity past the point of return
    from rbd_dev_image_unlock() which unlocks the lock if it happens to be
    held.  This unlock is assumed to be final and lock_dwork (as well as
    all other exclusive lock tasks, really) isn't expected to get queued
    again.  However, lock_dwork is canceled only in cancel_tasks_sync()
    (i.e. later in the unmap sequence) and on top of that the cancellation
    can get in effect nullified by maybe_kick_acquire().  This may result
    in rbd_acquire_lock() executing after rbd_dev_device_release() and
    rbd_dev_image_release() run and free and/or reset a bunch of things.
    One of the possible failure modes then is a violated
    
        rbd_assert(rbd_image_format_valid(rbd_dev->image_format));
    
    in rbd_dev_header_info() which is called via rbd_dev_refresh() from
    rbd_post_acquire_action().
    
    Redo exclusive lock task draining to provide saner semantics and try
    to meet the assumptions around rbd_dev_image_unlock().
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
    Reviewed-by: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

RDMA/mana_ib: Report max_msg_sz in mana_ib_query_port [+ + +]

Author: Shiraz Saleem <shirazsaleem@microsoft.com>
Date:   Tue May 12 02:42:09 2026 -0700

    RDMA/mana_ib: Report max_msg_sz in mana_ib_query_port
    
    [ Upstream commit c9a40f6531b81baa9619bcc2697ff86896afcce7 ]
    
    Report max_msg_sz for mana_ib, which is 16MB.
    
    Fixes: 4bda1d5332ec ("RDMA/mana_ib: Implement port parameters")
    Signed-off-by: Shiraz Saleem <shirazsaleem@microsoft.com>
    Signed-off-by: Konstantin Taranov <kotaranov@microsoft.com>
    Link: https://patch.msgid.link/20260512094209.264955-1-kotaranov@linux.microsoft.com
    Reviewed-by: Long Li <longli@microsoft.com>
    Signed-off-by: Leon Romanovsky <leon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

RDMA/rtrs: Fix use-after-free in path file creation cleanup [+ + +]

Author: Guangshuo Li <lgs201920130244@gmail.com>
Date:   Thu May 14 19:38:34 2026 +0800

    RDMA/rtrs: Fix use-after-free in path file creation cleanup
    
    [ Upstream commit 5b74373390113fba798a76b483837029ab010fef ]
    
    In the error path of rtrs_srv_create_path_files(), the sysfs root folders
    may already have been created and srv_path->kobj may already have been
    initialized. If a later step fails, the cleanup currently calls
    kobject_put(&srv_path->kobj) before
    rtrs_srv_destroy_once_sysfs_root_folders(srv_path).
    
    kobject_put() may drop the last reference to srv_path->kobj and invoke the
    release callback, rtrs_srv_release(), which frees srv_path. The following
    call to rtrs_srv_destroy_once_sysfs_root_folders(srv_path) then
    dereferences srv_path internally to access srv_path->srv, resulting in a
    use-after-free.
    
    This failure path is reached before rtrs_srv_create_path_files() returns
    success, so the successful-path lifetime handling is not involved.
    
    Fix this by destroying the sysfs root folders before calling
    kobject_put(&srv_path->kobj), so srv_path is still valid while the helper
    accesses it.
    
    This issue was found by a static analysis tool I am developing.
    
    Fixes: ae4c81644e91 ("RDMA/rtrs-srv: Rename rtrs_srv_sess to rtrs_srv_path")
    Signed-off-by: Guangshuo Li <lgs201920130244@gmail.com>
    Link: https://patch.msgid.link/20260514113834.865530-1-lgs201920130244@gmail.com
    Signed-off-by: Leon Romanovsky <leon@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

RDMA/siw: Reject MPA FPDU length underflow before signed receive math [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Wed May 13 13:53:24 2026 -0400

    RDMA/siw: Reject MPA FPDU length underflow before signed receive math
    
    commit 0ce1bc9e46ecabe84772bb561e373c0d9876d6f2 upstream.
    
    A malicious connected siw peer can send an iWARP FPDU whose MPA length
    field (c_hdr->mpa_len, 16 bit big-endian, peer-controlled) is smaller
    than the fixed DDP/RDMAP header for the announced opcode. Soft-iWARP
    parses the full header in siw_get_hdr() based on iwarp_pktinfo[opcode]
    .hdr_len, but never compares mpa_len against that header length.
    
    siw_tcp_rx_data() then derives
    
        srx->fpdu_part_rem = be16_to_cpu(mpa_len) - fpdu_part_rcvd
                             + MPA_HDR_SIZE;
    
    where fpdu_part_rcvd equals iwarp_pktinfo[opcode].hdr_len at this
    point. For a tagged WRITE (hdr_len 16, MPA_HDR_SIZE 2) the smallest
    on-wire mpa_len of 0 yields fpdu_part_rem = -14, and any mpa_len below
    hdr_len - MPA_HDR_SIZE underflows to a negative int.
    
    The signed value then flows into siw_proc_write()/siw_proc_rresp() as
    
        bytes = min(srx->fpdu_part_rem, srx->skb_new);
    
    is handed to siw_check_mem() as an int len (whose interval check
    addr + len > mem->va + mem->len is satisfied for a valid base when
    len is negative), and reaches siw_rx_data() -> siw_rx_kva() /
    siw_rx_umem() -> skb_copy_bits() as a signed copy length. The header
    copy branch in skb_copy_bits() promotes that to size_t, producing a
    multi-gigabyte read.
    
    KASAN under a KUnit harness that drives the real kernel TCP receive
    path -- a loopback AF_INET socketpair, the malformed FPDU written via
    kernel_sendmsg, sk_data_ready firing in softirq, tcp_read_sock
    dispatching to siw_tcp_rx_data -- reports:
    
        BUG: KASAN: use-after-free in skb_copy_bits+0x284/0x480
        Read of size 4294967295 at addr ffff888...
        Call Trace:
         skb_copy_bits
         siw_rx_kva
         siw_rx_data
         siw_check_mem
         siw_proc_write
         siw_tcp_rx_data
         __tcp_read_sock
         siw_qp_llp_data_ready
         tcp_data_ready
         tcp_data_queue
    
    Add the missing invariant at the earliest point where the peer header
    is fully assembled. iwarp_pktinfo[*].hdr_len - MPA_HDR_SIZE is exactly
    the value the siw transmitter uses as the minimum mpa_len for each
    opcode (drivers/infiniband/sw/siw/siw_qp.c:33), so this matches the
    protocol contract. Out-of-range FPDUs terminate the connection with
    TERM_ERROR_LAYER_LLP / LLP_ETYPE_MPA / LLP_ECODE_FPDU_START -- which
    is RFC 5044 Section 8 error code 3 ("Marker and ULPDU Length fields
    do not agree on the start of an FPDU"), the correct framing-error
    class for this inconsistency.
    
    Fixes: 8b6a361b8c48 ("rdma/siw: receive path")
    Link: https://patch.msgid.link/r/20260513175325.2042630-2-michael.bommarito@gmail.com
    Cc: stable@vger.kernel.org
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Assisted-by: Claude:claude-opus-4-7
    Acked-by: Bernard Metzler <bernard.metzler@linux.dev>
    Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

regulator: tps65219: fix irq_data.rdev not being assigned [+ + +]

Author: Alexander Sverdlin <alexander.sverdlin@gmail.com>
Date:   Mon May 18 10:31:11 2026 +0200

    regulator: tps65219: fix irq_data.rdev not being assigned
    
    commit f9b2d3b703d13df50c630997dfdc25648e96db0d upstream.
    
    Commit 64a6b577490c ("regulator: tps65219: Remove debugging helper
    function") removed the tps65219_get_rdev_by_name() helper along with
    the irq_data.rdev assignment that depended on it. This left
    irq_data.rdev uninitialized for all IRQs, causing undefined behavior
    when regulator_notifier_call_chain() is called from the IRQ handler:
    
      Internal error: Oops: 0000000096000004
      pc : regulator_notifier_call_chain
      lr : tps65219_regulator_irq_handler
      Call trace:
       regulator_notifier_call_chain
       tps65219_regulator_irq_handler
       handle_nested_irq
       regmap_irq_thread
       irq_thread_fn
       irq_thread
       kthread
       ret_from_fork
    
    Instead of restoring a dedicated lookup array, restructure the probe
    function to combine regulator registration with IRQ registration in
    the same loop. This way the rdev returned by devm_regulator_register()
    is naturally available for assigning to irq_data.rdev without any
    auxiliary data structure.
    
    Non-regulator IRQs (SENSOR, TIMEOUT) that don't correspond to any
    registered regulator are registered with rdev=NULL, and the IRQ handler
    is protected with a NULL check to avoid crashing.
    
    Cc: stable@vger.kernel.org
    Closes: https://lore.kernel.org/all/aBDSTxALaOc-PD7X@gaggiata.pivistrello.it/
    Reported-by: Francesco Dolcini <francesco@dolcini.it>
    Fixes: 64a6b577490c ("regulator: tps65219: Remove debugging helper function")
    Signed-off-by: Alexander Sverdlin <alexander.sverdlin@siemens.com>
    Link: https://patch.msgid.link/20260518083113.2063368-1-alexander.sverdlin@siemens.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ring-buffer: Fix reporting of missed events in iterator [+ + +]

Author: Steven Rostedt <rostedt@goodmis.org>
Date:   Wed May 20 22:08:01 2026 -0400

    ring-buffer: Fix reporting of missed events in iterator
    
    commit a254b6d13b0edd6272926674d2afc46d46e496b7 upstream.
    
    When tracing is active while reading the trace file, if the iterator
    reading the buffer detects that the writer has passed the iterator head,
    it will reset and set a "missed events" flag. This flag is passed to the
    output processing to show the user that events were missed:
    
      CPU:4 [LOST EVENTS]
    
    The problem is that the flag is reset after it is checked in
    ring_buffer_iter_dropped(). But the "trace" file iterates over all the CPU
    ring buffers and it will check if they are dropped when figuring out which
    buffer to print next. This prematurely clears the missed_events flag if
    the CPU buffer with the missed events is not the one that is printed next.
    
    On the iteration where the CPU buffer with the missed events is printed,
    the check if it had missed events would return false and the output does
    not show that events were missed.
    
    Do not reset the missed_events flag when checking if there were missed
    events, but instead clear it when moving the iterator head to the next
    event.
    
    Cc: stable@vger.kernel.org
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Link: https://patch.msgid.link/20260520220801.4fd09d13@fedora
    Fixes: c9b7a4a72ff64 ("ring-buffer/tracing: Have iterator acknowledge dropped events")
    Acked-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ring-buffer: Flush and stop persistent ring buffer on panic [+ + +]

Author: Masami Hiramatsu (Google) <mhiramat@kernel.org>
Date:   Thu Apr 30 12:28:16 2026 +0900

    ring-buffer: Flush and stop persistent ring buffer on panic
    
    commit a494d3c8d5392bcdff83c2a593df0c160ff9f322 upstream.
    
    On real hardware, panic and machine reboot may not flush hardware cache
    to memory. This means the persistent ring buffer, which relies on a
    coherent state of memory, may not have its events written to the buffer
    and they may be lost. Moreover, there may be inconsistency with the
    counters which are used for validation of the integrity of the
    persistent ring buffer which may cause all data to be discarded.
    
    To avoid this issue, stop recording of the ring buffer on panic and
    flush the cache of the ring buffer's memory.
    
    Fixes: e645535a954a ("tracing: Add option to use memmapped memory for trace boot instance")
    Cc: stable@vger.kernel.org
    Cc: Will Deacon <will@kernel.org>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Cc: Ian Rogers <irogers@google.com>
    Link: https://patch.msgid.link/177751969602.2136606.12031934362587643488.stgit@mhiramat.tok.corp.google.com
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Acked-by: Catalin Marinas <catalin.marinas@arm.com>
    Acked-by: Geert Uytterhoeven <geert@linux-m68k.org>
    Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

riscv: Docs: fix unmatched quote warning [+ + +]

Author: Randy Dunlap <rdunlap@infradead.org>
Date:   Mon Apr 6 16:23:04 2026 -0700

    riscv: Docs: fix unmatched quote warning
    
    [ Upstream commit 50da1c9ccb70fc5250c37ac474b54ee072732ea3 ]
    
    'make htmldocs' complains about ``prctrl` -- so add a second '`' to
    avoid the warning.
    
    Documentation/arch/riscv/zicfilp.rst:79: WARNING: Inline literal start-string without end-string. [docutils]
    
    Fixes: 08ee1559052b ("prctl: cfi: change the branch landing pad prctl()s to be more descriptive")
    Signed-off-by: Randy Dunlap <rdunlap@infradead.org>
    Link: https://patch.msgid.link/20260406232304.1892528-1-rdunlap@infradead.org
    Signed-off-by: Paul Walmsley <pjw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

riscv: errata: Fix bitwise vs logical AND in MIPS errata patching [+ + +]

Author: Michael Neuling <mikey@neuling.org>
Date:   Thu Apr 9 09:11:39 2026 +0000

    riscv: errata: Fix bitwise vs logical AND in MIPS errata patching
    
    [ Upstream commit 4d2b03699460b8fd5df34408a03a84a1a7ff8aa1 ]
    
    The condition checking whether a specific errata needs patching uses
    logical AND (&&) instead of bitwise AND (&). Since logical AND only
    checks that both operands are non-zero, this causes all errata patches
    to be applied whenever any single errata is detected, rather than only
    applying the matching one.
    
    The SiFive errata implementation correctly uses bitwise AND for the same
    check.
    
    Fixes: 0b0ca959d206 ("riscv: errata: Fix the PAUSE Opcode for MIPS P8700")
    Signed-off-by: Michael Neuling <mikey@neuling.org>
    Assisted-by: Cursor:claude-4.6-opus-high-thinking
    Link: https://patch.msgid.link/20260409091143.1348853-2-mikey@neuling.org
    [pjw@kernel.org: fixed checkpatch warning]
    Signed-off-by: Paul Walmsley <pjw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

riscv: Fix register corruption from uninitialized cregs on error [+ + +]

Author: Michael Neuling <mikey@neuling.org>
Date:   Fri May 1 06:23:20 2026 +0000

    riscv: Fix register corruption from uninitialized cregs on error
    
    [ Upstream commit 6ebcbb53fc9bc30843054ed99fd60b8e542628f4 ]
    
    compat_riscv_gpr_set() calls cregs_to_regs() unconditionally, even when
    user_regset_copyin() fails. Since cregs is an uninitialized stack
    variable, a copyin failure causes uninitialized stack data to be written
    into the target task's pt_regs, corrupting its register state and
    potentially leaking kernel stack contents.
    
    compat_restore_sigcontext() has the same issue: it calls cregs_to_regs()
    even when __copy_from_user() fails, leading to the same corruption of
    the signal-returning task's register state on error.
    
    Only call cregs_to_regs() when the user copy succeeds.
    
    Fixes: 4608c159594f ("riscv: compat: ptrace: Add compat_arch_ptrace implement")
    Fixes: 7383ee05314b ("riscv: compat: signal: Add rt_frame implementation")
    Signed-off-by: Michael Neuling <mikey@neuling.org>
    Assisted-by: Cursor:claude-4.6-opus-high-thinking
    Link: https://patch.msgid.link/20260501062320.2339562-1-mikey@neuling.org
    Signed-off-by: Paul Walmsley <pjw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

riscv: kvm: return SBI_ERR_FAILURE for pmu_event_info() when OOM [+ + +]

Author: Osama Abdelkader <osama.abdelkader@gmail.com>
Date:   Thu May 14 19:36:41 2026 +0200

    riscv: kvm: return SBI_ERR_FAILURE for pmu_event_info() when OOM
    
    commit 0e9d0e7a7c78db7aa1c13796c65cfe0aefa54a5b upstream.
    
    kvm_riscv_vcpu_pmu_event_info() returned -ENOMEM from the
    SBI extension handler, which caused kvm_riscv_vcpu_sbi_ecall()
    to abort KVM_RUN and surface the error to userspace instead of
    completing the ECALL with a negative SBI error in a0.
    Use SBI_ERR_FAILURE and the normal retdata path, matching other PMU
    handlers and kvm_sbi_ext_pmu_handler comment.
    
    Fixes: e309fd113b9f ("RISC-V: KVM: Implement get event info function")
    Cc: stable@vger.kernel.org
    Signed-off-by: Osama Abdelkader <osama.abdelkader@gmail.com>
    Reviewed-by: Anup Patel <anup@brainfault.org>
    Link: https://lore.kernel.org/r/20260514173642.41448-2-osama.abdelkader@gmail.com
    Signed-off-by: Anup Patel <anup@brainfault.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

riscv: kvm: return SBI_ERR_FAILURE for pmu_snapshot_set_shmem() when OOM [+ + +]

Author: Osama Abdelkader <osama.abdelkader@gmail.com>
Date:   Thu May 14 19:36:40 2026 +0200

    riscv: kvm: return SBI_ERR_FAILURE for pmu_snapshot_set_shmem() when OOM
    
    commit 0835ee26938e15eccd70f7d33da386b6490f9449 upstream.
    
    kvm_riscv_vcpu_pmu_snapshot_set_shmem() returned -ENOMEM from the
    SBI extension handler, which caused kvm_riscv_vcpu_sbi_ecall() to
    abort KVM_RUN and surface the error to userspace instead of
    ompleting the ECALL with a negative SBI error in a0.
    Use SBI_ERR_FAILURE and the normal retdata path, matching other PMU
    handlers and kvm_sbi_ext_pmu_handler comment.
    
    Fixes: c2f41ddbcdd7 ("RISC-V: KVM: Implement SBI PMU Snapshot feature")
    Cc: stable@vger.kernel.org
    Signed-off-by: Osama Abdelkader <osama.abdelkader@gmail.com>
    Reviewed-by: Anup Patel <anup@brainfault.org>
    Link: https://lore.kernel.org/r/20260514173642.41448-1-osama.abdelkader@gmail.com
    Signed-off-by: Anup Patel <anup@brainfault.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

riscv: mm: Fixup no5lvl failure when vaddr is invalid [+ + +]

Author: Guo Ren (Alibaba DAMO Academy) <guoren@kernel.org>
Date:   Sun Jan 25 00:52:12 2026 -0500

    riscv: mm: Fixup no5lvl failure when vaddr is invalid
    
    [ Upstream commit db909bd7986c10da074917af3dae83a60fa65093 ]
    
    Unlike no4lvl, no5lvl still continues to detect satp, which
    requires va=pa mapping. When pa=0x800000000000, no5lvl
    would fail in Sv48 mode due to an illegal VA value of
    0x800000000000.
    
    So, prevent detecting the satp flow for no5lvl, when
    vaddr is invalid. Add the is_vaddr_valid() function for
    checking.
    
    Fixes: 26e7aacb83df ("riscv: Allow to downgrade paging mode from the command line")
    Cc: Alexandre Ghiti <alexghiti@rivosinc.com>
    Cc: Björn Töpel <bjorn@rivosinc.com>
    Signed-off-by: Guo Ren (Alibaba DAMO Academy) <guoren@kernel.org>
    Tested-by: Fangyu Yu <fangyu.yu@linux.alibaba.com>
    Link: https://patch.msgid.link/20260125055212.433163-1-guoren@kernel.org
    [pjw@kernel.org: cleaned up commit message]
    Signed-off-by: Paul Walmsley <pjw@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

rxrpc: Fix DATA decrypt vs splice() by copying data to buffer in recvmsg [+ + +]

Author: David Howells <dhowells@redhat.com>
Date:   Sat May 16 00:05:14 2026 +0100

    rxrpc: Fix DATA decrypt vs splice() by copying data to buffer in recvmsg
    
    [ Upstream commit d2bc90cf6c75cb96d2ce549be6c35efa3099d25b ]
    
    This improves the fix for CVE-2026-43500.
    
    Fix the pagecache corruption from in-place decryption of a DATA packet
    transmitted locally by splice() by getting rid of the packet sharing in the
    I/O thread and unconditionally extracting the packet content into a bounce
    buffer in which the buffer is decrypted.  recvmsg() (or the kernel
    equivalent) then copies the data from the bounce buffer to the destination
    buffer.  The sk_buff then remains unmodified.
    
    This has an additional advantage in that the packet is then arranged in the
    buffer with the correct alignment required for the crypto algorithms to
    process directly.  The performance of the crypto does seem to be a little
    faster and, surprisingly, the unencrypted performance doesn't seem to
    change much - possibly due to removing complexity from the I/O thread.
    
    Yet another advantage is that the I/O thread doesn't have to copy packets
    which would slow down packet distribution, ACK generation, etc..
    
    The buffer belongs to the call and is allocated initially at 2K,
    sufficiently large to hold a whole jumbo subpacket, but the buffer will be
    increased in size if needed.  However, to take this work, MSG_PEEK may
    cause a later packet to be decrypted into the buffer, in which case the
    earlier one will need re-decrypting for a subsequent recvmsg().
    
    Note that rx_pkt_offset may legitimately see 0 as a valid offset now, so
    switch to using USHRT_MAX to indicate an invalid offset.
    
    Note also that I would generally prefer to replace the buffers of the
    current sk_buff with a new kmalloc'd buffer of the right size, ditching the
    old data and frags as this makes the handling of MSG_PEEK easier and
    removes the re-decryption issue, but this looks like quite a complicated
    thing to achieve.  skb_morph() looks half way to what I want, but I don't
    want to have to allocate a new sk_buff.
    
    Fixes: d0d5c0cd1e71 ("rxrpc: Use skb_unshare() rather than skb_cow_data()")
    Reported-by: Hyunwoo Kim <imv4bel@gmail.com>
    Closes: https://lore.kernel.org/r/afKV2zGR6rrelPC7@v4bel/
    Signed-off-by: David Howells <dhowells@redhat.com>
    cc: Simon Horman <horms@kernel.org>
    cc: Jiayuan Chen <jiayuan.chen@linux.dev>
    cc: linux-afs@lists.infradead.org
    Reviewed-by: Jeffrey Altman <jaltman@auristor.com>
    Tested-by: Marc Dionne <marc.dionne@auristor.com>
    Link: https://patch.msgid.link/20260515230516.2718212-3-dhowells@redhat.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

s390/cio: Restore GFP_DMA for CHSC allocation [+ + +]

Author: Peter Oberparleiter <oberpar@linux.ibm.com>
Date:   Thu May 7 16:27:08 2026 +0200

    s390/cio: Restore GFP_DMA for CHSC allocation
    
    commit ea34567db0a6b3a7ce78ba421592344315c8f90e upstream.
    
    Re-add GFP_DMA when allocating memory for CHSC control blocks.
    On some supported machines, CHSC cannot access memory outside
    the DMA zone, causing CHSC command failures.
    
    Cc: stable@vger.kernel.org
    Fixes: a3a64a4def8d ("s390/cio: remove unneeded DMA zone allocation")
    Signed-off-by: Peter Oberparleiter <oberpar@linux.ibm.com>
    Reviewed-by: Heiko Carstens <hca@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

s390/pai: Disable duplicate read of kernel PAI counter value [+ + +]

Author: Thomas Richter <tmricht@linux.ibm.com>
Date:   Mon Apr 27 07:17:19 2026 +0200

    s390/pai: Disable duplicate read of kernel PAI counter value
    
    commit 3fe7ecab1a0856aafe1026a35af1621a5c18d53f upstream.
    
    The PAI crypto counter design allows for user space and kernel space
    PAI counter increment recording. This is achieved by splitting the
    recording page in half. The upper part of the 4KB page records user
    space increments of PAI crypto counter and the lower half records
    kernel space increments. The page itself looks like:
    
     lowcore ptr ---> ++++++++++++++++++++++++
                      |user space area       |
                      +----------------------+
                      |kernel space area     |
                      ++++++++++++++++++++++++
    
    User space and kernel space entries are handled via a kernel_offset
    value when wrting. For PAI crypto counters this offset is 2048 or
    half of a page size.
    
    For PAI NNPA counter design this distinction was not needed. There is
    no user and kernel space part for the page pointed to by lowcore.
    The set up is:
    
     lowcore ptr ---> ++++++++++++++++++++++++
                      |user + kernel space   |
                      |area                  |
                      |                      |
                      ++++++++++++++++++++++++
    
    There is always only one counter value recorded and saved.
    
    Depending on number of CPUs and machine load, the number of PAI NNPA
    counter increment differs between counting (perf stat) and recording
    (perf record). The number reported by sampling was double the number
    shown by counting.
    
    This was caused by a double read of the PAI NNPA values in function
    pai_copy(). The first part of that function reads the kernel space part.
    The offset into the kernel page part must be larger than zero.
    The second part of that function reads the user space part, which
    begins of offset zero. This works fine for PAI crypto counters.
    
    It fails for PAI NNPA counters because the PMU device driver does
    not support that feature and has a kernel_offset value of 0x0.
    Executing both user and kernel space read out might end up reading
    user space value twice.
    For the PAI NNPA PMU prohibit the kernel space part read out.
    
    Cc: stable@vger.kernel.org
    Fixes: f12473541356 ("s390/pai_crypto: Rename paicrypt_copy() to pai_copy()")
    Signed-off-by: Thomas Richter <tmricht@linux.ibm.com>
    Reviewed-by: Sumanth Korikkar <sumanthk@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

s390/pai: Fix missing PAI counter increments under heavy load [+ + +]

Author: Thomas Richter <tmricht@linux.ibm.com>
Date:   Tue May 5 12:34:33 2026 +0200

    s390/pai: Fix missing PAI counter increments under heavy load
    
    commit 99269799bf2448aebccee164df56c22a7b85b02c upstream.
    
    Machines with a larger number of CPUs and under heavy load sometimes
    loose PAI counter increments during recording using events
    -e CRYPTO_ÂLL or -e NNPA_ALL. Counting is not affected.
    This happens when several PAI crypto counters are incremented during
    the same cryptographic operation.
    
    During schedule out the functions
    
    paiXXX_sched_task() (with XXX either crypt or ext)
    +--> pai_have_samples()
       +--> pai_have_sample()
            +--> pai_copy()
            +--> pai_push_sample()
    
    are called to read out PAI counter values.
    In pai_copy() the current values of PAI counters are read from the
    PMU memory mapped page and compared to the values read during last
    schedule out operation, which have been saved in a backup page
    named PAI_SAVE_AREA(event). For each PAI counter a delta is calculated
    and when the delta is positive, that PAI counter was incremented by
    hardware. This positve delta is reported as raw data record attached
    to a sample.
    After all deltas have been calculated, the new PAI counter values
    are saved in the backup page PAI_SAVE_AREA(event). However this is
    done in pai_push_sample(), leaving a small window for missing hardware
    triggered updates. Here is one scenario:
    
      PAI counter idx:   0   1   2   3   4   5   6   7  ....  N
                       +---+---+---+---+---+---+---+---+    +---+
      PAI counter page:|   |   | X |   |   |   |   |   |....| Y |
                       +---+---+---+---+---+---+---+---+    +---+
    
    In pai_copy() each PAI counter value is read and compared
    to its old value. This is done in a loop. When PAI counter indexed
    N is read, the hardware might increment PAI counter indexed 2 again,
    updating its value from X to X+1.
    Later pai_push_sample() simply mem-copies the complete PAI counter
    page to a backup page and the increment of X+1 is lost, because the
    backup page now contains the new value.
    
    Read each PAI counter and save this value in the backup page when
    there is a positive delta. This omits any time window between read
    and store. This also reduced the work load as only modified PAI
    counters are saved.
    
    Cc: stable@vger.kernel.org
    Fixes: fe861b0c8d06 ("s390/pai: save PAI counter value page in event structure")
    Signed-off-by: Thomas Richter <tmricht@linux.ibm.com>
    Reviewed-by: Sumanth Korikkar <sumanthk@linux.ibm.com>
    Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

sched_ext: Avoid UAF in scx_root_enable_workfn() init failure path [+ + +]

Author: Tejun Heo <tj@kernel.org>
Date:   Thu May 21 08:57:53 2026 -0400

    sched_ext: Avoid UAF in scx_root_enable_workfn() init failure path
    
    [ Upstream commit 9a415cc53711f2238e0f0ca8a6bcc796c003b127 ]
    
    In scx_root_enable_workfn(), put_task_struct(p) is called before scx_error()
    dereferences p->comm and p->pid. If the iterator's reference is the last
    drop, the task is freed synchronously and the deref becomes a UAF.
    
    Move put_task_struct() past scx_error().
    
    Reported-by: Sashiko <sashiko-bot@kernel.org>
    Closes: https://lore.kernel.org/all/20260511214031.AF5E9C2BCB0@smtp.kernel.org/
    Fixes: f0e1a0643a59 ("sched_ext: Implement BPF extensible scheduler class")
    Cc: stable@vger.kernel.org # v6.12+
    Signed-off-by: Tejun Heo <tj@kernel.org>
    [ kept `scx_init_task()` call site instead of `__scx_init_task()`/`task_rq_lock` ]
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

sched_ext: Fix missing warning in scx_set_task_state() default case [+ + +]

Author: Samuele Mariotti <smariotti@disroot.org>
Date:   Thu May 21 08:57:52 2026 -0400

    sched_ext: Fix missing warning in scx_set_task_state() default case
    
    [ Upstream commit b905ee77d5f557a83a485b4146210f54f13365fc ]
    
    In scx_set_task_state(), the default case was setting the
    warn flag, but then returning immediately. This is problematic
    because the only purpose of the warn flag is to trigger
    WARN_ONCE, but the early return prevented it from ever firing,
    leaving invalid task states undetected and untraced.
    
    To fix this, a WARN_ONCE call is now added directly in the
    default case.
    
    The fix addresses two aspects:
    
     - Guarantees the invalid task states are properly logged
       and traced.
    
     - Provides a distinct warning message
       ("sched_ext: Invalid task state") specifically for
       states outside the defined scx_task_state enum values,
       making it easier to distinguish from other transition
       warnings.
    
    This ensures proper detection and reporting of invalid states.
    
    Signed-off-by: Samuele Mariotti <smariotti@disroot.org>
    Signed-off-by: Paolo Valente <paolo.valente@unimore.it>
    Reviewed-by: Andrea Righi <arighi@nvidia.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Stable-dep-of: 9a415cc53711 ("sched_ext: Avoid UAF in scx_root_enable_workfn() init failure path")
    Signed-off-by: Sasha Levin <sashal@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

scripts/gdb: mm: cast untyped symbols in x86_page_ops [+ + +]

Author: Illia Ostapyshyn <illia@yshyn.com>
Date:   Mon Apr 27 16:24:47 2026 +0200

    scripts/gdb: mm: cast untyped symbols in x86_page_ops
    
    commit c416aee7e7d04fec2d2d30786b3c8393108b85d2 upstream.
    
    The symbols phys_base, _text, and _end, used in x86_page_ops are either
    defined in assembly or implicitly by the linker.  Thus, they lack type
    information and cause a conversion error after gdb.parse_and_eval.
    Explicitly cast these expressions to unsigned long.
    
    Link: https://lore.kernel.org/20260427142448.666117-2-illia@yshyn.com
    Fixes: 55f8b4518d14 ("scripts/gdb: implement x86_page_ops in mm.py")
    Signed-off-by: Illia Ostapyshyn <illia@yshyn.com>
    Cc: Florian Fainelli <florian.fainelli@broadcom.com>
    Cc: Jan Kiszka <jan.kiszka@siemens.com>
    Cc: Kieran Bingham <kbingham@kernel.org>
    Cc: Vlastimil Babka <vbabka@suse.com>
    Cc: Hao Li <hao.li@linux.dev>
    Cc: Harry Yoo <harry@kernel.org>
    Cc: Seongjun Hong <hsj0512@snu.ac.kr>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

scsi: isci: Fix use-after-free in device removal path [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Sun Apr 19 17:04:20 2026 -0400

    scsi: isci: Fix use-after-free in device removal path
    
    commit b52a8d52c3125ec9a93106ed816582368de34426 upstream.
    
    The ISCI completion tasklet is initialized in isci_host_alloc()
    (drivers/scsi/isci/init.c:496) and scheduled from both MSI-X and legacy
    interrupt handlers (drivers/scsi/isci/host.c:223,613).
    
    isci_host_deinit() stops the controller and waits for stop completion,
    but it never kills completion_tasklet before teardown continues. A
    top-of-function tasklet_kill() is not sufficient here: interrupts are
    only disabled when isci_host_stop_complete() runs, so until
    wait_for_stop() returns the IRQ handlers can still requeue the
    tasklet. The tasklet callback also re-enables interrupts after draining
    completions, so killing the tasklet before the source is quiesced leaves
    the same race open.
    
    Once wait_for_stop() returns, no further IRQ-driven scheduling can
    occur. Kill completion_tasklet there so teardown cannot race a queued
    tasklet running on a dead ihost. On remove or unload, the stale callback
    can otherwise dereference ihost and touch ihost->smu_registers after the
    host lifetime ends.
    
    A UML + KASAN analogue reproduced the failure class both with no
    tasklet_kill() and with tasklet_kill() placed before source quiesce, and
    stayed clean once the kill happened after quiescing the scheduling
    source.
    
    This mirrors commit f6ab594672d4 ("scsi: aic94xx: fix use-after-free in
    device removal path"), but ISCI needs the kill after wait_for_stop().
    
    Fixes: 6f231dda6808 ("isci: Intel(R) C600 Series Chipset Storage Control Unit Driver")
    Cc: stable@vger.kernel.org
    Assisted-by: Claude:claude-opus-4-7
    Assisted-by: Codex:gpt-5-4
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/20260419210420.2134639-1-michael.bommarito@gmail.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

scsi: sd: Fix return code handling in sd_spinup_disk() [+ + +]

Author: Mike Christie <michael.christie@oracle.com>
Date:   Mon May 11 12:53:17 2026 -0500

    scsi: sd: Fix return code handling in sd_spinup_disk()
    
    [ Upstream commit 6ea68a8dc7d2711504d944811981a5304af7d7a9 ]
    
    As found by smatch-ci, scsi_execute_cmd() can return negative or positve
    values so we should use a int instead of unsigned int.
    
    Fixes: b4d0c33a32c3 ("scsi: sd: Fix sshdr use in sd_spinup_disk")
    Reported-by: Dan Carpenter <error27@gmail.com>
    Closes: https://lore.kernel.org/linux-scsi/agFbI7E6JQwd3wGW@stanley.mountain/T/#u
    Signed-off-by: Mike Christie <michael.christie@oracle.com>
    Reviewed-by: Bart Van Assche <bvanassche@acm.org>
    Link: https://patch.msgid.link/20260511175317.114007-1-michael.christie@oracle.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

security/keys: fix missed RCU read section on lookup [+ + +]

Author: Linus Torvalds <torvalds@linux-foundation.org>
Date:   Thu May 28 11:45:41 2026 -0700

    security/keys: fix missed RCU read section on lookup
    
    commit 43a1e3744548e6fd85873e6fb43e293eb4010694 upstream.
    
    Nicholas Carlini reports that the keyring code calls assoc_array_find()
    in find_key_to_update() without holding the RCU read lock, while the
    assoc_array_gc() code really is designed around removing the node from
    the tree and then freeing it after an RCU grace-period.
    
    The regular key handling doesn't see this because holding the keyring
    semaphore hides any lifetime issues, but the persistent key handling
    uses a different model.
    
    Instead of extending the keyring locking, just do the simple RCU locking
    that the assoc_array was designed for.
    
    Reported-by: Nicholas Carlini <npc@anthropic.com>
    Cc: David Howells <dhowells@redhat.com>
    Cc: Jarkko Sakkinen <jarkko@kernel.org>
    Cc: Paul Moore <paul@paul-moore.com>
    Cc: James Morris James Morris <jmorris@namei.org>
    Cc: Serge E. Hallyn <serge@hallyn.com>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

selftests/mm: run_vmtests.sh: fix destructive tests invocation [+ + +]

Author: Luiz Capitulino <luizcap@redhat.com>
Date:   Mon Apr 27 12:03:51 2026 -0400

    selftests/mm: run_vmtests.sh: fix destructive tests invocation
    
    commit 3432cbb291aabf85f8af4b9d1ec37179168ff999 upstream.
    
    Destructive tests should be invoked with -d command-line option, but this
    won't work today since 'd' is missing in getopts command-line.  This
    commit fixes it.
    
    Link: https://lore.kernel.org/214fd9e4-5398-4c26-859e-c982c2e277c3@redhat.com
    Fixes: f16ff3b692ad ("selftests/mm: run_vmtests.sh: add missing tests")
    Signed-off-by: Luiz Capitulino <luizcap@redhat.com>
    Reviewed-by: Mike Rapoport (Microsoft) <rppt@kernel.org>
    Reviewed-by: SeongJae Park <sj@kernel.org>
    Cc: David Hildenbrand <david@kernel.org>
    Cc: Liam R. Howlett <liam@infradead.org>
    Cc: Lorenzo Stoakes <ljs@kernel.org>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Shuah Khan <shuah@kernel.org>
    Cc: Suren Baghdasaryan <surenb@google.com>
    Cc: Vlastimil Babka <vbabka@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

selftests: mptcp: drop nanoseconds width specifier [+ + +]

Author: Matthieu Baerts (NGI0) <matttbe@kernel.org>
Date:   Fri May 15 06:27:37 2026 +0200

    selftests: mptcp: drop nanoseconds width specifier
    
    commit 01ff78e4b3d98689184c52d97f9575dfbdc3b10f upstream.
    
    Using the format specifier +%s%3N with GNU date is honoured, and only
    prints 3 digits of the nanoseconds portion of the seconds since epoch,
    which corresponds to the milliseconds.
    
    The uutils implementation of date currently does not honour this, and
    always prints all 9 digits. This is a known issue [1], but can be worked
    around by adapting this test to use nanoseconds instead of microseconds,
    and then divide it by 1e6.
    
    This fix is similar to what has been done on systemd side [2], and it is
    needed to run the selftests on Ubuntu 26.04, containing uutils 0.8.0.
    
    Note that the Fixes tag is there even if this patch doesn't fix an issue
    in the kernel selftests, but it is useful for those using uutils 0.8.0.
    
    Fixes: 048d19d444be ("mptcp: add basic kselftest for mptcp")
    Cc: stable@vger.kernel.org
    Link: https://github.com/uutils/coreutils/issues/11658 [1]
    Link: https://github.com/systemd/systemd/pull/41627 [2]
    Signed-off-by: Matthieu Baerts (NGI0) <matttbe@kernel.org>
    Link: https://patch.msgid.link/20260515-net-mptcp-misc-fixes-7-1-rc4-v2-6-701e96419f2f@kernel.org
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

selftests: net: Fix checksums in xdp_native [+ + +]

Author: Nimrod Oren <noren@nvidia.com>
Date:   Wed May 20 18:39:28 2026 +0300

    selftests: net: Fix checksums in xdp_native
    
    [ Upstream commit dfc077043351a81887d1e4c9ac244e9243f3cbf2 ]
    
    Data adjustment cases failed with "Data exchange failed" when using IPv4
    because the program did not update the IP and UDP checksums in the IPv4
    branch. The issue was masked when both IPv4 and IPv6 were configured,
    since the test harness prefers IPv6.
    
    While here, generalize csum_fold_helper() to fold twice so it works for
    any 32-bit input.
    
    Fixes: 0b65cfcef9c5 ("selftests: drv-net: Test tail-adjustment support")
    Reviewed-by: Carolina Jubran <cjubran@nvidia.com>
    Reviewed-by: Dragos Tatulea <dtatulea@nvidia.com>
    Signed-off-by: Nimrod Oren <noren@nvidia.com>
    Link: https://patch.msgid.link/20260520153928.3371765-1-noren@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

selftests: ublk: cap nthreads to kernel's actual nr_hw_queues [+ + +]

Author: Ming Lei <tom.leiming@gmail.com>
Date:   Wed May 13 18:19:40 2026 +0800

    selftests: ublk: cap nthreads to kernel's actual nr_hw_queues
    
    [ Upstream commit 87d0740b7c4cc847be1b6f307ab6d8547cb1a726 ]
    
    dev->nthreads is derived from the user-requested queue count before the
    ADD command, but the kernel may reduce nr_hw_queues (capped to
    nr_cpu_ids). When the VM has fewer CPUs than requested queues, the
    daemon creates more handler threads than there are kernel queues.
    
    In non-batch mode, the extra threads access uninitialized queues
    (q_depth=0), submit zero io_uring SQEs, and block forever in
    io_cqring_wait. In batch mode, the extra threads cause similar hangs
    during device removal.
    
    In both cases, the stuck threads prevent the daemon from closing the
    char device, holding the last ublk_device reference and causing
    ublk_ctrl_del_dev() to hang in wait_event_interruptible().
    
    Fix by capping dev->nthreads to the kernel-returned nr_hw_queues after
    the ADD command completes. per_io_tasks mode is excluded because threads
    interleave across all queues, so nthreads > nr_hw_queues is valid.
    
    Fixes: abe54c160346 ("selftests: ublk: kublk: decouple ublk_queues from ublk server threads")
    Signed-off-by: Ming Lei <tom.leiming@gmail.com>
    Link: https://patch.msgid.link/20260513101941.1373998-1-tom.leiming@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

smb/server: promote S_DEL_ON_CLS to S_DEL_PENDING when close [+ + +]

Author: ChenXiaoSong <chenxiaosong@kylinos.cn>
Date:   Mon May 18 15:23:22 2026 +0000

    smb/server: promote S_DEL_ON_CLS to S_DEL_PENDING when close
    
    commit 4ec9c8e023c79f613fe4d5ad8cc737112efb2e44 upstream.
    
    Reproducer:
    
      1. server: systemctl start ksmbd
      2. client: mount -t cifs //${server_ip}/export /mnt
      3. client: C program: openat(AT_FDCWD, "/mnt", O_RDWR | O_TMPFILE, 0600)
    
    Do not treat `FILE_DELETE_ON_CLOSE_LE` as delete pending while files
    remain open.
    
    This patch fixes xfstests generic/004.
    
    Cc: stable@vger.kernel.org
    Link: https://chenxiaosong.com/en/smb-xfstests-generic-004.html
    Co-developed-by: Huiwen He <hehuiwen@kylinos.cn>
    Signed-off-by: Huiwen He <hehuiwen@kylinos.cn>
    Signed-off-by: ChenXiaoSong <chenxiaosong@kylinos.cn>
    Tested-by: Steve French <stfrench@microsoft.com>
    Acked-by: Namjae Jeon <linkinjeon@kernel.org>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: protect tc_count increment in smb2_find_smb_sess_tcon_unlocked() [+ + +]

Author: Henrique Carvalho <henrique.carvalho@suse.com>
Date:   Thu May 14 20:18:25 2026 -0300

    smb: client: protect tc_count increment in smb2_find_smb_sess_tcon_unlocked()
    
    commit 4d8690dace005a38e6dbde9ecce2da3ad85c7c41 upstream.
    
    Commit 96c4af418586 ("cifs: Fix locking usage for tcon fields")
    refactored cifs code to change cifs_tcp_ses_lock for tc_lock around
    tc_count changes.
    
    There was missing lock around tc_count increment inside
    smb2_find_smb_sess_tcon_unlocked().
    
    Cc: stable@vger.kernel.org
    Fixes: 96c4af418586 ("cifs: Fix locking usage for tcon fields")
    Reviewed-by: Shyam Prasad N <sprasad@microsoft.com>
    Signed-off-by: Henrique Carvalho <henrique.carvalho@suse.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: reject userspace cifs.spnego descriptions [+ + +]

Author: Asim Viladi Oglu Manizada <manizada@pm.me>
Date:   Sat May 16 21:15:39 2026 +0000

    smb: client: reject userspace cifs.spnego descriptions
    
    commit 3da1fdf4efbc490041eb4f836bf596201203f8f2 upstream.
    
    cifs.spnego key descriptions contain authority-bearing fields such as
    pid, uid, creduid, and upcall_target that cifs.upcall treats as
    kernel-originating inputs. However, userspace can also create keys of
    this type through request_key(2) or add_key(2), allowing those fields to
    be supplied without CIFS origin.
    
    Only accept cifs.spnego descriptions while CIFS is using its private
    spnego_cred to request the key.
    
    Fixes: f1d662a7d5e5 ("[CIFS] Add upcall files for cifs to use spnego/kerberos")
    Assisted-by: avom-custom-harness:gpt-5.5-qwen3.6-mod-mix
    Reviewed-by: David Howells <dhowells@redhat.com>
    Signed-off-by: Asim Viladi Oglu Manizada <manizada@pm.me>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: require net admin for CIFS SWN netlink [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Sun May 17 20:11:50 2026 -0400

    smb: client: require net admin for CIFS SWN netlink
    
    commit d1ebfce2c1d161186a82e77590bf7da2ea1bce91 upstream.
    
    CIFS_GENL_CMD_SWN_NOTIFY is the userspace witness-notify command.  The
    intended sender is the cifs.witness helper, but the generic-netlink
    operation currently has no capability flag, so any local process can send
    RESOURCE_CHANGE or CLIENT_MOVE notifications to the in-kernel witness
    handler.
    
    The same family exposes CIFS_GENL_MCGRP_SWN without multicast-group
    capability flags.  Register messages sent to that group include the witness
    registration id and, for NTLM-authenticated mounts, the username, domain,
    and password attributes copied from the CIFS session.  An unprivileged
    local process should not be able to join that group and receive those
    messages.
    
    Require CAP_NET_ADMIN for incoming SWN_NOTIFY commands with
    GENL_ADMIN_PERM, and require CAP_NET_ADMIN over the network namespace for
    joining the SWN multicast group with GENL_MCAST_CAP_NET_ADMIN.  The
    cifs.witness service runs with the privileges needed for both operations.
    
    Fixes: fed979a7e082 ("cifs: Set witness notification handler for messages from userspace daemon")
    Cc: stable@vger.kernel.org
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

smb: client: use data_len for SMB2 READ encrypted folioq copy [+ + +]

Author: Jeremy Erazo <mendozayt13@gmail.com>
Date:   Fri May 15 19:31:41 2026 +0000

    smb: client: use data_len for SMB2 READ encrypted folioq copy
    
    commit d4d76c9ee1997cc8c977a63f6c43551c253c1066 upstream.
    
    In handle_read_data() the encrypted/folioq branch
    (buf_len <= data_offset, reached via receive_encrypted_read for
    transform PDUs > CIFSMaxBufSize + MAX_HEADER_SIZE) copies the READ
    payload using buffer_len rather than data_len:
    
            rdata->result = cifs_copy_folioq_to_iter(buffer, buffer_len,
                                                     cur_off,
                                                     &rdata->subreq.io_iter);
            ...
            rdata->got_bytes = buffer_len;
    
    buffer_len comes from the SMB3 transform header OriginalMessageSize
    field (OriginalMessageSize - read_rsp_size); it represents the size
    of the decrypted message after the SMB2 header.  data_len comes from
    the SMB2 READ response DataLength field; it represents the actual
    READ payload size and may be smaller than buffer_len when the
    decrypted message contains padding or other trailing bytes after the
    READ payload.  The existing check `data_len > buffer_len - pad_len`
    only enforces an upper bound, so a server that emits
    OriginalMessageSize larger than read_rsp_size + pad_len + data_len
    passes the check and the kernel copies buffer_len bytes per response,
    ignoring the server-asserted DataLength.
    
    Two observable failures with a crafted server (DataLength=4,
    buffer_len=20000):
    
      - the kernel returns 20000 bytes per sub-request to userspace and
        sets got_bytes = buffer_len, even though the response claimed
        only 4 bytes of payload;
    
      - on a partial netfs sub-request whose iterator is sized to
        data_len, the over-large copy_folio_to_iter() short-reads,
        cifs_copy_folioq_to_iter() returns -EIO via the n != len path,
        and the entire netfs read collapses to -EIO even though the
        leading sub-requests succeeded.
    
    Use data_len for the copy length and for got_bytes so the kernel
    honours the server-asserted READ payload size.  For well-formed
    servers (where buffer_len == pad_len + data_len) the change is
    behaviour-equivalent.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Jeremy Erazo <mendozayt13@gmail.com>
    Acked-by: David Howells <dhowells@redhat.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

spi: amd: Set correct bus number in ACPI probe path [+ + +]

Author: Krishnamoorthi M <krishnamoorthi.m@amd.com>
Date:   Thu May 7 23:30:51 2026 +0530

    spi: amd: Set correct bus number in ACPI probe path
    
    commit 422bd00b71ab42163aa3b8f8370276fe4c1581e7 upstream.
    
    On platforms where the HID2 SPI controller (AMDI0063) is enumerated via
    ACPI instead of PCI, amd_spi_probe() unconditionally sets bus_num to 0,
    while the PCI probe path assigns bus_num 2 for HID2 controller.
    
    Align the ACPI probe path to use the same bus number so that userspace
    and SPI client drivers see a consistent bus assignment regardless of the
    enumeration method.
    
    Fixes: b644c2776652 ("spi: spi_amd: Add PCI-based driver for AMD HID2 SPI controller")
    Cc: stable@vger.kernel.org # v6.16+
    Signed-off-by: Krishnamoorthi M <krishnamoorthi.m@amd.com>
    Link: https://patch.msgid.link/20260507180051.4158674-1-krishnamoorthi.m@amd.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

spi: ep93xx: fix error pointer deref after DMA setup failure [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Tue May 12 09:48:49 2026 +0200

    spi: ep93xx: fix error pointer deref after DMA setup failure
    
    commit 5e121a81667a83e9a01d62b429e340f5a4a84abc upstream.
    
    The driver falls back to PIO mode if DMA setup fails during probe.
    
    Make sure to the clear the DMA channel pointers on setup failure to
    avoid dereferencing an error pointer on later probe errors or driver
    unbind.
    
    This issue was flagged by Sashiko when reviewing a devres allocation
    conversion patch.
    
    Fixes: e79e7c2df627 ("spi: ep93xx: add DT support for Cirrus EP93xx")
    Link: https://sashiko.dev/#/patchset/20260429091333.165363-1-johan%40kernel.org?part=10
    Cc: stable@vger.kernel.org      # 6.12
    Cc: Nikita Shubin <nikita.shubin@maquefel.me>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Acked-by: Nikita Shubin <nikita.shubin@maquefel.me>
    Link: https://patch.msgid.link/20260512074849.915143-1-johan@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

spi: mtk-snfi: Fix resource leak in mtk_snand_read_page_cache() [+ + +]

Author: Felix Gu <ustc.gu@gmail.com>
Date:   Sun May 10 01:55:37 2026 +0800

    spi: mtk-snfi: Fix resource leak in mtk_snand_read_page_cache()
    
    [ Upstream commit 496ba79b9496b8b3747cbc764ebd33ee7325e806 ]
    
    When DMA read times out in mtk_snand_read_page_cache(), the original code
    erroneously jumped to cleanup label which skips DMA unmapping and ECC
    disable, causing a resource leak.
    
    Fixes: 764f1b748164 ("spi: add driver for MTK SPI NAND Flash Interface")
    Signed-off-by: Felix Gu <ustc.gu@gmail.com>
    Link: https://patch.msgid.link/20260510-snfi-v1-1-bc375cf1af8e@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

spi: qup: fix error pointer deref after DMA setup failure [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Tue May 12 09:43:34 2026 +0200

    spi: qup: fix error pointer deref after DMA setup failure
    
    commit a7e8f3efd50a165ba0189f6dc57f7e51a7d149db upstream.
    
    The driver falls back to PIO mode if DMA setup fails during probe.
    
    Make sure to the clear the DMA channel pointers on setup failure to
    avoid dereferencing an error pointer (or attempting to release a channel
    a second time) on later probe errors or driver unbind.
    
    This issue was flagged by Sashiko when reviewing a devres allocation
    conversion patch.
    
    Fixes: 612762e82ae6 ("spi: qup: Add DMA capabilities")
    Link: https://sashiko.dev/#/patchset/20260505072909.618363-1-johan%40kernel.org?part=4
    Cc: stable@vger.kernel.org      # 4.1
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Link: https://patch.msgid.link/20260512074334.914735-1-johan@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

spi: sprd: fix error pointer deref after DMA setup failure [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Tue May 12 09:47:33 2026 +0200

    spi: sprd: fix error pointer deref after DMA setup failure
    
    commit 3d67fffb74267772d461c02c67f1eff893ad547d upstream.
    
    The driver falls back to PIO mode if DMA setup fails during probe.
    
    Make sure to check the dma.enabled flag before trying to release the DMA
    channels also on late probe errors to avoid dereferencing an error
    pointer (or attempting to release a channel a second time).
    
    This issue was flagged by Sashiko when reviewing a devres allocation
    conversion patch.
    
    Fixes: 386119bc7be9 ("spi: sprd: spi: sprd: Add DMA mode support")
    Link: https://sashiko.dev/#/patchset/20260505072909.618363-1-johan%40kernel.org?part=10
    Cc: stable@vger.kernel.org      # 5.1
    Cc: Lanqing Liu <lanqing.liu@unisoc.com>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Link: https://patch.msgid.link/20260512074733.915029-1-johan@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

spi: ti-qspi: fix use-after-free after DMA setup failure [+ + +]

Author: Johan Hovold <johan@kernel.org>
Date:   Tue May 12 09:48:09 2026 +0200

    spi: ti-qspi: fix use-after-free after DMA setup failure
    
    commit ea6ec3343e05f7937a53eb6d7617b3abdb4abc19 upstream.
    
    The driver falls back to PIO mode if DMA setup fails during probe.
    
    Make sure to clear the DMA channel pointer also if buffer allocation
    fails to avoid passing a pointer to the released channel to the DMA
    engine (or trying to free the channel a second time on late probe errors
    or driver unbind).
    
    This issue was flagged by Sashiko when reviewing a devres allocation
    conversion patch.
    
    Fixes: c687c46e9e45 ("spi: spi-ti-qspi: Use bounce buffer if read buffer is not DMA'ble")
    Link: https://sashiko.dev/#/patchset/20260505072909.618363-1-johan%40kernel.org?part=17
    Cc: stable@vger.kernel.org      # 4.12
    Cc: Vignesh R <vigneshr@ti.com>
    Signed-off-by: Johan Hovold <johan@kernel.org>
    Link: https://patch.msgid.link/20260512074809.915084-1-johan@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

srcu: Don't queue workqueue handlers to never-online CPUs [+ + +]

Author: Paul E. McKenney <paulmck@kernel.org>
Date:   Mon May 11 19:54:41 2026 +0200

    srcu: Don't queue workqueue handlers to never-online CPUs
    
    [ Upstream commit 593889c401426004bd0ea0f6d4fcece728b03420 ]
    
    While an srcu_struct structure is in the midst of switching from CPU-0
    to all-CPUs state, it can attempt to invoke callbacks for CPUs that
    have never been online.  Worse yet, it can attempt in invoke callbacks
    for CPUs that never will be online, even including imaginary CPUs not in
    cpu_possible_mask.  This can cause hangs on s390, which is not set up to
    deal with workqueue handlers being scheduled on such CPUs.  This commit
    therefore causes Tree SRCU to refrain from queueing workqueue handlers
    on CPUs that have not yet (and might never) come online.
    
    Because callbacks are not invoked on CPUs that have not been
    online, it is an error to invoke call_srcu(), synchronize_srcu(), or
    synchronize_srcu_expedited() on a CPU that is not yet fully online.
    However, it turns out to be less code to redirect the callbacks
    from too-early invocations of call_srcu() than to warn about such
    invocations.  This commit therefore also redirects callbacks queued on
    not-yet-fully-online CPUs to the boot CPU.
    
    Reported-by: Vasily Gorbik <gor@linux.ibm.com>
    Fixes: 61bbcfb50514 ("srcu: Push srcu_node allocation to GP when non-preemptible")
    Signed-off-by: Paul E. McKenney <paulmck@kernel.org>
    Tested-by: Vasily Gorbik <gor@linux.ibm.com>
    Tested-by: Samir <samir@linux.ibm.com>
    Reviewed-by: Shrikanth Hegde <sshegde@linux.ibm.com>
    Cc: Tejun Heo <tj@kernel.org>
    Signed-off-by: Uladzislau Rezki (Sony) <urezki@gmail.com>
    Signed-off-by: Boqun Feng <boqun@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

sysfs: don't remove existing directory on update failure [+ + +]

Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Wed May 20 15:05:04 2026 +0200

    sysfs: don't remove existing directory on update failure
    
    commit 237557b8a81ab948e8332f7c0058e758f081c0a3 upstream.
    
    When sysfs_update_group() is called for a named group and create_files()
    fails (e.g. -ENOMEM), internal_create_group() calls kernfs_remove(kn) on
    the group directory.  In the update path, kn was obtained via
    kernfs_find_and_get() and refers to a directory that already existed
    before this call.  Removing it silently destroys a sysfs group that the
    caller did not create.
    
    Only remove the directory if we created it ourselves.  On update failure
    the directory remains as it is left empty by remove_files() inside
    create_files(), but can be repopulated by a retry.
    
    Cc: Rajat Jain <rajatja@google.com>
    Fixes: c855cf2759d2 ("sysfs: Fix internal_create_group() for named group updates")
    Cc: stable <stable@kernel.org>
    Assisted-by: gkh_clanker_t1000
    Reviewed-by: Rafael J. Wysocki (Intel) <rafael@kernel.org>
    Reviewed-by: Danilo Krummrich <dakr@kernel.org>
    Link: https://patch.msgid.link/2026052003-uniquely-hastily-c093@gregkh
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

tap: fix stack info leak in tap_ioctl() SIOCGIFHWADDR [+ + +]

Author: Weiming Shi <bestswngs@gmail.com>
Date:   Wed May 20 00:57:38 2026 -0700

    tap: fix stack info leak in tap_ioctl() SIOCGIFHWADDR
    
    [ Upstream commit bddc09212c24934643bd44fc794748d2bbb3b6cd ]
    
    In the SIOCGIFHWADDR path, tap_ioctl() copies 16 bytes of an
    uninitialised on-stack struct sockaddr_storage to userspace via
    ifr_hwaddr, but netif_get_mac_address() only writes sa_family and
    dev->addr_len (6 for Ethernet) bytes, leaving sa_data[6..13] uninitialised.
    
    Those 8 trailing bytes leak kernel stack contents; SIOCGIFHWADDR on a
    macvtap chardev returns kernel .text and direct-map pointers, defeating
    KASLR.
    
    Initialise ss at declaration.
    
    Fixes: 3b23a32a6321 ("net: fix dev_ifsioc_locked() race condition")
    Reported-by: Xiang Mei <xmei5@asu.edu>
    Signed-off-by: Weiming Shi <bestswngs@gmail.com>
    Reviewed-by: Willem de Bruijn <willemb@google.com>
    Link: https://patch.msgid.link/20260520075736.3415676-3-bestswngs@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tcp: Fix imbalanced icsk_accept_queue count. [+ + +]

Author: Kuniyuki Iwashima <kuniyu@google.com>
Date:   Wed May 6 03:59:19 2026 +0000

    tcp: Fix imbalanced icsk_accept_queue count.
    
    [ Upstream commit 7eca3292cac7c26dad4c236f51ba225c39a0523f ]
    
    When TCP socket migration happens in reqsk_timer_handler(),
    @sk_listener will be updated with the new listener.
    
    When we call __inet_csk_reqsk_queue_drop(), the listener must
    be the one stored in req->rsk_listener.
    
    The cited commit accidentally replaced oreq->rsk_listener with
    sk_listener, leading to imbalanced icsk_accept_queue count.
    
    Let's pass the correct listener to __inet_csk_reqsk_queue_drop().
    
    Fixes: e8c526f2bdf1 ("tcp/dccp: Don't use timer_pending() in reqsk_queue_unlink().")
    Reported-by: Damiano Melotti <melotti@google.com>
    Signed-off-by: Kuniyuki Iwashima <kuniyu@google.com>
    Link: https://patch.msgid.link/20260506035954.1563147-3-kuniyu@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tcp: Fix out-of-bounds access for twsk in tcp_ao_established_key(). [+ + +]

Author: Kuniyuki Iwashima <kuniyu@google.com>
Date:   Fri May 8 12:08:46 2026 +0000

    tcp: Fix out-of-bounds access for twsk in tcp_ao_established_key().
    
    [ Upstream commit 03cb001ef87b3f8d859cf7f96329acf3d6235d29 ]
    
    lockdep_sock_is_held() was added in tcp_ao_established_key()
    by the cited commit.
    
    It can be called from tcp_v[46]_timewait_ack() with twsk.
    
    Since it does not have sk->sk_lock, the lockdep annotation
    results in out-of-bound access.
    
      $ pahole -C tcp_timewait_sock vmlinux | grep size
            /* size: 288, cachelines: 5, members: 8 */
      $ pahole -C sock vmlinux | grep sk_lock
            socket_lock_t              sk_lock;              /*   440   192 */
    
    Let's not use lockdep_sock_is_held() for TCP_TIME_WAIT.
    
    Fixes: 6b2d11e2d8fc ("net/tcp: Add missing lockdep annotations for TCP-AO hlist traversals")
    Reported-by: Damiano Melotti <melotti@google.com>
    Signed-off-by: Kuniyuki Iwashima <kuniyu@google.com>
    Reviewed-by: Eric Dumazet <edumazet@google.com>
    Link: https://patch.msgid.link/20260508120853.4098365-1-kuniyu@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tcp: fix stale per-CPU tcp_tw_isn leak enabling ISN prediction [+ + +]

Author: Eric Dumazet <edumazet@google.com>
Date:   Tue May 19 08:46:11 2026 +0000

    tcp: fix stale per-CPU tcp_tw_isn leak enabling ISN prediction
    
    [ Upstream commit 1bbf0ced1d9db73ac7893c2187f3459288603e0d ]
    
    Blamed commit moved the TIME_WAIT-derived ISN from the skb control
    block to a per-CPU variable, assuming the value would always be consumed
    by tcp_conn_request() for the same packet that wrote it. That assumption
    is violated by multiple drop paths between the producer
    (__this_cpu_write(tcp_tw_isn, isn) in tcp_v{4,6}_rcv()) and the consumer
    (tcp_conn_request()):
    
     - min_ttl / min_hopcount check
     - xfrm policy check
     - tcp_inbound_hash() MD5/AO mismatch
     - tcp_filter() eBPF/SO_ATTACH_FILTER drop
     - th->syn && th->fin discard in tcp_rcv_state_process() TCP_LISTEN
     - psp_sk_rx_policy_check() in tcp_v{4,6}_do_rcv()
     - tcp_checksum_complete() in tcp_v{4,6}_do_rcv()
     - tcp_v{4,6}_cookie_check() returning NULL
    
    When a packet is dropped on any of these paths, tcp_tw_isn is left set.
    
    The next SYN processed on the same CPU then consumes the non zero value in
    tcp_conn_request(), receiving a potentially predictable ISN.
    
    This patch moves back tcp_tw_isn to skb->cb[], getting rid of the per-cpu
    variable.
    
    Note that tcp_v{4,6}_fill_cb() do not set it.
    
    Very litle impact on overall code size/complexity:
    
    $ scripts/bloat-o-meter -t vmlinux.old vmlinux.new
    add/remove: 0/0 grow/shrink: 2/1 up/down: 8/-15 (-7)
    Function                                     old     new   delta
    tcp_v6_rcv                                  3038    3042      +4
    tcp_v4_rcv                                  3035    3039      +4
    tcp_conn_request                            2938    2923     -15
    Total: Before=24436060, After=24436053, chg -0.00%
    
    Fixes: 41eecbd712b7 ("tcp: replace TCP_SKB_CB(skb)->tcp_tw_isn with a per-cpu field")
    Reported-by: Chris Mason <clm@meta.com>
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Reviewed-by: Kuniyuki Iwashima <kuniyu@google.com>
    Link: https://patch.msgid.link/20260519084611.2485277-1-edumazet@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

test_kprobes: clear kprobes between test runs [+ + +]

Author: Martin Kaiser <martin@kaiser.cx>
Date:   Fri May 8 09:56:36 2026 +0900

    test_kprobes: clear kprobes between test runs
    
    [ Upstream commit ef5581bb30efb939cc2bf093475c6cc85258e5cd ]
    
    Running the kprobes sanity tests twice makes all tests fail and
    eventually crashes the kernel.
    
    [root@martin-riscv-1 ~]# echo 1 > /sys/kernel/debug/kunit/kprobes_test/run
    ...
       # Totals: pass:5 fail:0 skip:0 total:5
       ok 1 kprobes_test
    [root@martin-riscv-1 ~]# echo 1 > /sys/kernel/debug/kunit/kprobes_test/run
    ...
      # test_kprobe: EXPECTATION FAILED at lib/tests/test_kprobes.c:64
      Expected 0 == register_kprobe(&kp), but
          register_kprobe(&kp) == -22 (0xffffffffffffffea)
    ...
      Unable to handle kernel paging request ...
    
    The testsuite defines several kprobes and kretprobes as static variables
    that are preserved across test runs.
    
    After register_kprobe and unregister_kprobe, a kprobe contains some
    leftover data that must be cleared before the kprobe can be registered
    again. The tests are setting symbol_name to define the probe location.
    Address and flags must be cleared.
    
    The existing code clears some of the probes between subsequent tests, but
    not between two test runs. The leftover data from a previous test run
    makes the registrations fail in the next run.
    
    Move the cleanups for all kprobes into kprobes_test_init, this function
    is called before each single test (including the first test of a test
    run).
    
    Link: https://lore.kernel.org/all/20260507134615.1010905-1-martin@kaiser.cx/
    
    Fixes: e44e81c5b90f ("kprobes: convert tests to kunit")
    Signed-off-by: Martin Kaiser <martin@kaiser.cx>
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tls: Preserve sk_err across recvmsg() when data has been copied [+ + +]

Author: Chuck Lever <chuck.lever@oracle.com>
Date:   Wed May 13 08:58:25 2026 -0400

    tls: Preserve sk_err across recvmsg() when data has been copied
    
    [ Upstream commit f508262ae9f21fe0e6c0749948b9dc7dd5a62a70 ]
    
    The sk_err check in tls_rx_rec_wait() consumes the error via
    sock_error(), which clears sk_err atomically. When the caller
    (tls_sw_recvmsg, tls_sw_splice_read, or tls_sw_read_sock) already
    has bytes copied to userspace, it returns those bytes and discards
    the error from this call. sk_err is now zero on the socket, so the
    next read syscall observes only RCV_SHUTDOWN and reports a clean
    EOF instead of the actual error (typically -ECONNRESET).
    
    The race is reachable when tls_read_flush_backlog()'s periodic
    sk_flush_backlog() triggers tcp_reset() in the middle of a
    multi-record read.
    
    Pass a has_copied flag to tls_rx_rec_wait(). When has_copied is
    false, consume sk_err via sock_error() as before. When has_copied
    is true, report the error from READ_ONCE() but leave sk_err set:
    the caller returns the byte count and discards the err from this
    call, and the next read syscall surfaces the preserved sk_err. This
    mirrors the tcp_recvmsg() preserve-and-surface pattern.
    
    The decrypt-abort path is unaffected: tls_err_abort() raises
    sk_err to EBADMSG after tls_rx_rec_wait() returns, and nothing
    on the caller's return path consumes it, so the EBADMSG surfaces
    on the next read.
    
    tls_sw_splice_read() passes has_copied=false: it processes
    one record per call, so no bytes have been copied within the
    function when tls_rx_rec_wait() runs. A reset that arrives
    between iterations of splice_direct_to_actor() (the sendfile()
    path) is still consumed by sock_error() in the later call, and the
    outer loop returns the prior iterations' byte count and drops the
    error. tcp_splice_read() exhibits the same pattern at the iteration
    boundary; addressing it belongs at the splice_direct_to_actor()
    layer and is out of scope here.
    
    Fixes: c46b01839f7a ("tls: rx: periodically flush socket backlog")
    Suggested-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Link: https://patch.msgid.link/20260513125825.205189-1-cel@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tracing: Avoid NULL return from hist_field_name() on truncation [+ + +]

Author: David Carlier <devnexen@gmail.com>
Date:   Fri May 8 20:57:47 2026 +0100

    tracing: Avoid NULL return from hist_field_name() on truncation
    
    [ Upstream commit 576ec047d20b368b43c4d5db98c4f2e0f3c101ec ]
    
    hist_field_name() returns "" everywhere except the fully-qualified
    VAR_REF/EXPR case, where snprintf() truncation returns NULL early
    and bypasses the bottom NULL->"" guard. Callers don't expect NULL:
    strcat(expr, hist_field_name(field, 0)) at trace_events_hist.c:1758
    and the strcmp() in the sort-key match loop at :4804 both deref it.
    
    system and event_name are bounded by MAX_EVENT_NAME_LEN, but the
    field name on a VAR_REF is kstrdup'd from a histogram variable
    name parsed out of the trigger string and has no length cap, so
    a long enough var name in a fully qualified reference can reach
    the truncation path.
    
    Keep the length check but leave field_name as "" on overflow.
    
    Link: https://patch.msgid.link/20260508195747.25492-1-devnexen@gmail.com
    Fixes: 5ec1d1e97de1 ("tracing: Rebuild full_name on each hist_field_name() call")
    Signed-off-by: David Carlier <devnexen@gmail.com>
    Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

tracing: Do not call map->ops->elt_free() if elt_alloc() fails [+ + +]

Author: Masami Hiramatsu (Google) <mhiramat@kernel.org>
Date:   Thu May 21 13:49:14 2026 +0900

    tracing: Do not call map->ops->elt_free() if elt_alloc() fails
    
    commit 8f0f5c4fb9df0e19a341e0c6ed8dc4fda9124f03 upstream.
    
    In paths where tracing_map_elt_alloc() failed to allocate objects,
    the map->ops->elt_alloc() call was never successful. In this case,
    map->ops->elt_free() should not be called.
    
    Link: https://sashiko.dev/#/patchset/20260520223101.34710-1-rosenp%40gmail.com
    
    Cc: stable@vger.kernel.org
    Cc: Tom Zanussi <tom.zanussi@linux.intel.com>
    Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
    Cc: Rosen Penev <rosenp@gmail.com>
    Reported-by: Sashiko <sashiko-bot@kernel.org>
    Fixes: 2734b629525a ("tracing: Add per-element variable support to tracing_map")
    Link: https://patch.msgid.link/177933895460.108746.5396070821443932634.stgit@devnote2
    Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>
    Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

ublk: reject max_sectors smaller than PAGE_SECTORS in parameter validation [+ + +]

Author: Ming Lei <tom.leiming@gmail.com>
Date:   Sun May 10 22:48:43 2026 +0800

    ublk: reject max_sectors smaller than PAGE_SECTORS in parameter validation
    
    [ Upstream commit 1860c2f85922917d8a46f16a6f4bd2298ffa0fb5 ]
    
    blk_validate_limits() requires max_hw_sectors >= PAGE_SECTORS and fires
    a WARN_ON_ONCE if this invariant is violated. ublk_validate_params()
    only checked the upper bound of max_sectors against max_io_buf_bytes,
    allowing userspace to pass small values (including zero) that trigger
    the warning when blk_mq_alloc_disk() is called from
    ublk_ctrl_start_dev().
    
    Before 494ea040bcb5, ublk used blk_queue_max_hw_sectors() which silently
    clamped small values up to PAGE_SECTORS. The conversion to passing
    queue_limits directly to blk_mq_alloc_disk() lost that clamping and now
    hits blk_validate_limits()'s WARN_ON_ONCE instead.
    
    Validate that max_sectors is at least PAGE_SECTORS in
    ublk_validate_params() so invalid values are rejected early with
    -EINVAL instead of reaching the block layer.
    
    Fixes: 494ea040bcb5 ("ublk: pass queue_limits to blk_mq_alloc_disk")
    Signed-off-by: Ming Lei <tom.leiming@gmail.com>
    Link: https://patch.msgid.link/20260510144843.769031-1-tom.leiming@gmail.com
    Signed-off-by: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

udp: Fix UDP length on last GSO_PARTIAL segment [+ + +]

Author: Gal Pressman <gal@nvidia.com>
Date:   Mon May 18 09:22:50 2026 +0300

    udp: Fix UDP length on last GSO_PARTIAL segment
    
    [ Upstream commit 78effd896eee11ac9db9bcbb53e7bbcad96073d7 ]
    
    Following the cited commit, __udp_gso_segment() writes single MSS length
    in the UDP header.
    The cited patch doesn't account for the fact that the last segment could
    be a GSO skb by itself. This could happen when the size of the packet is
    a multiple of MSS, hence the first segment is also the last one (there
    is no need for a remainder skb).
    
    When the post-loop segment is a GSO skb, assign the single MSS length in
    the UDP header.
    
    Fixes: b10b446ce7ad ("udp: gso: Use single MSS length in UDP header for GSO_PARTIAL")
    Reported-by: Matthew Schwartz <matthew.schwartz@linux.dev>
    Closes: https://lore.kernel.org/all/6c3fb15e-711d-4b8d-b152-e03d9b05293f@linux.dev/
    Tested-by: Matthew Schwartz <matthew.schwartz@linux.dev>
    Reviewed-by: Dragos Tatulea <dtatulea@nvidia.com>
    Signed-off-by: Gal Pressman <gal@nvidia.com>
    Link: https://patch.msgid.link/20260518062250.3019914-3-gal@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

udp: gso: Fix handling checksum in __udp_gso_segment [+ + +]

Author: Alice Mikityanska <alice@isovalent.com>
Date:   Mon May 18 09:22:49 2026 +0300

    udp: gso: Fix handling checksum in __udp_gso_segment
    
    [ Upstream commit 5f17ae0f595aeb560155ce98edbe44d3eacc7e40 ]
    
    The cited commit started using msslen for uh->len, but still uses newlen
    to adjust uh->check. Although the checksum is ignored in most cases due
    to the hardware offload, __udp_gso_segment attempts to maintain the
    correct one. Fix uh->check and adjust it by the right value.
    
    Additionally, after the fix, newlen becomes assigned and unused before
    the loop. The code can be simplified a bit if mss adjustment is dropped,
    so that newlen becomes equal to msslen before the loop, and msslen can
    be also dropped, saving a few lines of code.
    
    This brings us back to one variable, drops an unneeded arithmetic for
    mss, and fixes the UDP checksum.
    
    Fixes: b10b446ce7ad ("udp: gso: Use single MSS length in UDP header for GSO_PARTIAL")
    Signed-off-by: Alice Mikityanska <alice@isovalent.com>
    Reviewed-by: Willem de Bruijn <willemb@google.com>
    Signed-off-by: Gal Pressman <gal@nvidia.com>
    Link: https://patch.msgid.link/20260518062250.3019914-2-gal@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

vfio/pci: Check BAR resources before exporting a DMABUF [+ + +]

Author: Matt Evans <mattev@meta.com>
Date:   Mon May 11 07:58:24 2026 -0700

    vfio/pci: Check BAR resources before exporting a DMABUF
    
    [ Upstream commit 702809dabdecca807bdd50cfdcc1c980feb2ba62 ]
    
    A DMABUF exports access to BAR resources and, although they are
    requested at startup time, we need to ensure they really were reserved
    before exporting.  Otherwise, it's possible to access unreserved
    resources through the export.
    
    Add a check to the DMABUF-creation path.
    
    Fixes: 5d74781ebc86c ("vfio/pci: Add dma-buf export support for MMIO regions")
    Signed-off-by: Matt Evans <mattev@meta.com>
    Link: https://lore.kernel.org/r/20260511145829.2993601-3-mattev@meta.com
    Signed-off-by: Alex Williamson <alex@shazbot.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

virt: sev-guest: Explicitly leak pages in unknown state [+ + +]

Author: Carlos López <clopez@suse.de>
Date:   Tue May 12 12:00:42 2026 +0200

    virt: sev-guest: Explicitly leak pages in unknown state
    
    commit fd948c3f96b18ff9ba7d3e8eae13d196593e1aaf upstream.
    
    When set_memory_{encrypted,decrypted}() fail, the user cannot know at which
    point the function failed, meaning that the pages are left in an unknown state
    from the point of view of the caller.
    
    Since the pages may be left in an unencrypted state, they are not suitable for
    general use, and cannot be returned safely to the buddy allocator. Avoid the
    issue by never freeing the pages, and then do the proper accounting by calling
    snp_leak_pages().
    
    Fixes: 3e385c0d6ce8 ("virt: sev-guest: Move SNP Guest Request data pages handling under snp_cmd_mutex")
    Signed-off-by: Carlos López <clopez@suse.de>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Cc: stable@kernel.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

vsock/virtio: fix zerocopy completion for multi-skb sends [+ + +]

Author: Stefano Garzarella <sgarzare@redhat.com>
Date:   Thu May 14 11:29:48 2026 +0200

    vsock/virtio: fix zerocopy completion for multi-skb sends
    
    [ Upstream commit ae38d9179190a956e2a87a69ef1dd6f451b51c4d ]
    
    When a large message is fragmented into multiple skbs, the zerocopy
    uarg is only allocated and attached to the last skb in the loop.
    Non-final skbs carry pinned user pages with no completion tracking,
    so the kernel has no way to notify userspace when those pages are safe
    to reuse. If the loop breaks early the uarg is never allocated at all,
    leaking pinned pages with no completion notification.
    
    Fix this by following the approach used by TCP: allocate the zerocopy
    uarg (if not provided by the caller) before the send loop and attach
    it to every skb via skb_zcopy_set(), which takes a reference per skb.
    Each skb's completion properly decrements the refcount, and the
    notification only fires after the last skb is freed.
    On failure, if no data was sent, the uarg is cleanly aborted via
    net_zcopy_put_abort().
    
    This issue was initially discovered by sashiko while reviewing commit
    1cb36e252211 ("vsock/virtio: fix MSG_ZEROCOPY pinned-pages accounting")
    but was pre-existing.
    
    Fixes: 581512a6dc93 ("vsock/virtio: MSG_ZEROCOPY flag support")
    Closes: https://sashiko.dev/#/patchset/20260420132051.217589-1-sgarzare%40redhat.com
    Reported-by: Maher Azzouzi <maherazz04@gmail.com>
    Signed-off-by: Stefano Garzarella <sgarzare@redhat.com>
    Acked-by: Michael S. Tsirkin <mst@redhat.com>
    Acked-by: Arseniy Krasnov <avkrasnov@salutedevices.com>
    Link: https://patch.msgid.link/20260514092948.268720-1-sgarzare@redhat.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

vsock/virtio: reset connection on receiving queue overflow [+ + +]

Author: Stefano Garzarella <sgarzare@redhat.com>
Date:   Mon May 18 11:06:55 2026 +0200

    vsock/virtio: reset connection on receiving queue overflow
    
    commit a4f0b001782b21663d10df983b4b208195bec66c upstream.
    
    When there is no more space to queue an incoming packet, the packet is
    silently dropped. This causes data loss without any notification to
    either peer, since there is no retransmission.
    
    Under normal circumstances, this should never happen. However, it could
    happen if the other peer doesn't respect the credit, or if the skb
    overhead, which we recently began to take into account with commit
    059b7dbd20a6 ("vsock/virtio: fix potential unbounded skb queue"),
    is too high.
    
    Fix this by resetting the connection and setting the local socket error
    to ENOBUFS when virtio_transport_recv_enqueue() can no longer queue a
    packet, so both peers are explicitly notified of the failure rather than
    silently losing data.
    
    Fixes: ae6fcfbf5f03 ("vsock/virtio: discard packets if credit is not respected")
    Cc: stable@vger.kernel.org
    Signed-off-by: Stefano Garzarella <sgarzare@redhat.com>
    Link: https://patch.msgid.link/20260518090656.134588-2-sgarzare@redhat.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

vsock/vmci: fix UAF when peer resets connection during handshake [+ + +]

Author: Minh Nguyen <minhnguyen.080505@gmail.com>
Date:   Tue May 19 17:23:10 2026 +0700

    vsock/vmci: fix UAF when peer resets connection during handshake
    
    commit 99e22ddf4edb63dc8382bc028af928056d3450cf upstream.
    
    vmci_transport_recv_connecting_server() returned err = 0 for a peer
    RST in its default switch arm:
    
            err = pkt->type == VMCI_TRANSPORT_PACKET_TYPE_RST ? 0 : -EINVAL;
    
    That made vmci_transport_recv_listen() skip vsock_remove_pending(),
    leaving the pending socket on the listener's pending_links with
    sk_state = TCP_CLOSE while destroy: still dropped the explicit
    reference taken before schedule_delayed_work().
    
    One second later vsock_pending_work() observed is_pending=true and
    performed full cleanup: vsock_remove_pending() then the two trailing
    sock_put(sk) calls -- the first reached refcount 0 and __sk_freed
    the socket, and the second wrote into the freed object:
    
      BUG: KASAN: slab-use-after-free in refcount_warn_saturate
      Write of size 4 at addr ffff88800b1cac80 by task kworker
      Workqueue: events vsock_pending_work
    
    Treat peer RST like any other unexpected packet type (err = -EINVAL).
    All destroy: arms now return err < 0, so vmci_transport_recv_listen()
    removes pending from pending_links synchronously and
    vsock_pending_work() takes the is_pending=false / !rejected branch,
    dropping only its own work reference.  This also closes the
    multi-packet race Sashiko reported on v2: pending is removed from
    the list before any subsequent packet can find it.
    
    The pre-existing sk_acceptq_removed() gap on the err < 0 path of
    vmci_transport_recv_listen() that Sashiko also noted is not
    introduced or changed by this patch.
    
    Tested on lts-6.12.79 with KASAN: 52/100 unpatched -> 0/100 patched.
    
    Fixes: d021c344051a ("VSOCK: Introduce VM Sockets")
    Cc: stable@vger.kernel.org
    Signed-off-by: Minh Nguyen <minhnguyen.080505@gmail.com>
    Acked-by: Bryan Tan <bryan-bt.tan@broadcom.com>
    Link: https://patch.msgid.link/20260519102310.237181-1-minhnguyen.080505@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: ath10k: skip WMI and beacon transmission when device is wedged [+ + +]

Author: Kang Yang <kang.yang@oss.qualcomm.com>
Date:   Tue Apr 28 14:17:37 2026 +0800

    wifi: ath10k: skip WMI and beacon transmission when device is wedged
    
    [ Upstream commit 54a5b38e4396530e5b2f12b54d3844e860ab6784 ]
    
    In ath10k_wmi_cmd_send(), the current code detects ATH10K_STATE_WEDGED
    and sets ret to -ESHUTDOWN, but still proceeds to transmit pending
    beacons and calls ath10k_wmi_cmd_send_nowait().
    
    This can lead to incorrect behavior, as WMI commands and beacons are
    still sent after the device has been marked as wedged, and the original
    -ESHUTDOWN return value may be overwritten by the result of the send
    path.
    
    The wedged state indicates the hardware is already unreliable, and no
    further interaction with firmware is expected or meaningful in this
    state.
    
    Fix this by skipping beacon transmission and the WMI send path entirely
    once ATH10K_STATE_WEDGED is detected, ensuring consistent return values
    and avoiding unnecessary firmware interaction.
    
    Tested-on: QCA6174 hw3.2 PCI WLAN.RM.4.4.1-00288-QCARMSWPZ-1
    Tested-on: QCA6174 hw3.2 SDIO WLAN.RMH.4.4.1-00189
    
    Fixes: c256a94d1b1b ("wifi: ath10k: shutdown driver when hardware is unreliable")
    Signed-off-by: Kang Yang <kang.yang@oss.qualcomm.com>
    Reviewed-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Reviewed-by: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260428061737.37-1-kang.yang@oss.qualcomm.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath11k: clear shared SRNG pointer state on restart [+ + +]

Author: Kyle Farnung <kfarnung@gmail.com>
Date:   Wed May 13 21:52:12 2026 -0700

    wifi: ath11k: clear shared SRNG pointer state on restart
    
    commit f51e4b3b5574ad8cb5b16b11f8a1452147ece87a upstream.
    
    LMAC rings reuse the shared rdp/wrp pointer buffers without going
    through the normal SRNG hw-init path that zeros non-LMAC ring
    pointers. After restart, ath11k_hal_srng_clear() can therefore hand
    stale hp/tp state from the previous firmware instance back to the new
    one.
    
    Clear the shared pointer buffers while keeping the allocations in
    place so restart still avoids reallocating SRNG DMA memory, but starts
    with fresh ring-pointer state.
    
    Fixes: 32be3ca4cf78b ("wifi: ath11k: HAL SRNG: don't deinitialize and re-initialize again")
    Cc: stable@vger.kernel.org
    Closes: https://lore.kernel.org/all/CAOPSVF04q6uvVdq8GTRLHBrVMdpt9=o9wVcFMc6f-yhmSBcZqQ@mail.gmail.com/
    Signed-off-by: Kyle Farnung <kfarnung@gmail.com>
    Reviewed-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Reviewed-by: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260513-kfarnung-ath11k-srng-clear-pointer-state-v1-1-bc700dd8b333@gmail.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: ath11k: fix error path leak in ath11k_tm_cmd_wmi_ftm() [+ + +]

Author: Nicolas Escande <nico.escande@gmail.com>
Date:   Wed May 6 15:42:40 2026 +0200

    wifi: ath11k: fix error path leak in ath11k_tm_cmd_wmi_ftm()
    
    [ Upstream commit 7320d6eb861e9913193a7801834c661381756a79 ]
    
    This is similar to what was fixed by previous patches. We have a call
    to ath11k_wmi_cmd_send() which does check the return value, but forgot
    to free the related skb on error.
    
    Fixes: b43310e44edc ("wifi: ath11k: factory test mode support")
    Signed-off-by: Nicolas Escande <nico.escande@gmail.com>
    Reviewed-by: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
    Reviewed-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260506134240.2284016-4-nico.escande@gmail.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath11k: fix error path leaks in some WMI WOW calls [+ + +]

Author: Nicolas Escande <nico.escande@gmail.com>
Date:   Wed May 6 15:42:38 2026 +0200

    wifi: ath11k: fix error path leaks in some WMI WOW calls
    
    [ Upstream commit 55dda532bbc261aef495e403c8900c5e2ab5fa34 ]
    
    Fix two instances where we used to directly return the result of
    ath11k_wmi_cmd_send(...). Because we did not check the return value, we
    also did not free the skb in the error path.
    
    Fixes: 79802b13a492 ("ath11k: implement WoW enable and wakeup commands")
    Signed-off-by: Nicolas Escande <nico.escande@gmail.com>
    Reviewed-by: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
    Reviewed-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260506134240.2284016-2-nico.escande@gmail.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath11k: fix peer resolution on rx path when peer_id=0 [+ + +]

Author: Matthew Leach <matthew.leach@collabora.com>
Date:   Fri Apr 24 10:50:35 2026 +0100

    wifi: ath11k: fix peer resolution on rx path when peer_id=0
    
    [ Upstream commit 2a2451a34afdf563b3102d36a4b6cf335cf813e2 ]
    
    It has been observed that on certain chipsets a peer can be assigned
    peer_id=0. For reception of non-aggregated MPDUs this is fine as
    ath11k_dp_rx_h_find_peer() has a fallback case where it locates the peer
    based upon the source MAC address. On an aggregated link, the mpdu_start
    header is only populated by hardware on the first sub-MSDU. This causes
    the peer resolution to be skipped for the subsequent MSDUs and the
    encryption type of these frames to be set to an incorrect value,
    resulting in these MSDUs being dropped by ieee80211.
    
    ath11k_pci 0000:03:00.0: data rx skb 000000002f4b704d len 1534 peer xx:xx:xx:xx:xx:xx 0 ucast sn 3063 he160 rate_idx 9 vht_nss 2 freq 5240 band 1 flag 0x40d1a fcs-err 0 mic-err 0 amsdu-more 0 peer_id 0 first_msdu 1 last_msdu 0
    ath11k_pci 0000:03:00.0: data rx skb 0000000038acd580 len 1534 peer (null) 0 ucast sn 3063 he160 rate_idx 9 vht_nss 2 freq 5240 band 1 flag 0x40d00 fcs-err 0 mic-err 0 amsdu-more 0 peer_id 0 first_msdu 0 last_msdu 1
    
    Remove the null peer_id checks in ath11k_dp_rx_h_find_peer() and
    ath11k_hal_rx_parse_mon_status_tlv(), allowing peers with an assigned ID
    of 0 to be resolved.
    
    Tested-on: QCA2066 hw2.1 PCI WLAN.HSP.1.1-03926.13-QCAHSPSWPL_V2_SILICONZ_CE-2.52297.9
    
    Fixes: 2167fa606c0f ("ath11k: Add support for RX decapsulation offload")
    Reviewed-by: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
    Signed-off-by: Matthew Leach <matthew.leach@collabora.com>
    Reviewed-by: P Praneesh <praneesh.p@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260424-ath11k-null-peerid-workaround-v4-1-252b224d3cf6@collabora.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: ath12k: fix EHT TX MCS limitation due to wrong 20 MHz-only parsing [+ + +]

Author: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
Date:   Thu May 14 11:32:51 2026 +0800

    wifi: ath12k: fix EHT TX MCS limitation due to wrong 20 MHz-only parsing
    
    [ Upstream commit 60fb2cf51e77bb1c0261160b4be44209d68956b1 ]
    
    When connecting to an AP configured for EHT 20 MHz with a full EHT
    MCS/NSS map (supporting MCS 0-13)
    
    Supported EHT-MCS and NSS Set
        EHT-MCS Map (BW <= 80MHz): 0x444444
            .... .... .... .... .... 0100 = Rx Max Nss That Supports EHT-MCS 0-9: 4
            .... .... .... .... 0100 .... = Tx Max Nss That Supports EHT-MCS 0-9: 4
            .... .... .... 0100 .... .... = Rx Max Nss That Supports EHT-MCS 10-11: 4
            .... .... 0100 .... .... .... = Tx Max Nss That Supports EHT-MCS 10-11: 4
            .... 0100 .... .... .... .... = Rx Max Nss That Supports EHT-MCS 12-13: 4
            0100 .... .... .... .... .... = Tx Max Nss That Supports EHT-MCS 12-13: 4
    
    TX throughput is observed to be significantly lower than expected.
    Investigation shows that TX rates are limited to EHT MCS 11, even though
    the AP advertises support for EHT MCS 12/13.
    
    The root cause is an incorrect parsing of the Supported EHT-MCS and NSS
    Set element in ath12k_peer_assoc_h_eht().
    
    IEEE Std 802.11be-2024 Figure 9-1074as describes the format for 20
    MHz-Only Non-AP STAs.
    
    IEEE Std 802.11be-2024 Figure 9-1074at describes the format for all
    other AP and non-AP STAs.
    
    Currently the first format is parsed when the peer advertises no wider
    HE channel width support, without considering whether it is an AP or a
    non-AP STA. This is incorrect: the peer AP's capabilities must be parsed
    using Figure 9-1074at even when it operates on 20 MHz only. Parsing it
    as Figure 9-1074as causes rx_tx_mcs13_max_nss to be interpreted as zero,
    which is then passed to firmware, leading firmware to assume the peer
    does not support MCS 13 and to limit TX rates at MCS 11.
    
    Fix this by parsing the Figure 9-1074as format only when the peer is a
    20 MHz-Only non-AP STA, i.e. when the local interface operates as AP or
    mesh point.
    
    Tested-on: WCN7850 hw2.0 PCI WLAN.HMT.1.1.c5-00302-QCAHMTSWPL_V1.0_V2.0_SILICONZ-1.115823.3
    
    Fixes: 6c95151e2e77 ("wifi: ath12k: Add EHT MCS/NSS rates to Peer Assoc")
    Signed-off-by: Baochen Qiang <baochen.qiang@oss.qualcomm.com>
    Reviewed-by: Rameshkumar Sundaram <rameshkumar.sundaram@oss.qualcomm.com>
    Link: https://patch.msgid.link/20260514-ath12k-fix-20mhz-only-mcs-map-v1-1-a38d4a9b21a2@oss.qualcomm.com
    Signed-off-by: Jeff Johnson <jeff.johnson@oss.qualcomm.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: cfg80211: advance loop vars in cfg80211_merge_profile() [+ + +]

Author: John Walker <johnwalker0@gmail.com>
Date:   Thu May 7 17:07:20 2026 -0600

    wifi: cfg80211: advance loop vars in cfg80211_merge_profile()
    
    commit 7666dbb1bacc4ba522b96740cba7283d243d16e1 upstream.
    
    cfg80211_merge_profile() reassembles a Multi-BSSID non-transmitted BSS
    profile that has been split across multiple consecutive MBSSID elements.
    Its while-loop calls
    
            cfg80211_get_profile_continuation(ie, ielen, mbssid_elem, sub_elem)
    
    but never advances mbssid_elem or sub_elem inside the body.  Each
    iteration therefore searches for a continuation that follows the same
    fixed pair; the helper returns the same next_mbssid; and the same
    next_sub bytes are memcpy()'d into merged_ie at a growing offset until
    the buffer fills.
    
    Advance both mbssid_elem and sub_elem to the just-consumed continuation
    so the next call to cfg80211_get_profile_continuation() searches for a
    further continuation beyond it (or returns NULL when none exists).
    
    A specially-crafted malicious beacon can take advantage of this bug
    to cause the kernel to spend an excessive amount of time in
    cfg80211_merge_profile (up to as much as 2ms per beacon received),
    which could theoretically be abused in some way.
    
    Cc: stable@vger.kernel.org
    Fixes: fe806e4992c9 ("cfg80211: support profile split between elements")
    Signed-off-by: John Walker <johnwalker0@gmail.com>
    Link: https://patch.msgid.link/20260507230720.64783-1-johnwalker0@gmail.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: iwlwifi: mld: don't dereference a pointer before NULL checking it [+ + +]

Author: Miri Korenblit <miriam.rachel.korenblit@intel.com>
Date:   Fri May 15 15:14:56 2026 +0300

    wifi: iwlwifi: mld: don't dereference a pointer before NULL checking it
    
    [ Upstream commit d733ed481fd20a8e7bfe5119c4e77761ba3f87ee ]
    
    In iwl_mld_remove_link, the link->fw_id is saved at the beginning of the
    function so we have it after we freed the link.
    
    But the link pointer can be NULL, and is not checked when the fw_id is
    stored.
    
    Fix it by simply freeing the link at the end of the function.
    
    fFixes: 0e66a39f4f0e ("wifi: iwlwifi: fix potential use after free in iwl_mld_remove_link()")
    Reviewed-by: Johannes Berg <johannes.berg@intel.com>
    Link: https://patch.msgid.link/20260515151351.371f40fc6711.I6a82cfe9655564e9c5731af91c36493b26b1208e@changeid
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: fix TSO segmentation explosion when AMSDU is disabled [+ + +]

Author: Cole Leavitt <cole@unwrap.rs>
Date:   Sat Apr 4 22:41:44 2026 -0700

    wifi: iwlwifi: mld: fix TSO segmentation explosion when AMSDU is disabled
    
    [ Upstream commit 92cee08dc4f00e77fd1317e4343c5d458b0abab7 ]
    
    When the TLC notification disables AMSDU for a TID, the MLD driver sets
    max_tid_amsdu_len to the sentinel value 1. The TSO segmentation path in
    iwl_mld_tx_tso_segment() checks for zero but not for this sentinel,
    allowing it to reach the num_subframes calculation:
    
      num_subframes = (max_tid_amsdu_len + pad) / (subf_len + pad)
                    = (1 + 2) / (1534 + 2) = 0
    
    This zero propagates to iwl_tx_tso_segment() which sets:
    
      gso_size = num_subframes * mss = 0
    
    Calling skb_gso_segment() with gso_size=0 creates over 32000 tiny
    segments from a single GSO skb. This floods the TX ring with ~1024
    micro-frames (the rest are purged), creating a massive burst of TX
    completion events that can lead to memory corruption and a subsequent
    use-after-free in TCP's retransmit queue (refcount underflow in
    tcp_shifted_skb, NULL deref in tcp_rack_detect_loss).
    
    The MVM driver is immune because it checks mvmsta->amsdu_enabled before
    reaching the num_subframes calculation. The MLD driver has no equivalent
    bitmap check and relies solely on max_tid_amsdu_len, which does not
    catch the sentinel value.
    
    Fix this by detecting the sentinel value (max_tid_amsdu_len == 1) at the
    existing check and falling back to non-AMSDU TSO segmentation. Also add
    a WARN_ON_ONCE guard after the num_subframes division as defense-in-depth
    to catch any future code paths that produce zero through a different
    mechanism.
    
    Suggested-by: Miriam Rachel Korenblit <miriam.rachel.korenblit@intel.com>
    Fixes: d1e879ec600f ("wifi: iwlwifi: add iwlmld sub-driver")
    Signed-off-by: Cole Leavitt <cole@unwrap.rs>
    Link: https://patch.msgid.link/20260405054145.1064152-3-cole@unwrap.rs
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: iwlwifi: mld: stop TX during firmware restart [+ + +]

Author: Sheroz Juraev <goodmartiandev@gmail.com>
Date:   Sun Mar 15 13:12:21 2026 +0500

    wifi: iwlwifi: mld: stop TX during firmware restart
    
    commit 2becb38a3e217ef2b2f42fddd7db7a25905ec291 upstream.
    
    When iwlwifi firmware crashes (e.g., NMI_INTERRUPT_UNKNOWN on Intel
    BE201/Wi-Fi 7), iwl_mld_nic_error() sets mld->fw_status.in_hw_restart
    to true. However, iwl_mld_tx_from_txq() does not check this flag before
    dequeuing frames from mac80211 and pushing them to the transport layer.
    
    Since the firmware is dead, iwl_trans_tx() returns -EIO for each frame,
    which then gets freed immediately. Under high-throughput conditions
    (e.g., Tailscale UDP traffic or active SSH sessions), this creates a
    tight dequeue-send-fail-free loop that wastes CPU cycles and generates
    rapid skb allocation churn, leading to memory pressure from slab
    fragmentation.
    
    The RX path already has this guard (iwl_mld_rx_mpdu checks
    in_hw_restart at rx.c:1906), and so does the TXQ allocation worker
    (iwl_mld_add_txqs_wk at tx.c:156). Add the same guard to
    iwl_mld_tx_from_txq() to stop all TX during firmware restart.
    
    Frames left in mac80211's TXQs are naturally drained after restart
    completes, when queue reallocation triggers iwl_mld_tx_from_txq()
    via iwl_mld_add_txq_list(), or when new upper-layer traffic invokes
    wake_tx_queue.
    
    Tested on ASUS Zenbook 14 UX3405CA with Intel BE201 (Wi-Fi 7) on
    kernel 6.19.5 where the firmware crashes approximately every 10-15
    minutes under Tailscale traffic.
    
    Fixes: d1e879ec600f ("wifi: iwlwifi: add iwlmld sub-driver")
    Cc: stable@vger.kernel.org
    Signed-off-by: Sheroz Juraev <goodmartiandev@gmail.com>
    Link: https://patch.msgid.link/20260315081221.2678478-1-goodmartiandev@gmail.com
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: iwlwifi: mvm: fix driver-set TX rates on old devices [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Fri May 15 15:14:57 2026 +0300

    wifi: iwlwifi: mvm: fix driver-set TX rates on old devices
    
    commit fb84b5cbcaab3ca0f4e961d92a40ed7f3aac483b upstream.
    
    On old devices such as 7265D, rates are still encoded in version 1
    format, which doesn't use the CCK/OFDM rate index (0-3/0-7) but
    rather their PLCP value (e.g. 10 for 1 Mbps CCK rate.)
    
    While introducing v3 rates, I changed the driver from internally
    handling v1 rates and converting to v2, to internally handling v3
    and converting to v1 or v2 according to the firmware. I accordingly
    changed the code in iwl_mvm_mac80211_idx_to_hwrate() to no longer
    have different values for different APIs. This was correct.
    
    However, I later reverted this part of the change, because it was
    reported that I had broken beacon rates, causing a FW assert/crash.
    This caused TX_CMD rates to be set incorrectly, potentially causing
    a warning when reported back from the device as having been used.
    
    Fix this (hopefully correctly now) by handling beacon rates in the
    TX_CMD that's embedded in the beacon template command separately.
    Restore iwl_mvm_mac80211_idx_to_hwrate() to return only the rate
    index, not PLCP value, fixing the real TX_CMD.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Link: https://patch.msgid.link/20260515151351.7407e293dff7.I4ea1a17f8fe99c933d3f3e30d077cf4246125c3e@changeid
    Signed-off-by: Miri Korenblit <miriam.rachel.korenblit@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: mac80211: bounds-check link_id in ieee80211_ml_epcs [+ + +]

Author: Alexandru Hossu <hossu.alexandru@gmail.com>
Date:   Fri May 15 12:29:08 2026 +0200

    wifi: mac80211: bounds-check link_id in ieee80211_ml_epcs
    
    [ Upstream commit f718506edd2d9c6a308ded9d13c632bf7b7d5a2c ]
    
    IEEE80211_MLE_STA_EPCS_CONTROL_LINK_ID is 0x000f, so link_id extracted
    from a PRIO_ACCESS ML element PER_STA_PROFILE subelement can be 0..15.
    sdata->link[] has IEEE80211_MLD_MAX_NUM_LINKS (15) entries (indices 0..14),
    making index 15 out-of-bounds.
    
    A connected WiFi 7 AP can trigger this by sending an EPCS Enable Response
    action frame with a PER_STA_PROFILE subelement where link_id = 15.  The
    unsolicited-notification path (dialog_token = 0) is reachable any time
    EPCS is already enabled, without any prior client request.
    
    sdata->link[15] reads into the first word of sdata->activate_links_work
    (a wiphy_work whose embedded list_head is non-NULL after INIT_LIST_HEAD),
    so the NULL check on the result does not catch the invalid access.  The
    garbage pointer is then passed to ieee80211_sta_wmm_params(), which
    dereferences link->sdata and crashes the kernel.
    
    The same class of bug was fixed for ieee80211_ml_reconfiguration() by
    commit 162d331d833d ("wifi: mac80211: bounds-check link_id in
    ieee80211_ml_reconfiguration").
    
    Fixes: de86c5f60839 ("wifi: mac80211: Add support for EPCS configuration")
    Signed-off-by: Alexandru Hossu <hossu.alexandru@gmail.com>
    Link: https://patch.msgid.link/20260515102908.1653088-1-hossu.alexandru@gmail.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: capture fast-RX rate before mesh reuses skb->cb [+ + +]

Author: Zhao Li <enderaoelyther@gmail.com>
Date:   Sat May 9 12:34:28 2026 +0800

    wifi: mac80211: capture fast-RX rate before mesh reuses skb->cb
    
    commit d71c841be5d9e586ee7f36c0dc8ed4db0d9a1349 upstream.
    
    ieee80211_invoke_fast_rx() reads RX status through
    IEEE80211_SKB_RXCB(skb), which aliases the same skb->cb storage
    that ieee80211_rx_mesh_data() reuses as IEEE80211_TX_INFO.  In the
    unicast forward path, mesh_data does:
    
            info = IEEE80211_SKB_CB(fwd_skb);
            memset(info, 0, sizeof(*info));
    
    on the same skb the caller still names via rx->skb, then either
    queues the skb for TX (success) or kfree_skb()'s it (no-route)
    before returning RX_QUEUED.  The caller's RX_QUEUED arm then
    calls sta_stats_encode_rate(status) on memory that is either
    zeroed (success path) or freed (no-route path).  The latter is
    KASAN slab-use-after-free in ieee80211_prepare_and_rx_handle.
    
    Fix by encoding the rate from status before invoking
    ieee80211_rx_mesh_data(), so the RX_QUEUED arm consumes a value
    captured while status was still backed by valid memory.
    
    Fixes: 3468e1e0c639 ("wifi: mac80211: add mesh fast-rx support")
    Cc: stable@vger.kernel.org
    Signed-off-by: Zhao Li <enderaoelyther@gmail.com>
    Link: https://patch.msgid.link/20260509043427.60322-2-enderaoelyther@gmail.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: mac80211: consume only present negotiated TTLM maps [+ + +]

Author: Michael Bommarito <michael.bommarito@gmail.com>
Date:   Fri May 15 11:17:18 2026 -0400

    wifi: mac80211: consume only present negotiated TTLM maps
    
    commit a6e6ccd5bd07155c2add6c74ce1a5e68ad3b95ea upstream.
    
    ieee80211_tid_to_link_map_size_ok() validates negotiated TTLM elements
    against the number of link-map entries indicated by link_map_presence.
    ieee80211_parse_neg_ttlm() must consume the same layout.
    
    The parser advanced its cursor for every TID, including TIDs whose
    presence bit is clear and therefore have no map bytes in the element.
    A sparse map can then make a later present TID read past the validated
    element.
    
    The bad bytes land in neg_ttlm->{up,down}link[tid] but are gated by
    valid_links before being applied to driver state, so a peer cannot
    turn the read into a policy change.  Under KUnit + KASAN with an
    exact-sized element allocation the OOB read is reported as a
    slab-out-of-bounds; whether the same trigger fires under the
    production RX path depends on surrounding allocator state.
    
    Advance the cursor only when the current TID has a map present.
    
    Fixes: 8f500fbc6c65 ("wifi: mac80211: process and save negotiated TID to Link mapping request")
    Cc: stable@vger.kernel.org
    Assisted-by: Claude:claude-opus-4-7
    Signed-off-by: Michael Bommarito <michael.bommarito@gmail.com>
    Link: https://patch.msgid.link/20260515151719.1317659-2-michael.bommarito@gmail.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

wifi: mac80211: fix MLE defragmentation [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Fri May 8 09:10:31 2026 +0200

    wifi: mac80211: fix MLE defragmentation
    
    [ Upstream commit a74e893f30db64cdce0fc7a96d3baa417bcd55f5 ]
    
    If either reconf or EPCS multi-link element (MLE) is contained in
    a non-transmitted profile, the defragmentation routine is called
    with a pointer to the defragmented copy, but the original elements.
    
    This is incorrect for two reasons:
     - if the original defragmentation was needed, it will not find the
       correct data
     - if the original frame is at a higher address, the parsing will
       potentially overrun the heap data (though given the layout of
       the buffers, only into the new defragmentation buffer, and then
       it has to stop and fail once that's filled with copied data.
    
    Fix it by tracking the container along with the pointer and in
    doing so also unify the two almost identical defragmentation
    routines.
    
    Fixes: 4d70e9c5488d ("wifi: mac80211: defragment reconfiguration MLE when parsing")
    Reviewed-by: Miriam Rachel Korenblit <miriam.rachel.korenblit@intel.com>
    Reviewed-by: Ilan Peer <ilan.peer@intel.com>
    Link: https://patch.msgid.link/20260508091031.8a6c34613178.I4de16ebbce2d27f2f8f98fc49949c7a376c2fe8d@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: mac80211: fix multi-link element inheritance [+ + +]

Author: Johannes Berg <johannes.berg@intel.com>
Date:   Fri May 8 09:10:32 2026 +0200

    wifi: mac80211: fix multi-link element inheritance
    
    [ Upstream commit fe2d61a5d2849ee75dd4deeb2fe35f78d80721f8 ]
    
    When parsing a beacon, mac80211 erroneously inherits any
    reconfiguration or EPCS multi-link elements from the outer
    elements into the multi-BSSID profile that's requested, if
    connected to a non-transmitted BSS, unless that profile
    has a non-inheritance element.
    
    This also happens if parsing a multi-BSSID profile that
    doesn't have a non-inheritance element.
    
    Fix this by having an empty non-inheritance element so
    cfg80211_is_element_inherited() is invoked in these cases
    and causes the parser to skip the elements that should
    never be inherited.
    
    Fixes: cf36cdef10e2 ("wifi: mac80211: Add support for parsing Reconfiguration Multi Link element")
    Fixes: 24711d60f849 ("wifi: mac80211: Support parsing EPCS ML element")
    Reviewed-by: Ilan Peer <ilan.peer@intel.com>
    Reviewed-by: Benjamin Berg <benjamin.berg@intel.com>
    Link: https://patch.msgid.link/20260508091032.92184c0a3f08.I3c43b0b63d2cef8a4ddddaef1c2faaeb1de711ad@changeid
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

wifi: wilc1000: fix dma_buffer leak on bus acquire failure [+ + +]

Author: Shitalkumar Gandhi <shital.gandhi45@gmail.com>
Date:   Mon May 11 09:57:32 2026 +0530

    wifi: wilc1000: fix dma_buffer leak on bus acquire failure
    
    [ Upstream commit dd7b6a8671939708cc4b7a46786d8c11297e8f69 ]
    
    wilc_wlan_firmware_download() allocates dma_buffer with kmalloc() at
    the top of the function and uses a 'fail:' label to free it via
    kfree(dma_buffer) on error.
    
    All later error paths correctly use 'goto fail' to route through this
    cleanup. However, the early failure path after the first acquire_bus()
    call uses a bare 'return ret;', which leaks dma_buffer whenever the bus
    acquire fails.
    
    Replace the early return with goto fail so the existing cleanup path
    runs.
    
    Found via a custom Coccinelle semantic patch hunting for kmalloc'd
    locals leaked on early-return error paths in driver firmware-download
    code.
    
    Fixes: 1241c5650ff7 ("wifi: wilc1000: Fill in missing error handling")
    Signed-off-by: Shitalkumar Gandhi <shitalkumar.gandhi@cambiumnetworks.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://patch.msgid.link/20260511042732.998311-1-shitalkumar.gandhi@cambiumnetworks.com
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

x86/mce: Restore MCA polling interval halving [+ + +]

Author: Borislav Petkov (AMD) <bp@alien8.de>
Date:   Mon Mar 16 16:12:00 2026 +0100

    x86/mce: Restore MCA polling interval halving
    
    [ Upstream commit ea324444ece9f301b5c4ff71b258cc68990c4d61 ]
    
    RongQing reported that the MCA polling interval doesn't halve when an
    error gets logged. It was traced down to the commit in Fixes:, because:
    
      mce_timer_fn()
      |-> mce_poll_banks()
      |-> machine_check_poll()
      |-> mce_log()
    
    which will queue the work and return.
    
    Now, back in mce_timer_fn():
    
            /*
             * Alert userspace if needed. If we logged an MCE, reduce the polling
             * interval, otherwise increase the polling interval.
             */
            if (mce_notify_irq())
    
    <--- here we haven't ran the notifier chain yet so mce_need_notify is
    not set yet so this won't hit and we won't halve the interval iv.
    
    Now the notifier chain runs. mce_early_notifier() sets the bit, does
    mce_notify_irq(), that clears the bit and then the notifier chain
    a little later logs the error.
    
    So this is a silly timing issue.
    
    But, that's all unnecessary.
    
    All it needs to happen here is, the "should we notify of a logged MCE"
    mce_notify_irq() asks, should be simply a question to the mce gen pool:
    "Are you empty?"
    
    And that then turns into a simple yes or no answer and it all
    JustWorks(tm).
    
    So do that and also distribute the functionality where it belongs:
     - Print that MCE events have been logged in mce_log()
     - Trigger the mcelog tool specific work in the first notifier
    
    As a result, mce_notify_irq() can go now.
    
    Fixes: 011d82611172 ("RAS: Add a Corrected Errors Collector")
    Reported-by: Li RongQing <lirongqing@baidu.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Reviewed-by: Qiuxu Zhuo <qiuxu.zhuo@intel.com>
    Tested-by: Qiuxu Zhuo <qiuxu.zhuo@intel.com>
    Link: https://lore.kernel.org/r/20260112082747.2842-1-lirongqing@baidu.com
    Signed-off-by: Sasha Levin <sashal@kernel.org>

x86/mm: Disable broadcast TLB flush when PCID is disabled [+ + +]

Author: Tom Lendacky <thomas.lendacky@amd.com>
Date:   Wed May 20 12:00:50 2026 -0500

    x86/mm: Disable broadcast TLB flush when PCID is disabled
    
    commit 44126343d58c68adaa8343fbf1c07dd20078c35e upstream.
    
    Booting with "nopcid" clears X86_FEATURE_PCID and keeps CR4.PCIDE from being
    set to one. On AMD CPUs that support INVLPGB, broadcast TLB flushing remains
    enabled.
    
    There are two checks that decide whether the global ASID code runs,
    mm_global_asid() and consider_global_asid(), that key off of the
    X86_FEATURE_INVLPGB feature. Once an mm becomes active on more than three
    CPUs, consider_global_asid() assigns it a global ASID, after which
    flush_tlb_mm_range() takes the broadcast_tlb_flush() path using a non-zero
    PCID. Issuing an INVLPGB with a non-zero PCID while CR4.PCIDE is not set
    results in a #GP:
    
      Oops: general protection fault, kernel NULL pointer dereference 0x1: 0000 [#1] SMP NOPTI
      CPU: 158 UID: 0 PID: 3119 Comm: snap Not tainted 7.1.0-rc3 #1 PREEMPT(full)
      Hardware name: ...
      RIP: 0010:broadcast_tlb_flush
      Code: ... 89 da 48 83 c8 07 <0f> 01 fe eb 08 cc cc cc ...
      Call Trace:
       <TASK>
       flush_tlb_mm_range
       ptep_clear_flush
       wp_page_copy
       ? _raw_spin_unlock
       __handle_mm_fault
       handle_mm_fault
       do_user_addr_fault
       exc_page_fault
       asm_exc_page_fault
    
    All processors that support broadcast TLB invalidation also have PCID support,
    so it is only the "nopcid" scenario that is of concern. In this situation just
    disable the broadcast TLB support using the CPUID dependency support by making
    X86_FEATURE_INVLPGB dependent on X86_FEATURE_PCID.
    
      [ bp: Massage commit message. ]
    
    Fixes: 4afeb0ed1753 ("x86/mm: Enable broadcast TLB invalidation for multi-threaded processes")
    Suggested-by: Dave Hansen <dave.hansen@intel.com>
    Assisted-by: Claude:claude-opus-4.7
    Signed-off-by: Tom Lendacky <thomas.lendacky@amd.com>
    Signed-off-by: Borislav Petkov (AMD) <bp@alien8.de>
    Acked-by: Rik van Riel <riel@surriel.com>
    Cc: <stable@kernel.org>
    Link: https://patch.msgid.link/b915acfd63e8b2a094fdeb8dc608738072518764.1779296450.git.thomas.lendacky@amd.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

x86/xen: Fix xen_e820_swap_entry_with_ram() [+ + +]

Author: Juergen Gross <jgross@suse.com>
Date:   Tue May 5 12:24:17 2026 +0200

    x86/xen: Fix xen_e820_swap_entry_with_ram()
    
    [ Upstream commit 28e03f78e69cf6628b81f24777799778528a84c1 ]
    
    When swapping a not page-aligned E820 map entry with RAM, the start
    address of the modified entry is calculated wrong (the offset into the
    page is subtracted instead of being added to the page address).
    
    Fixes: be35d91c8880 ("xen: tolerate ACPI NVS memory overlapping with Xen allocated memory")
    Reported-by: Jan Beulich <jbeulich@suse.com>
    Reviewed-by: Jan Beulich <jbeulich@suse.com>
    Signed-off-by: Juergen Gross <jgross@suse.com>
    Message-ID: <20260505102417.208138-1-jgross@suse.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

zonefs: handle integer overflow in zonefs_fname_to_fno [+ + +]

Author: Johannes Thumshirn <johannes.thumshirn@wdc.com>
Date:   Wed Apr 29 22:58:15 2026 +0200

    zonefs: handle integer overflow in zonefs_fname_to_fno
    
    [ Upstream commit 3a8389d42bdf4213730f4067f8bfa78bae6564ef ]
    
    In zonefs the file name in one of the two directories corresponds to the
    zone number.
    
    Here Alexey reported a possible integer overflow in zonefs_fname_to_fno(),
    where the parsing of the zone number from the file name can overflow the
    'long' data type.
    
    Add a check for integer overflows and if the fno 'long' did overflow
    return -ENOENT.
    
    Reported-by: Alexey Dobriyan <adobriyan@gmail.com>
    Fixes: d207794ababe ("zonefs: Dynamically create file inodes when needed")
    Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com>
    Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>