| Mesa 19.3.0 Release Notes / 2019-12-12 |
| ====================================== |
| |
| Mesa 19.3.0 is a new development release. People who are concerned with |
| stability and reliability should stick with a previous release or wait |
| for Mesa 19.3.1. |
| |
| Mesa 19.3.0 implements the OpenGL 4.6 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being |
| used. Some drivers don't support all the features required in OpenGL |
| 4.6. OpenGL 4.6 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each |
| driver. |
| |
| Mesa 19.3.0 implements the Vulkan 1.1 API, but the version reported by |
| the apiVersion property of the VkPhysicalDeviceProperties struct depends |
| on the particular driver being used. |
| |
| SHA256 checksum |
| --------------- |
| |
| :: |
| |
| 5fa0e4e9dca79560f6882e362f9db36d81cf96da16cf6a84e0ada7466a99a5d7 mesa-19.3.0.tar.xz |
| |
| New features |
| ------------ |
| |
| - GL_ARB_gl_spirv on i965, iris. |
| - GL_ARB_spirv_extensions on i965, iris. |
| - GL_EXT_demote_to_helper_invocation on iris, i965. |
| - OpenGL 4.6 on i965, iris. |
| - EGL_EXT_image_flush_external |
| - VK_ANDROID_external_memory_android_hardware_buffer on RADV. |
| - VK_KHR_shader_clock on Intel, RADV. |
| - VK_KHR_shader_float_controls on Intel, RADV. |
| - VK_KHR_spirv_1_4 on Intel, RADV. |
| - VK_KHR_timeline_semaphore on RADV. |
| - VK_KHR_vulkan_memory_model on Intel. |
| - VK_EXT_shader_subgroup_ballot on Intel. |
| - VK_EXT_shader_subgroup_vote on Intel. |
| - VK_EXT_texel_buffer_alignment on RADV. |
| - VK_INTEL_performance_query on Intel. |
| - Meson support for windows using MSVC and MinGW |
| - scons has been deprecated for non windows |
| - Initial Intel gen12 (Tigerlake) support on anvil and iris |
| - New compiler backend "ACO" for RADV (RADV_PERFTEST=aco) |
| - VK_EXT_shader_demote_to_helper_invocation on RADV/ACO. |
| |
| Bug fixes |
| --------- |
| |
| - [RADV] The Dead Rising 4 is causing a GPU hang with LLVM backend |
| - radeonsi: mpv --vo=vaapi incorrect rendering on gfx9+ |
| - NULL resource when playing VP9 video through VDPAU on RX 570 |
| - gnome-shell overview mode crash in recent mesa |
| - radv/aco Jedi Fallen Order hair rendering buggy |
| - [RADV] VK_KHR_timeline_semaphore balloons in runtime |
| - Shadow of Mordor has randomly dancing black shadows on Talion's face |
| - ld.lld: error: duplicate symbol (mesa-19.3.0-rc1) |
| - triangle strip clipping with GL_FIRST_VERTEX_CONVENTION causes wrong |
| vertex's attribute to be broadcasted for flat interpolation |
| - [bisected][regression][g45,g965,ilk] piglit arb_fragment_program kil |
| failures |
| - textureSize(samplerExternalOES, int) missing in desktop mesa 19.1.7 |
| implementation |
| - HSW. Tropico 6 and SuperTuxKart have shadows flickering |
| - glxgears segfaults on POWER / Xvnc |
| - Objects leaving trails in Firefox with antialias and |
| preserveDrawingBuffer in three.js WebGLRednerer with mesa 19.2 |
| - radv regression after 84d9551b232bdcead017b212cbb3e291486e698c: vk: |
| error: failed to submit CS |
| - Rename ACO README to README.md |
| - Steam crash due to commit e137b3a9b71a2711c1f68c8a8b9c0a7407fbcc4b |
| (bisected) |
| - [Anv regression] SPIR-V abort in Aztec Ruins |
| - FreeBSD does not have \_GNU_SOURCE in util/strtod.c |
| - glLinkProgram crash when using gcc-9 -O3 -flto due to use of |
| uninitialised value |
| - KeyError: 'force_scons': |
| - link_shader and deserialize_glsl_program suddenly consume huge amount |
| of RAM |
| - build errors after "meson: add -Werror=empty-body to disallow |
| \`if(x);`" |
| - performance regression in Heroes of the Storm with Mesa 19.1.1 & |
| Polaris |
| - Vulkan version of "Middle-earth: Shadow of Mordor" has graphics |
| glitches on RADV driver (part 2) |
| - swr/rasterizer/core/format_types.h:1183: undefined reference to |
| \`_mm256_cvtps_ph' |
| - Meson: Building osmesa gallium and tests at the same time results in |
| osmesa gallium build failure |
| - Vulkan version of "Middle-earth: Shadow of Mordor" has graphics |
| glitches on RADV driver |
| - [amdgpu][Navi][llvm] Minimap problem in Nier Automata |
| - [bisected] anon_inode:sync_file file descriptor leak |
| - Cache meson packagecach in appveyor |
| - Piglit tests regression in gallium drivers |
| - Black ground in Dirt 4 |
| - Superbibles examples crashing Mesa drivers (radeonsi) and causing gpu |
| reset |
| - [CTS] dEQP-VK.graphicsfuzz.write-red-in-loop-nest crashes |
| - mesa and libglvnd install the same headers |
| - Multiple EGL displays with multiple window systems leads to a crash |
| - Regression: Doom (2016) crashes on Mesa 19.2 and above and Radeon 380 |
| with Vulkan (worked on Mesa 19.1) |
| - Rocket League displays corruption when the game starts |
| - drm.h:50:9: error: unknown type name 'uint8_t' |
| - Mesa build breaks when only building radeonsi due to missing llvm |
| coroutines symbols |
| - radeonsi aborting in LLVM validation test in si_compile_tgsi_shader() |
| - meson.build:1447:6: ERROR: Problem encountered: libdrm required for |
| gallium video statetrackers when using x11 |
| - Mesa doesn't build with current Scons version (3.1.0) |
| - libXvMC-1.0.12 breaks mesa build |
| - Meson can't find 32-bit libXvMCW in non-standard path |
| - Mesa installs gl.pc and egl.pc even with libglvnd >= 1.2.0 |
| |
| Changes |
| ------- |
| |
| Adam Jackson (44): |
| |
| - glx: Whitespace cleanups |
| - glx: Sync <GL/glxext.h> with Khronos |
| - glx: Make \__glXGetDrawableAttribute return true sometimes |
| - glx: Unset the direct_support bit for GLX_EXT_import_context |
| - Revert "glx: Unset the direct_support bit for GLX_EXT_import_context" |
| - egl: Enable 10bpc EGLConfigs for platform_{device,surfaceless} |
| - gallium/xlib: Fix an obvious thinko |
| - mesa: Remove unused gl_config::indexBits |
| - mesa: Eliminate gl_config::have{Accum,Depth,Stencil}Buffer |
| - mesa: Eliminate gl_config::rgbMode |
| - gallium: Require LLVM >= 3.4 |
| - gallium: Require LLVM >= 3.5 |
| - gallium: Require LLVM >= 3.6 |
| - gallium: Require LLVM >= 3.7 |
| - gallium: Require LLVM >= 3.8 |
| - gallium: Require LLVM >= 3.9 |
| - egl/dri2: Refuse to add EGLConfigs with no supported surface types |
| - glx: Remove unused indirection for glx_context->fillImage |
| - gallium: Restore VSX for llvm >= 4 |
| - ci: Run tests on i386 cross builds |
| - gallium/xlib: Remove drawable caching from the MakeCurrent path |
| - gallium/xlib: Remove MakeCurrent_PrevContext |
| - gallium/xlib: Fix glXMakeCurrent(dpy, None, None, ctx) |
| - docs: Update bug report URLs for the gitlab migration |
| - glx: Avoid atof() when computing the server's GLX version |
| - glx: Fix drawable lookup bugs in glXUseXFont |
| - egl/wayland: Reindent the format table |
| - egl/wayland: Add FP16 format support |
| - egl/wayland: Implement getCapability for the dri2 and image loaders |
| - egl/surfaceless: Add FP16 format support |
| - libgbm: Wire up getCapability for the image loader |
| - glx: Move vertex array protocol state into the indirect backend |
| - glx: Lift sending the MakeCurrent request to top-level code |
| - glx: Implement GLX_EXT_no_config_context |
| - Revert "glx: Implement GLX_EXT_no_config_context" |
| - Revert "glx: Lift sending the MakeCurrent request to top-level code" |
| - drisw: Simplify GC setup |
| - drisw: Fix and simplify drawable setup |
| - glx: Log the filename of the drm device if we fail to open it |
| - egl/dri2: Don't dlclose() the driver on dri2_load_driver_common |
| failure |
| - surfaceless: Support EGL_WL_bind_wayland_display |
| - egl: Make native display detection work more than once |
| - gallium/xlib: Fix xmesa drawable creation |
| |
| Alan Coopersmith (6): |
| |
| - gallium: Fix a bunch of undefined left-shifts in u_format\_\* |
| - c99_compat.h: Don't try to use 'restrict' in C++ code |
| - util: Make Solaris implemention of p_atomic_add work with gcc |
| - util: Workaround lack of flock on Solaris |
| - util: Solaris has linux-style pthread_setname_np |
| - meson: recognize "sunos" as the system name for Solaris |
| - intel/common: include unistd.h for ioctl() prototype on Solaris |
| |
| Alejandro Piñeiro (5): |
| |
| - i965: enable ARB_gl_spirv extension and ARB_spirv_extensions for |
| gen7+ |
| - mesa/version: uncomment SPIR-V extensions |
| - i965: Enable OpenGL 4.6 for Gen8+ |
| - v3d: take into account prim_counts_offset |
| - v3d: adds an extra MOV for any sig.ld\* |
| |
| Alex Smith (1): |
| |
| - radv: Change memory type order for GPUs without dedicated VRAM |
| |
| Alexandros Frantzis (1): |
| |
| - gitlab-ci: Update required libdrm version |
| |
| Alyssa Rosenzweig (220): |
| |
| - pan/decode: Eliminate DYN_MEMORY_PROP |
| - pan/decode: Don't print MALI_DRAW_NONE |
| - panfrost: Move pan_invocation to shared panfrost/ |
| - panfrost: Set workgroups z to 32 for non-instanced graphics |
| - pan/decode: Don't print canonical workgroup encoding |
| - panfrost: Implement workgroups_x_shift_2 quirk |
| - pan/decode: Silence workgroups_x_shift_2 |
| - pan/decode: Fix missing NULL terminator |
| - pan/decode: Don't print zero exception_status |
| - pan/decode: Express tiler structures as offsets |
| - pan/decode: Allow updating mmaps |
| - pan/decode: Bounds check polygon list and tiler heap |
| - panfrost: Move pan_tiler.c outside of Gallium |
| - pan/decode: Verify and omit polygon size |
| - pan/decode: Print "just right" count of texture pointers |
| - panfrost: Remove DRY_RUN |
| - panfrost: Correct polygon size computations |
| - pan/decode: Check for a number of potential issues |
| - pan/decode: Don't print unreferenced attribute memory |
| - pan/decode: Add static bounds checking utility |
| - pan/decode: Do not print uniform/buffers explicitly |
| - pan/decode: Validate AFBC fields are zero when AFBC is disabled |
| - pan/decode: Check for MFBD preload chicken bit |
| - pan/decode: Mark tripped zeroes with XXX |
| - pan/decode: Normalize case matching XXX format |
| - pan/decode: Normalize final instances of XXX |
| - panfrost: Fix scoreboarding with dependency on job #0 |
| - panfrost: Do not expose PIPE_CAP_TEXTURE_MIRROR_CLAMP |
| - panfrost: Don't crash on GL_CLAMP |
| - pan/decode: Guard attribute unknowns |
| - panfrost: Don't trip the prefix magic field |
| - pan/decode: Handle VARYING_DISCARD |
| - pan/decode: Treat RESERVED swizzles as errors |
| - pan/decode: Validate swizzles against format |
| - pan/decode: Don't print the default swizzle |
| - pan/decode: Use GLSL style formats/swizzles |
| - pan/decode: Guard texture unknowns as zero trips |
| - pan/decode: Break out pandecode_texture function |
| - pan/decode: Validate texture dimensionality |
| - panfrost: nr_mipmap_levels -> levels |
| - panfrost: Remove ancient TODO |
| - pan/decode: Pretty-print sRGB format |
| - panfrost: Break up usage2 field |
| - pan/decode: Use concise texture printing |
| - pan/decode: Include address in union mali_attr |
| - pan/decode: Validate attribute/varying buffer pointer |
| - pan/decode: Cleanup mali_attr printing |
| - pan/midgard: Free liveness info |
| - pan/midgard: Allocate \`dependencies\` on stack |
| - pan/decode: Don't leak FBD pointer |
| - pan/decode: Remove all_zero |
| - pan/bifrost: Avoid buffer overflow in disassembler |
| - pan/midgard: Represent unused nodes by ~0 |
| - pan/midgard: Reorder bits check to fix 8-bit masks |
| - pan/midgard: Simplify contradictory check. |
| - panfrost: Don't check reads_point_coord |
| - pan/midgard: Mark fallthrough explicitly |
| - panfrost: Pay attention to framebuffer dimension sign |
| - panfrost: Clarify intention with PIPE_SWIZZLE_X check |
| - panfrost: Prevent potential integer overflow in instancing |
| - panfrost: Hoist job != NULL check |
| - panfrost: Hoist bo != NULL check before dereference |
| - panfrost: Fix missing ret assignment in DRM code |
| - pan/bifrost: Correct file size signedness |
| - panfrost: Guard against NULL rasterizer explicitly |
| - panfrost: Pass stream_output_info by reference |
| - pan/midgard: Breakout texture reg select printer |
| - pan/midgard: Identify and disassemble indirect texture/sampler |
| - panfrost: Don't bail on PIPE_BUFFER |
| - panfrost: Implement depth range clipping |
| - panfrost: Fix PIPE_BUFFER spacing |
| - pan/midgard,bifrost: Expand nir_const_load_to_arr |
| - nir: Remove nir_const_load_to_arr |
| - pan/decode: Hoist shader-db stats to shared decode |
| - pan/midgard: Sketch static analysis to uniform count |
| - pan/midgard: Compute work_count via writes |
| - pan/midgard: Analyze simple loads/store |
| - pan/midgard: Explain ffma |
| - pan/midgard: Disassemble integer constants in hex |
| - pan/decode: Remove mali_attr(_meta) framing |
| - pan/decode: Removing uniform buffer framing |
| - pan/decode: Eliminate non-FBD dumped case |
| - pan/decode: Validate MFBD tags |
| - pan/decode: Validate and simplify FRAGMENT payloads |
| - pan/decode: Validate blend shaders don't access I/O |
| - pan/decode: Fix uniform printing |
| - pan/decode: Promote <no shader> to an error |
| - pan/decode: Disassemble before printing shader descriptor |
| - pan/decode: Validate mali_shader_meta stats |
| - pan/decode: Validate, but do not print, index buffer |
| - pan/decode: Downgrade shader property mismatch to warning |
| - pan/decode: Decode actual varying_meta address |
| - pan/decode: Print stub for uniforms |
| - pan/decode: Decouple attribute/meta printing |
| - pan/decode: Remove size/stride divisibility check |
| - pan/decode: Handle special varyings |
| - panfrost: Remove vertex buffer offset from its size |
| - panfrost: Implement gl_FragCoord correctly |
| - pan/midgard: Fix writeout combining |
| - pan/midgard: Analyze helper invocations |
| - pan/decode: Validate and quiet helper invocation flag |
| - pan/midgard, bifrost: Set lower_fdph = true |
| - pan/midgard: Switch constants to uint32 |
| - pan/midgard: Add imov->fmov optimization |
| - pan/midgard: Fold ssa_args into midgard_instruction |
| - pan/midgard: Fix invert fusing with r26 |
| - freedreno/ir3: Link directly to Sethi-Ullman paper |
| - pan/midgard: Count shader-db stats by bundled instructions |
| - pan/midgard: Factor out mir_is_scalar |
| - pan/midgard: Extract instruction sizing helper |
| - pan/midgard: Expose mir_get/set_swizzle |
| - pan/midgard: Add OP_IS_CSEL_V helper |
| - pan/midgard: Fix corner case in RA |
| - pan/midgard: Add post-schedule iteration helpers |
| - pan/midgard: Include condition in branch->src[0] |
| - pan/midgard: Document Midgard scheduling requirements |
| - pan/midgard: Ensure fragment writeout is in the final block |
| - pan/midgard: Track csel swizzle |
| - pan/midgard: Add mir_insert_instruction*scheduled helpers |
| - pan/midgard: csel_swizzle with mir get swizzle |
| - pan/midgard: Extend mir_special_index to writeout |
| - pan/midgard: Improve mir_mask_of_read_components |
| - pan/midgard: Allow NULL argument in mir_has_arg |
| - pan/midgard: Track shader quadword count while scheduling |
| - pan/midgard: Add scheduling barriers |
| - pan/midgard: Cleanup fragment writeout branch |
| - pan/midgard: Remove texture_index |
| - pan/midgard: Print branches in MIR |
| - pan/midgard: Print MIR by the bundle |
| - pan/midgard: Fix misc. RA issues |
| - pan/midgard: Do not propagate swizzles into writeout |
| - pan/midgard: Handle fragment writeout in RA |
| - pan/midgard: Schedule before RA |
| - pan/midgard: Remove mir_opt_post_move_eliminate |
| - pan/midgard: Use shared psiz clamp pass |
| - pan/decode: Fix uninitialized variables |
| - pan/decode: Use %zu instead of %d |
| - pan/decode: Use portable format specifier for 64-bit |
| - pan/decode: Add missing format specifier |
| - pan/midgard: Correct issues in disassemble.c |
| - pan/midgard: Fix cppcheck issues |
| - pan/midgard: Remove cppwrap.cpp |
| - pan/midgard: Remove mir_print_bundle |
| - pan/midgard: Remove mir_rewrite_index_*_tag |
| - panfrost: Mark (1 << 31) as unsigned |
| - panfrost: Fix misc. issues flagged by cppcheck |
| - panfrost: Remove panfrost_upload |
| - pan/midgard: Add missing parans in SWIZZLE definition |
| - pan/midgard: Fix component count handling for ldst |
| - pan/midgard: Squeeze indices before scheduling |
| - pan/midgard: Add flatten_mir helper |
| - pan/midgard: Calculate dependency graph |
| - pan/midgard: Initialize worklist |
| - pan/midgard: Add mir_choose_instruction stub |
| - pan/midgard: Add mir_update_worklist helper |
| - pan/midgard: Add mir_choose_bundle helper |
| - pan/midgard: Add mir_schedule_texture/ldst/alu helpers |
| - pan/midgard: Remove csel constant unit force |
| - pan/midgard: Add constant intersection filters |
| - pan/midgard: Add predicate->exclude |
| - pan/midgard: Implement predicate->unit |
| - pan/midgard: Add helpers for scheduling conditionals |
| - pan/midgard: Extend csel_swizzle to branches |
| - pan/midgard: Implement load/store pairing |
| - pan/midgard: Add mir_choose_alu helper |
| - pan/midgard: Add distance metric to choose_instruction |
| - pan/midgard: Use new scheduler |
| - pan/midgard: Don't double check SCALAR units |
| - pan/midgard: Extend choose_instruction for scalar units |
| - pan/midgard: Schedule to smul/sadd |
| - pan/midgard: Only one conditional per bundle allowed |
| - pan/midgard: Allow 6 instructions per bundle |
| - pan/midgard: Allow writeout to see into the future |
| - pan/midgard: Tightly pack 32-bit constants |
| - pan/midgard: Add mir_flip helper |
| - pan/midgard: Add csel invert optimization |
| - pan/midgard: Allow scheduling conditions with constants |
| - pan/midgard: Remove mir_has_multiple_writes |
| - pan/midgard: Add mir_calculate_temp_count helper |
| - pan/midgard: Move RA's liveness analysis into midgard_liveness.c |
| - pan/midgard: Don't try to OR live_in of successors |
| - pan/midgard: Begin tracking liveness metadata |
| - pan/midgard: Invalidate liveness for mir_is_live_after |
| - pan/midgard: Calculate temp_count for liveness |
| - pan/midgard: Replace mir_is_live_after with new pass |
| - pan/midgard: Report read mask for branch arguments |
| - pan/midgard: Allow non-contiguous masks in UBO lowering |
| - pan/midgard: Don't try to propagate swizzles to branches |
| - pan/midgard: Add perspective ops to mir_get_swizzle |
| - pan/midgard: Fix mir_mask_of_read_components with dot products |
| - panfrost: Disable frame throttling |
| - pan/midgard: Use 16-bit liveness masks |
| - pan/midgard: Allow COMPUTE jobs in panfrost_bo_access_for_stage |
| - pan/midgard: Fix memory corruption in register spilling |
| - pan/midgard: Do not repeatedly spill same value |
| - pan/midgard: Debug mir_insert_instruction_after_scheduled |
| - pan/midgard: Identify 64-bit atomic opcodes |
| - pan/midgard/disasm: Fix printing 8-bit/16-bit masks |
| - pan/midgard: Factor out mir_get_alu_src |
| - pan/midgard: Tableize load/store ops |
| - pan/midgard: Implement OP_IS_STORE with table |
| - pan/midgard: Add helpers for manipulating byte masks |
| - pan/midgard: Report byte masks for read components |
| - pan/midgard: Simplify mir_bytemask_of_read_components |
| - pan/midgard: Implement per-byte liveness tracking |
| - pan/midgard: Handle nontrivial masks in texture RA |
| - pan/midgard: Create dependency graph bytewise |
| - pan/midgard: Implement SIMD-aware dead code elimination |
| - panfrost/ci: Update expectations list |
| - pan/midgard: Add mir_set_bytemask helper |
| - pan/midgard: Expose more typesize manipulation routines |
| - pan/midgard: Express allocated registers as offsets |
| - pipe-loader: Add kmsro pipe_loader target |
| - pipe-loader: Default to kmsro if probe fails |
| - panfrost: Expose serialized NIR support |
| - pan/midgard: Disable precise occlusion queries |
| - panfrost: Cleanup \_shader_upper -> shader |
| - panfrost: Remove unused definitions in mali-job.h |
| - pipe-loader: Build kmsro loader for with all kmsro targets |
| - gallium/util: Support POLYGON in u_stream_outputs_for_vertices |
| |
| Andreas Baierl (5): |
| |
| - lima/ppir: Rename ppir_op_dummy to ppir_op_undef |
| - lima/ppir: Add undef handling |
| - lima/ppir: Add various varying fetch sources to disassembler |
| - lima: Fix compiler warning in standalone compiler |
| - lima: Fix crash when there are no vertex shader attributes |
| |
| Andreas Gottschling (1): |
| |
| - drisw: Fix shared memory leak on drawable resize |
| |
| Andres Gomez (12): |
| |
| - nir/algebraic: mark float optimizations returning one parameter as |
| inexact |
| - docs: Update to OpenGL 4.6 in the release notes |
| - nir/opcodes: Clear variable names confusion |
| - docs: Add the maximum implemented Vulkan API version in 19.1 rel |
| notes |
| - docs: Add the maximum implemented Vulkan API version in 19.2 rel |
| notes |
| - docs: Add the maximum implemented Vulkan API version in 19.3 rel |
| notes |
| - docs/features: Update status list of Vulkan extensions |
| - docs/features: Update VK_KHR_display_swapchain status |
| - i965/fs: add a comment about how the rounding mode in fmul is set |
| - i965/fs: set rounding mode when emitting the flrp instruction |
| - docs/relnotes: add support for GL_ARB_gl_spirv, |
| GL_ARB_spirv_extensions and OpenGL 4.6 on i965 and iris |
| - egl: Remove the 565 pbuffer-only EGL config under X11. |
| |
| Andres Rodriguez (2): |
| |
| - radv: add RADV_DEBUG=allentrypoints |
| - radv: additional query fixes |
| |
| Andrii Simiklit (1): |
| |
| - glsl: disallow incompatible matrices multiplication |
| |
| Anuj Phogat (5): |
| |
| - intel/gen12: Add L3 configurations |
| - intel: Add few Ice Lake brand strings |
| - genxml/gen11+: Add COMMON_SLICE_CHICKEN4 register |
| - intel/gen11+: Enable Hardware filtering of Semi-Pipelined State in WM |
| - intel/isl/icl: Use halign 8 instead of 4 hw workaround |
| |
| Arcady Goldmints-Orlov (1): |
| |
| - anv: fix descriptor limits on gen8 |
| |
| Bas Nieuwenhuizen (63): |
| |
| - radv: Use correct vgpr_comp_cnt for VS if both prim_id and |
| instance_id are needed. |
| - radv: Emit VGT_GS_ONCHIP_CNTL for tess on GFX10. |
| - radv: Disable NGG for geometry shaders. |
| - tu: Set up glsl types. |
| - radv: Only break batch on framebuffer change with dfsm. |
| - radv: Disable dfsm by default even on Raven. |
| - radv: Add DFSM support. |
| - glx: Remove redundant null check. |
| - amd: Build aco only if radv is enabled |
| - radv: Add workaround for hang in The Surge 2. |
| - turnip: Add image->image blitting. |
| - turnip: Always use UINT formats for copies. |
| - turnip: Disallow NPoT formats. |
| - turnip: Add todo for d24_s8 copies |
| - radv: Fix condition for skipping the continue CS. |
| - radv: Fix warning in 32-bit build. |
| - meson: Always add LLVM coroutines module. |
| - amd/llvm: Fix warning due to asserted-only variable. |
| - radv: Implement & enable VK_EXT_texel_buffer_alignment. |
| - radv: Cleanup buffer_from_fd. |
| - radv: Handle device memory alloc failure with normal free. |
| - radv: Split out layout code from image creation. |
| - radv: Delay patching for imported images until layout time. |
| - radv: Handle slightly different image dimensions. |
| - radv: Unset vk_info in radv_image_create_layout. |
| - radv: Add VK_ANDROID_external_memory_android_hardware_buffer. |
| - radv/android: Add android hardware buffer field to device memory. |
| - radv/android: Add android hardware buffer queries. |
| - radv: Disallow sparse shared images. |
| - radv: Derive android usage from create flags. |
| - radv: Deal with Android external formats. |
| - radv/android: Add android hardware buffer import/export. |
| - radv: Allow Android image binding. |
| - radv: Expose image handle compat types for Android handles. |
| - radv: Check the size of the imported buffer. |
| - radv: Enable VK_ANDROID_external_memory_android_hardware_buffer. |
| - nir/dead_cf: Remove dead control flow after infinite loops. |
| - radv: Fix single stage constant flush with merged shaders. |
| - radv: Compute hashes in secure process for secure compilation. |
| - radv: Add an early exit in the secure compile if we already have the |
| cache entries. |
| - radv: Clean up unused variable. |
| - radv: Split out commandbuffer submission. |
| - radv: Do sparse binding in queue submission. |
| - radv: Improve fence signalling in QueueSubmit. |
| - radv: Always enable syncobj when supported for all fences/semaphores. |
| - radv: Split semaphore into two parts as enum+union. |
| - radv: Add temporary datastructure for submissions. |
| - radv: Add timelines with a VK_KHR_timeline_semaphore impl. |
| - radv: Add wait-before-submit support for timelines. |
| - radv: Enable VK_KHR_timeline_semaphore. |
| - radv: Start signalling semaphores in WSI acquire. |
| - radv: Allocate space for temp. semaphore parts. |
| - radv: Fix timeout handling in syncobj wait. |
| - radv: Remove \_mesa_locale_init/fini calls. |
| - turnip: Remove \_mesa_locale_init/fini calls. |
| - anv: Remove \_mesa_locale_init/fini calls. |
| - radv: Fix disk_cache_get size argument. |
| - radv: Close all unnecessary fds in secure compile. |
| - radv: Do not change scratch settings while shaders are active. |
| - radv: Allocate cmdbuffer space for buffer marker write. |
| - radv: Unify max_descriptor_set_size. |
| - radv: Fix timeline semaphore refcounting. |
| - radv: Fix RGBX Android<->Vulkan format correspondence. |
| |
| Ben Crocker (1): |
| |
| - llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders |
| |
| Boris Brezillon (73): |
| |
| - panfrost: Free the instruction object in mir_remove_instruction() |
| - panfrost: Free all block/instruction objects before leaving |
| midgard_compile_shader_nir() |
| - panfrost: Make sure bundle.instructions[] contains valid instructions |
| - Revert "panfrost: Free all block/instruction objects before leaving |
| midgard_compile_shader_nir()" |
| - panfrost: Use ralloc() to allocate instructions to avoid leaking |
| those objs |
| - panfrost: Reset the damage area on imported resources |
| - panfrost: Add transient BOs to job batches |
| - panfrost: s/job/batch/ |
| - panfrost: Pass a batch to panfrost_drm_submit_vs_fs_batch() |
| - panfrost: Stop passing a ctx to functions being passed a batch |
| - panfrost: Make transient allocation rely on the BO cache |
| - panfrost: Convert ctx->{scratchpad, tiler_heap, tiler_dummy} to plain |
| BOs |
| - panfrost: Get rid of unused panfrost_context fields |
| - panfrost: Get rid of the now unused SLAB allocator |
| - panfrost: Rename pan_bo_cache.c into pan_bo.c |
| - panfrost: Fix a list_assert() in schedule_block() |
| - panfrost: Rework midgard_pair_load_store() to kill the nested foreach |
| loop |
| - panfrost: Use a pipe_framebuffer_state as the batch key |
| - panfrost: Get rid of the unused 'flush jobs accessing res' infra |
| - panfrost: Allow testing if a specific batch is targeting a scanout FB |
| - panfrost: Pass a batch to panfrost_{allocate,upload}_transient() |
| - panfrost: Pass a batch to functions emitting FB descs |
| - panfrost: Use ctx->wallpaper_batch in panfrost_blit_wallpaper() |
| - panfrost: Pass a batch to panfrost_set_value_job() |
| - panfrost: Prepare things to avoid flushes on FB switch |
| - panfrost: Delay payloads[].offset_start initialization |
| - panfrost: Move the fence creation in panfrost_flush() |
| - panfrost: Move the batch submission logic to panfrost_batch_submit() |
| - panfrost: Stop exposing internal panfrost_*_batch() functions |
| - panfrost: Use the correct type for the bo_handle array |
| - panfrost: Add missing panfrost_batch_add_bo() calls |
| - panfrost: Add polygon_list to the batch BO set at allocation time |
| - panfrost: Kill a useless memset(0) in panfrost_create_context() |
| - panfrost: Stop passing has_draws to panfrost_drm_submit_vs_fs_batch() |
| - panfrost: Get rid of pan_drm.c |
| - panfrost: Move panfrost_bo_{reference,unreference}() to pan_bo.c |
| - panfrost: s/PAN_ALLOCATE\_/PAN_BO\_/ |
| - panfrost: Move the BO API to its own header |
| - panfrost: Stop exposing panfrost_bo_cache_{fetch,put}() |
| - panfrost: Don't check if BO is mmaped before calling |
| panfrost_bo_mmap() |
| - panfrost: Stop passing screen around for BO operations |
| - panfrost: Stop using panfrost_bo_release() outside of pan_bo.c |
| - panfrost: Add panfrost_bo_{alloc,free}() |
| - panfrost: Don't return imported/exported BOs to the cache |
| - panfrost: Add the panfrost_batch_create_bo() helper |
| - panfrost: Add FBO BOs to batch->bos earlier |
| - panfrost: Allocate tiler and scratchpad BOs per-batch |
| - Revert "panfrost: Rework midgard_pair_load_store() to kill the nested |
| foreach loop" |
| - panfrost: Fix indexed draws |
| - dEQP-GLES2.functional.buffer.write.use.index_array.\* are passing |
| now. |
| - panfrost: Add the shader BO to the batch in patch_shader_state() |
| - panfrost: Extend the panfrost_batch_add_bo() API to pass access flags |
| - panfrost: Make panfrost_batch->bos a hash table |
| - panfrost: Add a batch fence |
| - panfrost: Use the per-batch fences to wait on the last submitted |
| batch |
| - panfrost: Add a panfrost_freeze_batch() helper |
| - panfrost: Start tracking inter-batch dependencies |
| - panfrost: Prepare panfrost_fence for batch pipelining |
| - panfrost: Add a panfrost_flush_all_batches() helper |
| - panfrost: Add a panfrost_flush_batches_accessing_bo() helper |
| - panfrost: Add flags to reflect the BO imported/exported state |
| - panfrost: Make sure the BO is 'ready' when picked from the cache |
| - panfrost: Do fine-grained flushing when preparing BO for CPU accesses |
| - panfrost: Kill the explicit serialization in panfrost_batch_submit() |
| - panfrost: Get rid of the flush in panfrost_set_framebuffer_state() |
| - Revert "st/dri2: Implement DRI2bufferDamageExtension" |
| - Revert "Revert "st/dri2: Implement DRI2bufferDamageExtension"" |
| - panfrost: Make sure a clear does not re-use a pre-existing batch |
| - panfrost: Draw the wallpaper when only depth/stencil bufs are cleared |
| - panfrost: Fix support for packed 24-bit formats |
| - panfrost: Fix the DISCARD_WHOLE_RES case in transfer_map() |
| - gallium: Fix the ->set_damage_region() implementation |
| - panfrost: Make sure we reset the damage region of RTs at flush time |
| |
| Brian Paul (3): |
| |
| - st/nir: fix illegal designated initializer in st_glsl_to_nir.cpp |
| - REVIEWERS: add VMware reviewers |
| - Call shmget() with permission 0600 instead of 0777 |
| |
| Caio Marcelo de Oliveira Filho (66): |
| |
| - intel/compiler: Silence maybe-uninitialized warning in GCC 9.1.1 |
| - anv: Drop unused local variable |
| - compiler/glsl: Fix warning about unused function |
| - intel/decoders: Avoid uninitialized variable warnings |
| - iris: Guard GEN9-only function in Iris state to avoid warning |
| - tgsi: Remove unused local |
| - i965: Silence brw_blorp uninitialized warning |
| - nir/lower_explicit_io: Handle 1 bit loads and stores |
| - glsl/nir: Avoid overflow when setting max_uniform_location |
| - mesa/st: Do not rely on name to identify special uniforms |
| - compiler: Add glsl_contains_opaque() helper |
| - mesa: Pack gl_program_parameter struct |
| - glsl/nir: Fill in the Parameters in NIR linker |
| - mesa: Fill Parameter storage indices even when not using SPIR-V |
| - mesa/program: Associate uniform storage without using names |
| - mesa/st: Lookup parameters without using names |
| - mesa/st: Extract preprocessing NIR steps |
| - mesa/st: Add support for SPIR-V shaders |
| - mesa/st: Don't expect prog->nir to already exist |
| - mesa/spirv: Set a few more extensions |
| - gallium: Add ARB_gl_spirv support |
| - glsl/nir: Add and use a gl_nir_link() function |
| - iris: Enable ARB_gl_spirv and ARB_spirv_extensions |
| - mesa/st: Fallback to name lookup when the variable have no Parameter |
| - spirv: Update JSON and headers to 1.5 |
| - spirv: Handle ShaderLayer and ShaderViewportIndex capabilities |
| - spirv: Add missing break for capability handling |
| - intel/fs: Add Fall-through comment |
| - mesa: Extension boilerplate for EXT_demote_to_helper_invocation |
| - glsl: Add ir_demote |
| - glsl: Parse \`demote\` statement |
| - glsl: Add helperInvocationEXT() builtin |
| - gallium: Add PIPE_CAP_DEMOTE_TO_HELPER_INVOCATION |
| - iris: Enable EXT_demote_to_helper_invocation |
| - i965: Enable EXT_demote_to_helper_invocation |
| - docs/relnotes: Add EXT_demote_to_helper_invocation support on iris, |
| i965 |
| - docs: Fix GL_EXT_demote_to_helper_invocation name |
| - vulkan: Update the XML and headers to 1.1.124 |
| - spirv: Implement SPV_KHR_shader_clock |
| - anv: Implement VK_KHR_shader_clock |
| - anv: Enable VK_EXT_shader_subgroup_{ballot,vote} |
| - docs: Update recently enabled VK extensions on Intel |
| - intel: Add INTEL_DEBUG=nofc for disabling fast clears |
| - anv: Disable fast clears when running with INTEL_DEBUG=nofc |
| - iris: Disable fast clears when running with INTEL_DEBUG=nofc |
| - i965: Disable fast clears when running with INTEL_DEBUG=nofc |
| - vulkan: Update the XML and headers to 1.1.125 |
| - anv: Advertise VK_KHR_spirv_1_4 |
| - intel/fs/gen12: Add tests for scoreboard pass |
| - nir: Add scoped_memory_barrier intrinsic |
| - nir/tests: Add copy propagation tests with scoped_memory_barrier |
| - intel/fs: Implement scoped_memory_barrier |
| - spirv: Parse memory semantics for atomic operations |
| - spirv: Emit memory barriers for atomic operations |
| - spirv: Add SpvMemoryModelVulkan and related capabilities |
| - spirv: Add option to emit scoped memory barriers |
| - spirv: Handle MakeTexelAvailable/Visible |
| - spirv: Handle MakePointerAvailable/Visible |
| - anv: Implement VK_KHR_vulkan_memory_model |
| - spirv: Add imageoperands_to_string helper |
| - spirv: Check that only one offset is defined as Image Operand |
| - spirv: Add helper to find args of Image Operands |
| - anv: Fix output of INTEL_DEBUG=bat for chained batches |
| - spirv: Don't fail if multiple ordering semantics bits are set |
| - spirv: Don't leak GS initialization to other stages |
| - anv: Initialize depth_bounds_test_enable when not explicitly set |
| |
| Chris Wilson (2): |
| |
| - iris: Allow packed RGB pbo uploads |
| - st/mesa: Map MESA_FORMAT_RGB_UNORM8 <-> PIPE_FORMAT_R8G8B8_UNORM |
| |
| Christian Gmeiner (13): |
| |
| - gallium: util_set_vertex_buffers_mask(..): make use of |
| u_bit_consecutive(..) |
| - etnaviv: a bit of micro-optimization |
| - Revert "gallium: remove PIPE_CAP_TEXTURE_SHADOW_MAP" |
| - etnaviv: disable ARB_shadow |
| - etnaviv: etna_resource_copy_region(..): drop assert |
| - etnaviv: support ARB_framebuffer_object |
| - etnaviv: nir: start to make use of compile_error(..) |
| - etnaviv: output the same shader-db format as freedreno, v3d and intel |
| - etnaviv: fix compile warnings |
| - etnaviv: fix code style |
| - etnaviv: store updated usage in pipe_transfer object |
| - etnaviv: keep track of buffer valid ranges for PIPE_BUFFER |
| - etnaviv: remove dead code |
| |
| Clément Guérin (1): |
| |
| - radeonsi: enable zerovram for Rocket League |
| |
| Connor Abbott (40): |
| |
| - st/nir: Fix num_inputs for VS inputs |
| - radeonsi/nir: Don't recompute num_inputs and num_outputs |
| - ac/nir: Handle const array offsets in get_deref_offset() |
| - ac/nir: Assert GS input index is constant |
| - radeonsi/nir: Don't add const offset to indirect |
| - radeonsi/nir: Add const_index when loading GS inputs |
| - radeonsi/nir: Rewrite store intrinsic gathering |
| - radeonsi/nir: Rewrite output scanning |
| - ac/nir: add a workaround for viewing a slice of 3D as a 2D image |
| - ac/nir: Remove gfx9_stride_size_workaround_for_atomic |
| - ac/nir: Rewrite gather4 integer workaround based on radeonsi |
| - ac/nir: Fix gather4 integer wa with unnormalized coordinates |
| - nir: Fix num_ssbos when lowering atomic counters |
| - ttn: Fill out more info fields |
| - radeonsi/nir: Remove uniform variable scanning |
| - radv/radeonsi: Don't count read-only data when reporting code size |
| - ac/nir: Support load_constant intrinsics |
| - ac/nir: Enable nir_opt_large_constants |
| - st/nir: Call nir_remove_unused_variables() in the opt loop |
| - st/nir: Don't lower indirects when linking |
| - gallium: Plumb through a way to disable GLSL const lowering |
| - radeonsi/nir: Don't lower constant arrays to uniforms |
| - radv: Call nir_propagate_invariant() |
| - lima/gpir: Do all lowerings before rsched |
| - lima/gpir: Ignore unscheduled successors in can_use_complex() |
| - lima/gpir: Fix schedule_first insertion logic |
| - lima/gpir: Fix fake dep handling for schedule_first nodes |
| - lima/gpir: Disallow moves for schedule_first nodes |
| - nir/opt_if: Fix undef handling in opt_split_alu_of_phi() |
| - lima/gpir: Fix compiler warning |
| - lima/gpir: Only try to place actual children |
| - lima/gpir: Support branch instructions |
| - lima/gpir: Use registers for values live in multiple blocks |
| - lima/gpir: Fix postlog2 fixup handling |
| - lima/gpir: Don't emit movs when translating from NIR |
| - lima/gpir: Fix 64-bit shift in scheduler spilling |
| - nir/opt_large_constants: Handle store writemasks |
| - nir: Fix overlapping vars in nir_assign_io_var_locations() |
| - nir/sink: Rewrite loop handling logic |
| - nir/sink: Don't sink load_ubo to outside of its defining loop |
| |
| Daniel Kolesa (1): |
| |
| - util: add auxv based PowerPC AltiVec/VSX detection |
| |
| Daniel Schürmann (44): |
| |
| - nir/algebraic: some subtraction optimizations |
| - aco: Initial commit of independent AMD compiler |
| - radv/aco: Setup alternate path in RADV to support the experimental |
| ACO compiler |
| - radv: enable clustered reductions |
| - radv/aco: enable VK_EXT_shader_demote_to_helper_invocation |
| - radv: remove dead shared variables |
| - aco: only emit waitcnt on loop continues if we there was some load or |
| export |
| - freedreno: Enable the nir_opt_algebraic_late() pass. |
| - nir: recombine nir_op_*sub when lower_sub = false |
| - nir: Remove unnecessary subtraction optimizations |
| - radv/aco: Don't lower subtractions |
| - aco: call nir_opt_algebraic_late() exhaustively |
| - nouveau: set lower_sub = true |
| - aco: re-use existing phi instruction when lowering boolean phis |
| - aco: don't reorder instructions in order to lower boolean phis |
| - aco: don't combine minmax3 if there is a neg or abs modifier in |
| between |
| - aco: ensure that uniform booleans are computed in WQM if their uses |
| happen in WQM |
| - aco: refactor value numbering |
| - aco: restrict scheduling depending on max_waves |
| - aco: only skip RAR dependencies if the variable is killed somewhere |
| - aco: add can_reorder flags to load_ubo and load_constant |
| - aco: don't schedule instructions through depending VMEM instructions |
| - aco: Lower to CSSA |
| - aco: improve live variable analysis |
| - aco: remove potential critical edge on loops. |
| - aco: fix live-range splits of phis |
| - aco: fix transitive affinities of spilled variables |
| - aco: don't insert the exec mask into set of live-out variables when |
| spilling |
| - aco: consider loop_exit blocks like merge blocks, even if they have |
| only one predecessor |
| - aco: don't add interferences between spilled phi operands |
| - aco: simplify calculation of target register pressure when spilling |
| - aco: ensure that spilled VGPR reloads are done after p_logical_start |
| - aco: omit linear VGPRs as spill variables |
| - aco: always set scratch_offset in startpgm |
| - aco: implement VGPR spilling |
| - docs/relnotes/new_features.txt: Add note about ACO |
| - aco: fix immediate offset for spills if scratch is used |
| - aco: only use single-dword loads/stores for spilling |
| - aco: fix accidential reordering of instructions when scheduling |
| - aco: workaround Tonga/Iceland hardware bug |
| - aco: fix invalid access on Pseudo_instructions |
| - aco: preserve kill flag on moved operands during RA |
| - aco: don't split live-ranges of linear VGPRs |
| - aco: fix a couple of value numbering issues |
| |
| Daniel Stone (1): |
| |
| - panfrost: Respect offset for imported resources |
| |
| Danilo Spinella (1): |
| |
| - egl: Include stddef.h in generated source |
| |
| Danylo Piliaiev (10): |
| |
| - nir/loop_unroll: Update the comments for loop_prepare_for_unroll |
| - nir/loop_unroll: Prepare loop for unrolling in wrapper_unroll |
| - nir/loop_analyze: Treat do{}while(false) loops as 0 iterations |
| - glsl: Fix unroll of do{} while(false) like loops |
| - tgsi_to_nir: Translate TGSI_INTERPOLATE_COLOR as INTERP_MODE_NONE |
| - iris: Fix fence leak in iris_fence_flush |
| - st/nine: Ignore D3DSIO_RET if it is the last instruction in a shader |
| - intel/compiler: Fix C++ one definition rule violations |
| - glsl: Initialize all fields of ir_variable in constructor |
| - i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround |
| |
| Dave Airlie (75): |
| |
| - virgl: drop unused format field |
| - virgl: fix format conversion for recent gallium changes. |
| - gallivm: fix atomic compare-and-swap |
| - llvmpipe: refactor jit type creation |
| - gallivm: make lp_build_float_to_r11g11b10 take a const src |
| - gallivm: handle helper invocation (v2) |
| - gallivm: move first/last level jit texture members. |
| - llvmpipe: handle early test property. |
| - gallivm: add a basic image limit |
| - llvmpipe: move the fragment shader variant key to dynamic length. |
| - draw: add jit image type for vs/gs images. |
| - llvmpipe: introduce image jit type to fragment shader jit. |
| - gallivm/tgsi: add image interface to tgsi builder |
| - gallivm: add image load/store/atomic support |
| - draw: add vs/gs images support |
| - llvmpipe: add fragment shader image support |
| - llvmpipe: bind vertex/geometry shader images |
| - gallivm: add support for fences api on older llvm |
| - gallivm: add memory barrier support |
| - llvmpipe: flush on api memorybarrier. |
| - llvmpipe: enable ARB_shader_image_load_store |
| - docs: add shader image extensions for llvmpipe |
| - gallivm: fix appveyor build after images changes |
| - gallivm: disable accurate cube corner for integer textures. |
| - llvmpipe: enable fb no attach |
| - gallivm/flow: add counter reset for loops |
| - gallivm: add coroutine support files to gallivm. |
| - gallivm: add coroutine pass manager support |
| - llvmpipe: reogranise jit pointer ordering |
| - gallivm: add new compute related intrinsics |
| - gallivm: add support for compute shared memory |
| - llvmpipe: add compute threadpool + mutex |
| - gallivm: add barrier support for compute shaders. |
| - llvmpipe: introduce compute shader context |
| - llvmpipe: add initial compute state structs |
| - gallivm: add compute jit interface. |
| - llvmpipe: add compute debug option |
| - llvmpipe: add initial shader create/bind/destroy variants framework. |
| - llvmpipe: introduce new state dirty tracking for compute. |
| - llvmpipe: introduce variant building infrastrucutre. |
| - llvmpipe: add compute shader generation. |
| - llvmpipe: add grid launch |
| - llvmpipe: add compute pipeline statistics support. |
| - llvmpipe: add support for compute constant buffers. |
| - llvmpipe: add compute sampler + sampler view support. |
| - llvmpipe: add ssbo support to compute shaders |
| - llvmpipe: add compute shader images support |
| - llvmpipe: add compute shader parameter fetching support |
| - llvmpipe: add local memory allocation path |
| - llvmpipe: enable compute shaders if LLVM has coroutines |
| - docs: add llvmpipe features for fb_no_attach and compute shaders |
| - st/mesa: Prefer R8 for bitmap textures |
| - st/mesa: fix R8 bitmap texture for TGSI paths. |
| - llvmpipe: make texture buffer offset alignment == 16 |
| - llvmpipe/draw: fix image sizes for vertex/geometry shaders. |
| - llvmpipe/draw: handle UBOs that are < 16 bytes. |
| - gallivm/sample: add gather component selection to the key. |
| - gallium: add a a new cap for changing the TGSI TG4 instruction |
| encoding |
| - st/glsl: add support for alternate TG4 encoding. |
| - llvmpipe: add support for tg4 component selection. |
| - gallivm: fix coroutines on aarch64 with llvm 8 |
| - gallivm/draw/swr: make the gs_iface not depend on tgsi. |
| - nir: add a pass to lower flat shading. |
| - gallium: add flatshade lowering capability |
| - st/mesa: handling lower flatshading for NIR drivers. |
| - llvmpipe: handle compute shader launch with 0 threads |
| - zink: ask for flatshade lowering |
| - zink: add dri loader |
| - zink: query support (v2) |
| - zink/spirv: store all values as uint. |
| - zink: add support for compressed formats |
| - zink: add sample mask support |
| - zink: add samples to rasterizer |
| - zink: attempt to get multisample resource creation right |
| - llvmpipe/ppc: fix if/ifdef confusion in backport. |
| |
| Dave Stevenson (1): |
| |
| - broadcom/v3d: Allow importing linear BOs with arbitrary |
| offset/stride. |
| |
| Duncan Hopkins (7): |
| |
| - zink: clamped limits to INT_MAX when stored as uint32_t. |
| - zink: fix line-width calculation |
| - zink: respect ubo buffer alignment requirement |
| - zink: limited uniform buffer size so the limits is not exceeded. |
| - zink: pass line width from rast_state to gfx_pipeline_state. |
| - zink: Use optimal layout instead of general. Reduces valid layer |
| warnings. Fixes RADV image noise. |
| - zink: make sure src image is transfer-src-optimal |
| |
| Dylan Baker (120): |
| |
| - docs: Mark 19.2.0-rc2 as done and push back rc3 and rc4/final |
| - glsl/tests: Handle windows \\r\n new lines |
| - meson: don't try to generate i18n translations on windows |
| - meson: Make shared-glapi a combo |
| - meson: don't build glapi_static_check_table on windows |
| - add a git ignore for subprojects |
| - meson: add a zlib subproject |
| - meson: add a expat subproject |
| - glapi: export glapi_destroy_multithread when building shared-glapi on |
| windows |
| - meson: fix dl detection on non cygwin windows |
| - meson: build getopt when using msvc |
| - meson: Add a platform for windows |
| - meson: don't build glx or dri by default on windows |
| - meson: don't allow glvnd on windows |
| - meson: don't generate file into subdirs |
| - Docs: mark that 19.2.0-rc3 has been released |
| - scons: Make scons and meson agree about path to glapi generated |
| headers |
| - docs: Add release notes for 19.2.0 |
| - docs: add SHA256 sum for 19.2.0 |
| - docs: update calendar, add news item, and link release notes for |
| 19.2.0 |
| - release: Push 19.3 back two weeks |
| - bin/get-pick-list: use --oneline=pretty instead of --oneline |
| - meson: fix logic for generating .pc files with old glvnd |
| - meson: Try finding libxvmcw via pkg-config before using find_library |
| - meson: Link xvmc with libxv |
| - meson: gallium media state trackers require libdrm with x11 |
| - docs: update install docs for meson |
| - docs: use https for mesonbuild.com |
| - docs: remove stray newline |
| - meson: remove -DGALLIUM_SOFTPIPE from st/osmesa |
| - docs: Add use of Closes: tag for closing gitlab issues |
| - docs: add a new_features.text file and remove 19.3.0 release notes |
| - scripts: Add a gen_release_notes.py script |
| - release: Add an update_release_calendar.py script |
| - bin: delete unused releasing scripts |
| - meson: Only error building gallium video without libdrm when the |
| platform is drm |
| - docs: Add relnotes for 19.2.1 |
| - docs: Add SHA256 sum for 19.2.1 |
| - docs: update calendar, add news item, and link release notes for |
| 19.2.1 |
| - util: use \_WIN32 instead of WIN32 |
| - meson: add windows compiler checks and libraries |
| - meson: Add windows defines to glapi |
| - meson: Add necessary defines for mesa_gallium on windows |
| - meson: build gallium gdi winsys |
| - meson: build wgl state tracker |
| - meson: build libgl-gdi target |
| - meson: build graw-gdi target |
| - meson: fix gallium-osmesa to build for windows |
| - meson: Don't check for posix_memalign on windows |
| - util/xmlconfig: include strndup.h for windows |
| - meson: fix pipe-loader compilation for windows |
| - meson: don't look for rt on windows |
| - meson: Add support for using win_flex and win_bison on windows |
| - meson: force inclusion of inttypes.h for glcpp with msvc |
| - meson: disable sse4.1 optimizations with msvc |
| - meson: add switches for SWR with MSVC |
| - meson: don't define USE_ELF_TLS for windows |
| - meson: Add idep_getopt for tests |
| - meson: Add msvc compat args to util/tests |
| - meson: Set visibility and compat args for graw |
| - meson: don't build gallium trivial tests on windows |
| - meson: disable graw tests on mingw |
| - meson: don't build or run mesa-sha1 test on windows |
| - meson: maintain names of shared API libraries |
| - meson: add msvc compat args to swr |
| - meson: don't error on formaters with mingw |
| - meson: only build timspec test if timespec is available |
| - meson: glcpp tests are expected to fail on windows |
| - meson/util: Don't run string_buffer tests on mingw |
| - glsl/tests: Handle no-exec errors |
| - docs: update meson docs for windows |
| - appveyor: Add support for meson as well as scons on windows |
| - gitlab-ci: Add a mingw x86_64 job |
| - meson: Don't use expat on windows |
| - gitlab-ci: Add a pkg-config for mingw |
| - Revert "gitlab-ci: Disable meson-mingw32-x86_64 job again for now" |
| - gitlab-ci: Set the meson wrapmode to disabled |
| - appveyor: Cache meson's wrap downloads |
| - meson/llvmpipe: Add dep_llvm to driver_swrast |
| - meson: Add support for wrapping llvm |
| - meson: Use cmake to find LLVM when building for windows |
| - docs: update meson docs for windows |
| - appveyor: Add support for building llvmpipe with meson |
| - appveyor: Move appveyor script into .appveyor directory |
| - docs: Add new feature for compiling for windows with meson |
| - meson: Require meson >= 0.49.1 when using icc or icl |
| - scons: Use print_function ins SConstruct |
| - scons: Print a deprecation warning about using scons on not windows |
| - scons: Also print a deprecation warning on windows |
| - docs: Add release not about scons deprecation |
| - docs: Add release notes for 19.2.2 |
| - docs: Add sha256 sum for 19.2.2 |
| - docs: update calendar, add news item and link release notes for |
| 19.2.2 |
| - bin/gen_release_notes.py: fix conditional of bugfix |
| - bin/gen_release_notes.py: strip '#' from gitlab bugs |
| - bin/gen_release_notes.py: Return "None" if there are no new features |
| - bin/post_version.py: Pass version as an argument |
| - bin/post_version.py: white space fixes |
| - bin/post_release.py: Add .html to hrefs |
| - bin/gen_release_notes.py: html escape all external data |
| - bin/gen_release_notes.py: Add a warning if new features are |
| introduced in a point release |
| - docs: update releasing process to use new scripts and gitlab |
| - nir: Fix invalid code for MSVC |
| - gitlab-ci: refactor out some common stuff for Windows and Linux |
| - gitlab-ci: Add a job for meson on windows |
| - VERSION: bump to rc1 |
| - nir: correct use of identity check in python |
| - meson: Add dep_glvnd to egl deps when building with glvnd |
| - Bump VERSION to 19.3.0-rc2 |
| - cherry-ignore: Update for 19.3-rc3 cycle |
| - Bump version for -rc3 |
| - cherry-ignore: update for 19.3.0-rc4 cycle |
| - VERSION: bump for 19.3.0-rc4 |
| - VERSION: Bump version for -rc5 |
| - VERSION: bump version for 19.3-rc6 |
| - cherry-ignore: update for 19.3-rc7 |
| - meson/broadcom: libbroadcom_cle needs expat headers |
| - meson/broadcom: libbroadcom_cle also needs zlib |
| - Revert "egl: avoid local modifications for eglext.h Khronos standard |
| header file" |
| - Revert "egl: move #include of local headers out of Khronos headers" |
| |
| Eduardo Lima Mitev (4): |
| |
| - nir: Add new texop nir_texop_tex_prefetch |
| - freedreno/ir3: Add a NIR pass to select tex instructions eligible for |
| pre-fetch |
| - nir: Add a new ALU nir_op_imad24_ir3 |
| - freedreno/ir3: Handle newly added opcode nir_op_imad24_ir3 |
| |
| Emil Velikov (3): |
| |
| - mesa: bump version to 19.3.0-devel |
| - docs: add 19.3.0-devel release notes template |
| - docs: update calendar for 19.2.x |
| |
| Eric Anholt (57): |
| |
| - gallium: Add a block depth field to the u_formats table. |
| - gallium: Add block depth to the format utils. |
| - gallium: Add the ASTC 3D formats. |
| - gallium: Fix mesa format name in unit test failure path. |
| - gallium: Skip generating the pack/unpack union if we don't use it. |
| - gallium: Drop the useless union wrapper on pack/unpack. |
| - gallium: Drop a bit of dead code from the pack/unpack python. |
| - gallium: Fix big-endian addressing of non-bitmask array formats. |
| - gallium: Don't emit identical endian-dependent pack/unpack code. |
| - freedreno/a6xx: Fix non-mipmap filtering selection. |
| - freedreno: Fix the type of single-component scaled vertex attrs. |
| - gallium/osmesa: Introduce a test. |
| - gallium/osmesa: Fix a race in creating the stmgr. |
| - gallium/osmesa: Move 565 format selection checks where the rest are. |
| - uapi: Update drm_fourcc.h |
| - dri: Use DRM_FORMAT\_\* instead of defining our own copy. |
| - gitlab-ci: Disable dEQP's watchdog timer. |
| - gitlab-ci: Log the driver version that got tested. |
| - freedreno: Introduce gitlab-based CI. |
| - gitlab-ci/a630: Disable flappy |
| layout_binding.ssbo.fragment_binding_array |
| - egl/android: Fix build since the DRI fourcc removal. |
| - gitlab-ci/a630: Drop remaining dEQP-GLES3.functional.draw.random.\* |
| xfails. |
| - gitlab-ci/a630: Drop the MSAA expected failure. |
| - gitlab-ci: Make the test job fail when bugs are unexpectedly fixed. |
| - freedreno: Fix invalid read when a block has no instructions. |
| - freedreno/a3xx: Mostly fix min-vs-mag filtering decisions on |
| non-mipmap tex. |
| - shader_enums: Move MAX_DRAW_BUFFERS to this file. |
| - turnip: Add a .editorconfig and .dir-locals.el |
| - turnip: Silence compiler warning about uninit pipeline. |
| - turnip: Fix failure behavior of vkCreateGraphicsPipelines. |
| - vc4: Enable the nir_opt_algebraic_late() pass. |
| - v3d: Enable the late algebraic optimizations to get real subs. |
| - nir: Make nir_search's dumping go to stderr. |
| - nir: Skip emitting no-op movs from the builder. |
| - nir: Keep the range analysis HT around intra-pass until we make a |
| change. |
| - nir: Factor out most of the algebraic passes C code to .c/.h. |
| - nir: Fix some wonky whitespace in nir_search.h. |
| - turnip: Drop unused tu_pack_clear_value() return. |
| - turnip: Fill in clear color packing for r10g11b11 and rgb9e5. |
| - turnip: Tell spirv_to_nir that we want fragcoord as a sysval. |
| - turnip: Set up the correct tiling mode for small attachments. |
| - turnip: Emit clears of gmem using linear. |
| - freedreno/ci: Ban texsubimage2d_pbo.r16ui_2d, due to two flakes |
| reported. |
| - mesa: Add debug info to \_mesa_format_from_format_and_type() error |
| path. |
| - mesa: Fix depth/stencil ordering in |
| \_mesa_format_from_format_and_type(). |
| - mesa: Add format/type matching for DEPTH/UINT_24_8. |
| - mesa: Add support for array formats of depth and stencil. |
| - mesa: Refactor the entirety of |
| \_mesa_format_matches_format_and_type(). |
| - v3d: Add Compute Shader support |
| - r100/r200: factor out txformat/txfilter setup from the TFP path. |
| - radeon: Fill in the TXOFFSET field containing the tile bits in our |
| relocs. |
| - radeon: Drop the unused first arg of OUT_BATCH_RELOC. |
| - mesa: Replace the LA16_UNORM packed formats with one array format. |
| - mesa: Replace MESA_FORMAT_L8A8/A8L8 UNORM/SNORM/SRGB with an array |
| format. |
| - gallium: Drop the unused PIPE_FORMAT_A*L\* formats. |
| - mesa: Redefine the RG formats as array formats. |
| - ci: Disable lima until its farm can get fixed. |
| |
| Eric Engestrom (104): |
| |
| - scons: define MESA_LLVM_VERSION_STRING like the other build systems |
| do |
| - llvmpipe: use LLVM version string instead of re-computing it |
| - swr: use LLVM version string instead of re-computing it |
| - scons: add support for MAJOR_IN_{MKDEV,SYSMACROS} |
| - egl: warn user if they set an invalid EGL_PLATFORM |
| - ttn: fix 64-bit shift on 32-bit \`1\` |
| - egl: fix deadlock in malloc error path |
| - util/os_file: fix double-close() |
| - anv: fix format string in error message |
| - freedreno/drm-shim: fix mem leak |
| - nir: fix memleak in error path |
| - gallivm: replace \`0x\` version print with actual version string |
| - meson/scons/android: add LLVM_AVAILABLE binary flag |
| - aux/draw: replace binary HAVE_LLVM checks with LLVM_AVAILABLE |
| - r600: replace binary HAVE_LLVM checks with LLVM_AVAILABLE |
| - svga: replace binary HAVE_LLVM checks with LLVM_AVAILABLE |
| - amd: replace major llvm version checks with LLVM_VERSION_MAJOR |
| - swr: replace major llvm version checks with LLVM_VERSION_MAJOR |
| - gallivm: replace major llvm version checks with LLVM_VERSION_MAJOR |
| - clover: replace major llvm version checks with LLVM_VERSION_MAJOR |
| - gallivm: replace more complex 3.x version check with |
| LLVM_VERSION_MAJOR/MINOR |
| - clover: replace more complex 3.x version check with |
| LLVM_VERSION_MAJOR/MINOR |
| - llvmpipe: replace more complex 3.x version check with |
| LLVM_VERSION_MAJOR/MINOR |
| - meson/scons/android: drop now-unused HAVE_LLVM |
| - gallivm: drop LLVM<3.3 code paths as no build system allows that |
| - anv: add support for driconf |
| - wsi: add minImageCount override |
| - anv: add support for vk_x11_override_min_image_count |
| - amd: move adaptive sync to performance section, as it is defined in |
| xmlpool |
| - radv: add support for vk_x11_override_min_image_count |
| - drirc: override minImageCount=2 for gfxbench |
| - meson/iris: replace partial list of nir dep files with |
| idep_nir_headers |
| - meson/v3d: replace partial list of nir dep files with |
| idep_nir_headers |
| - gitlab-ci: rename stages to something simpler |
| - gl: drop incorrect pkg-config file for glvnd |
| - anv: split instance dispatch table |
| - anv: implement ICD interface v4 |
| - meson: split compiler warnings one per line |
| - radv: fix s/load/store/ copy-paste typo |
| - meson: drop -Wno-foo bug workaround for Meson < 0.46 |
| - meson: split more compiler options to their own line |
| - meson: re-add incorrect pkg-config files with GLVND for backward |
| compatibility |
| - docs/release-calendar: fix bugfix release numbers |
| - docs/release-calendar: add missing <td> and </td> |
| - glsl: turn runtime asserts of compile-time value into compile-time |
| asserts |
| - etnaviv: fix bitmask typo |
| - docs/install: drop autotools references |
| - git: delete .gitattributes |
| - egl: replace MESA_EGL_NO_X11_HEADERS hack with upstream EGL_NO_X11 |
| - loader: replace int/1/0 with bool/true/false |
| - loader: s/int/bool/ for predicate result |
| - loader: use ARRAY_SIZE instead of NULL sentinel |
| - meson/loader: drop unneeded \*.h file |
| - script: drop get_reviewer.pl |
| - meson: add missing idep_nir_headers in iris_gen_libs |
| - meson: use idep_nir instead of libnir in libnouveau |
| - meson: use idep_nir instead of libnir in libclnir |
| - meson: use idep_nir instead of libnir in gallium nine |
| - meson: use idep_nir instead of libnir in haiku softpipe |
| - meson: use idep_nir instead of libnir in pipe-loader |
| - meson: rename libnir to \_libnir to make it clear it's not meant to |
| be used anywhere else |
| - meson: drop duplicate inc_nir from libiris |
| - meson: drop duplicate inc_nir from libglsl |
| - meson: drop duplicate inc_nir from spirv2nir |
| - meson: drop unused inc_nir |
| - include: update drm-uapi |
| - meson: fix sys/mkdev.h detection on Solaris |
| - GL: drop symbols mangling support |
| - meson: rename \`glvnd_missing_pc_files\` to \`not |
| glvnd_has_headers_and_pc_files\` |
| - meson: move a couple of include installs around |
| - meson: split headers one per line |
| - meson: split Mesa headers as a separate installation |
| - meson: skip installation of GLVND-provided headers |
| - symbols-check: ignore exported C++ symbols |
| - anv: add exported symbols check |
| - radv: add exported symbols check |
| - gbm: turn 0/-1 bool into true/false |
| - gbm: replace 1/0 bool with true/false |
| - gbm: replace NULL sentinel with explicit ARRAY_SIZE() |
| - gbm: use size_t for array indexes |
| - gitlab-ci: set a common job parent for container stage |
| - gitlab-ci: set a common job parent for build stage |
| - gitlab-ci: set a common job parent for test stage |
| - mesa/math: delete leftover... from 18 years ago (!) |
| - mesa/math: delete duplicate extern symbol |
| - util/u_atomic: fix return type of p_atomic_{inc,dec}_return() and |
| p_atomic_{cmp,}xchg() |
| - travis: don't (re)install python |
| - travis: test meson install as well |
| - osmesa: add missing #include <stdint.h> |
| - llvmpipe: avoid compiling no-op block on release builds |
| - llvmpipe: avoid generating empty-body blocks |
| - meson: add -Werror=empty-body to disallow \`if(x);\` |
| - anv: fix error message |
| - anv: fix empty-body instruction |
| - radv: fix empty-body instruction |
| - v3d: fix empty-body instruction |
| - tu: fix empty-body instruction |
| - anv: add a couple printflike() annotations |
| - loader: default to iris for all future PCI IDs |
| - travis: fix scons build after deprecation warning |
| - meson: define \_GNU_SOURCE on FreeBSD |
| - egl: fix \_EGL_NATIVE_PLATFORM fallback |
| - egl: move #include of local headers out of Khronos headers |
| - vulkan: delete typo'd header |
| |
| Erico Nunes (7): |
| |
| - lima: fix ppir spill stack allocation |
| - lima/ppir: lower selects to scalars |
| - lima/ppir: enable vectorize optimization |
| - lima/ppir: mark regalloc created ssa unspillable |
| - lima/ppir: optimizations in regalloc spilling code |
| - lima/ppir: improve regalloc spill cost calculation |
| - lima: remove partial clear support from pipe->clear() |
| |
| Erik Faye-Lund (210): |
| |
| - gallium/auxiliary/indices: consistently apply start only to input |
| - mesa/main: remove unused include |
| - util: fix SSE-version needed for double opcodes |
| - util: do not assume MSVC implies SSE |
| - mesa/x86: improve SSE-checks for MSVC |
| - util: only allow \_BitScanReverse64 on 64-bit cpus |
| - gallium/gdi: use GALLIUM_FOO rather than HAVE_FOO |
| - st/mesa: remove always-true expression |
| - .mailmap: add an alias for Michel Dänzer |
| - .mailmap: add an alias for Eric Engestrom |
| - .mailmap: add an alias for Bas Nieuwenhuizen |
| - .mailmap: add an alias for Frank Binns |
| - glsl: correct bitcast-helpers |
| - loader/dri3: do not blit outside old/new buffers |
| - .mailmap: specify spelling for Elie Tournier |
| - .mailmap: add an alias for Alexandros Frantzis |
| - .mailmap: add an alias for Gert Wollny |
| - .mailmap: add an alias for Tomeu Vizoso |
| - .mailmap: add a couple of aliases for Jakob Bornecrantz |
| - nir: initialize uses_discard to false |
| - nir: initialize needs_helper_invocations as well |
| - mesa/main: prefer R8-textures instead of A8 for glBitmap in display |
| lists |
| - gallium/u_blitter: set a more sane viewport-state |
| - mesa: expose alpha-ref as a state-variable |
| - nir: allow passing alpha-ref state to lowering-code |
| - mesa/gallium: automatically lower alpha-testing |
| - st/mesa: move point_size_per_vertex-logic to helper |
| - nir: add lowering-pass for point-size mov |
| - mesa/gallium: automatically lower point-size |
| - nir: support derefs in two-sided lighting lowering |
| - mesa/gallium: automatically lower two-sided lighting |
| - nir: support lowering clipdist to arrays |
| - nir: support feeding state to nir_lower_clip_[vg]s |
| - mesa/program: support referencing the clip-space clip-plane state |
| - mesa/st: support lowering user-clip-planes automatically |
| - panfrost: do not report alpha-test as supported |
| - vc4: do not report alpha-test as supported |
| - v3d: do not report alpha-test as supported |
| - nir: drop support for using load_alpha_ref_float |
| - nir: drop unused alpha_ref_float |
| - mesa/st: assert that lowering is supported |
| - Revert "nir: drop unused alpha_ref_float" |
| - Revert "nir: drop support for using load_alpha_ref_float" |
| - Revert "v3d: do not report alpha-test as supported" |
| - Revert "vc4: do not report alpha-test as supported" |
| - zink: introduce opengl over vulkan |
| - zink: detect presence of VK_KHR_maintenance1 |
| - zink/spirv: implement point-sprites |
| - zink: transform z-range |
| - zink: remove discard_if |
| - zink/spirv: implement some integer ops |
| - zink/spirv: handle reading registers |
| - zink/spirv: prepare for control-flow |
| - zink/spirv: implement if-statements |
| - zink/spirv: implement discard |
| - zink/spirv: implement loops |
| - zink: prepare for caching of renderpases/framebuffers |
| - zink: move render-pass begin to helper |
| - zink: do not leak image-views |
| - zink: move cmdbuf-resetting into a helper |
| - zink: prepare for multiple cmdbufs |
| - zink: pass zink_render_pass to pipeline-creation |
| - zink: cache programs |
| - zink: move renderpass inside gfx pipeline state |
| - zink: cache those pipelines |
| - zink: reference renderpass and framebuffer from cmdbuf |
| - zink: return old fence from zink_flush |
| - zink: reference vertex and index buffers |
| - zink: reference ubos and textures |
| - zink: wait for idle on context-destroy |
| - zink: whitespace cleanup |
| - zink: reference blit/copy-region resources |
| - zink: add curr_cmdbuf-helper |
| - zink: delete samplers after the current cmdbuf |
| - zink: texture-rects? |
| - zink: store shader_info in zink_shader |
| - zink: implement fmod |
| - zink: track used resources |
| - zink: do not destroy staging-resource, deref it |
| - zink: use uvec for undefs |
| - zink: emit dedicated block for variables |
| - zink: ensure non-fragment shaders use lod-versions of texture |
| - zink: ensure textures are transitioned properly |
| - zink: assign increasing locations to varyings |
| - zink: move primitive-topology stuff into program |
| - zink: tweak state handling |
| - zink: remove unusual alignment |
| - zink: return after blitting |
| - zink: implement batching |
| - zink: simplify renderpass/framebuffer logic a tad |
| - zink: cache render-passes |
| - zink: cache framebuffers |
| - zink: more batch-ism |
| - zink: use helper |
| - zink: fixup parameter name |
| - zink: ensure sampler-views survive a batch |
| - zink: remove hack-comment |
| - zink: clean up render-pass management |
| - zink: rename sampler-view destroy function |
| - zink: pass screen instead of device to program-functions |
| - zink: keep a reference to used render-passes |
| - zink: prepare for shadow-samplers |
| - zink: kill dead code |
| - zink: clamp scissors |
| - zink: do not use hash-table for regs |
| - zink: squashme: forward declare hash_table |
| - zink: squashme: trade cplusplus wrapper for header-guard |
| - zink: fix off-by-one in assert |
| - zink: reuse constants |
| - zink: pool descriptors per batch |
| - zink: request alpha-test lowering |
| - zink/spirv: var -> regs |
| - zink/spirv: rename vec_type |
| - zink: do not lower io |
| - zink: request ucp-lowering |
| - zink: cleanup zink_end_batch |
| - zink: drop unused argument |
| - zink: refactor fence destruction |
| - zink: only consider format-desc if checking details |
| - zink: document end-of-frame hack |
| - zink: use pipe_stencil_ref instead of uint32_t-array |
| - zink: store sampler and image_view counts |
| - zink: save original scissor and viewport |
| - zink: save all supported util_blitter states |
| - zink: process one aspect-mask bit at the time |
| - zink: clean up opcode-emitting a bit |
| - zink: add some opcodes |
| - zink: add division ops |
| - zink: add shift ops |
| - zink: implement ineg |
| - zink: more comparison-ops |
| - zink: more converts |
| - zink: add more compares |
| - zink: crash hard on unknown queries |
| - zink: abort on submit-failure |
| - zink: stub resource_from_handle |
| - zink: make sure imageExtent.depth is 1 for arrays |
| - zink/spirv: correct opcode |
| - zink: support more texturing |
| - zink: wait for transfer when reading |
| - zink/spirv: be a bit more strict with fragment-results |
| - zink/spirv: debug-print unknown varying slots |
| - zink: ensure layout is reasonable before copying |
| - zink: fixup: save rasterizer |
| - zink: set ExecutionModeDepthReplacing when depth is written |
| - zink: avoid texelFetch until it's implemented |
| - zink: remove insecure comment |
| - zink: don't crash when setting rast-state to NULL |
| - zink: add note about enabling PIPE_CAP_CLIP_HALFZ |
| - zink/spirv: always enable Sampled1D for fragment shaders |
| - zink: do not use both depth and stencil aspects for sampler-views |
| - zink/spirv: support vec1 coordinates |
| - zink: fixup boolean queries |
| - zink: disable timestamp-queries |
| - zink: move set_active_query_state-stub to zink_query.c |
| - HACK: zink: suspend / resume queries on batch-boundaries |
| - zink: also accept txl |
| - zink: use primconvert to get rid of 8-bit indices |
| - zink: initialize nr_samples for pipe_surface |
| - zink: fix rendering to 3D-textures |
| - zink: support shadow-samplers |
| - zink: disable PIPE_CAP_QUERY_TIME_ELAPSED for now |
| - zink: add missing sRGB DXT-formats |
| - zink: lower point-size |
| - zink/spirv: use ordered compares |
| - zink/spirv: implement f2b1 |
| - zink/spirv: assert bit-size |
| - zink/spirv: implement bcsel |
| - zink/spirv: implement bitwise ops |
| - zink/spirv: implement b2i32 |
| - zink/spirv: implement emit_select helper |
| - zink/spirv: implement emit_float_const helper |
| - zink/spirv: use bit_size instead of hard-coding |
| - zink/spirv: add emit_bitcast-helper |
| - zink/spirv: add emit_uint_const-helper |
| - zink/spirv: inline get_uvec_constant into emit_load_const |
| - zink/spirv: clean up get_[fu]vec_constant |
| - zink/spirv: fixup b2i32 and implement b2f32 |
| - zink/spirv: prepare for 1-bit booleans |
| - zink: do not lower bools to float |
| - zink/spirv: fixup b2i32 |
| - zink/spirv: implement load_front_face |
| - zink/spirv: alias generic varyings on non-generic ones |
| - zink: lower two-sided coloring |
| - zink/spirv: alias var0 on tex0 etc instead |
| - zink: do not set VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for non-3D |
| textures |
| - zink: use VK_FORMAT_B8G8R8A8_UNORM for PIPE_FORMAT_B8G8R8X8_UNORM |
| - zink: implement resource_from_handle |
| - zink: refactor blitting |
| - zink: fixup return-value |
| - zink: pass screen to zink_create_gfx_pipeline |
| - zink: do not set lineWidth to invalid value |
| - zink: fixup scissoring |
| - zink/spirv: more complete sampler-dim handling |
| - zink: simplify gl-to-vulkan lowering |
| - gitlab-ci: also build Zink on CI |
| - gitlab-ci: fixup debian tags |
| - zink: error if VK_KHR_maintenance1 isn't supported |
| - zink: emulate optional depth-formats |
| - st/mesa: lower global vars to local after lowering clip |
| - zink: use dynamic state for line-width |
| - zink: use bitfield for dirty flagging |
| - zink: drop nop descriptor-updates |
| - zink: only enable KHR_external_memory_fd if supported |
| - zink: emit line-width when using polygon line-mode |
| - zink: use actual format for render-pass |
| - zink: always allow mutating the format |
| - zink: do not advertize coherent mapping |
| - zink: disable fragment-shader texture-lod |
| - zink: correct depth-stencil format |
| |
| Francisco Jerez (56): |
| |
| - intel/fs: Teach fs_inst::is_send_from_grf() about some missing |
| send-like instructions. |
| - intel/fs: Define is_payload() method of the IR instruction class. |
| - intel/fs: Define is_send() convenience IR helper. |
| - intel/fs: Fix constness of implied_mrf_writes() argument. |
| - intel/eu: Split brw_inst ex_desc accessors for SEND(C) vs. SENDS(C). |
| - intel/eu: Fix up various type conversions in brw_eu.c that are |
| illegal C++. |
| - intel/eu: Rework opcode description tables to allow efficient look-up |
| by either HW or IR opcode. |
| - intel/eu: Encode and decode native instruction opcodes from/to IR |
| opcodes. |
| - intel/ir: Drop hard-coded correspondence between IR and HW opcodes. |
| - intel/ir: Represent physical and logical subsets of the CFG. |
| - intel/ir: Add helper function to push block onto CFG analysis stack. |
| - intel/ir: Represent logical edge of BREAK instruction. |
| - intel/ir: Represent physical edge of ELSE instruction. |
| - intel/ir: Represent physical edge of unconditional CONTINUE |
| instruction. |
| - intel/eu/gen12: Extend brw_inst.h macros for Gen12 support. |
| - intel/eu/gen12: Add sanity-check asserts to brw_inst_bits() and |
| brw_inst_set_bits(). |
| - intel/eu/gen12: Implement basic instruction binary encoding. |
| - intel/eu/gen12: Implement three-source instruction binary encoding. |
| - intel/eu/gen12: Implement control flow instruction binary encoding. |
| - intel/eu/gen12: Implement SEND instruction binary encoding. |
| - intel/eu/gen12: Implement indirect region binary encoding. |
| - intel/eu/gen12: Implement compact instruction binary encoding. |
| - intel/eu/gen12: Implement datatype binary encoding. |
| - intel/eu/gen11+: Mark dot product opcodes as unsupported on |
| opcode_descs table. |
| - intel/eu/gen12: Add Gen12 opcode descriptions to the table. |
| - intel/eu/gen12: Fix codegen of immediate source regions. |
| - intel/eu/gen12: Codegen three-source instruction source and |
| destination regions. |
| - intel/eu/gen12: Codegen control flow instructions correctly. |
| - intel/eu/gen12: Codegen pathological SEND source and destination |
| regions. |
| - intel/eu/gen12: Codegen SEND descriptor regions correctly. |
| - intel/eu/gen12: Use SEND instruction for split sends. |
| - intel/eu/gen12: Don't set DD control, it's gone. |
| - intel/eu/gen12: Don't set thread control, it's gone. |
| - intel/ir/gen12: Add SYNC hardware instruction. |
| - intel/fs/gen12: Add codegen support for the SYNC instruction. |
| - intel/eu/gen12: Add auxiliary type to represent SWSB information |
| during codegen. |
| - intel/eu/gen12: Add tracking of default SWSB state to the current |
| brw_codegen instruction. |
| - intel/eu/gen12: Set SWSB annotations in hand-crafted assembly. |
| - intel/fs/gen12: Add scheduling information to the IR. |
| - intel/fs/gen12: Introduce software scoreboard lowering pass. |
| - intel/fs/gen12: Demodernize software scoreboard lowering pass. |
| - intel/disasm/gen12: Disassemble software scoreboard information. |
| - intel/disasm/gen12: Fix disassembly of some common instruction |
| controls. |
| - intel/disasm/gen12: Disassemble three-source instruction source and |
| destination regions. |
| - intel/disasm/gen12: Disassemble Gen12 SYNC instruction. |
| - intel/disasm/gen12: Disassemble Gen12 SEND instructions. |
| - intel/disasm: Don't disassemble saturate control on SEND |
| instructions. |
| - intel/disasm: Disassemble register file of split SEND sources. |
| - intel/fs/gen12: Don't support source mods for 32x16 integer multiply. |
| - intel/eu/validate/gen12: Implement integer multiply restrictions in |
| EU validator. |
| - intel/eu/validate/gen12: Fix validation of SYNC instruction. |
| - intel/eu/validate/gen12: Validation fixes for SEND instruction. |
| - intel/ir/gen12: Update assert in brw_stage_has_packed_dispatch(). |
| - intel/eu: Don't set notify descriptor field of gateway barrier |
| message. |
| - intel/fs/gen12: Fix barrier codegen. |
| - intel/fs/gen11+: Fix CS_OPCODE_CS_TERMINATE codegen. |
| |
| Fritz Koenig (5): |
| |
| - include/GLES2: Sync GLES2 headers with Khronos |
| - mesa: GetFramebufferParameteriv spelling |
| - mesa: Allow MESA_framebuffer_flip_y for GLES 3 |
| - gallium: Enable MESA_framebuffer_flip_y |
| - freedreno: reorder format check |
| |
| Gert Wollny (4): |
| |
| - radeonsi: Release storage for smda_uploads when the context is |
| destroyed |
| - etnaviv: enable triangle strips only when the hardware supports it |
| - r600: Fix interpolateAtCentroid |
| - r600: Disable eight bit three channel formats |
| |
| Greg V (1): |
| |
| - clover: use iterator_range in get_kernel_nodes |
| |
| Gurchetan Singh (4): |
| |
| - virgl: remove stride from virgl_hw_res |
| - virgl: modify resource_create_from_handle(..) callback |
| - virgl: modify internal structures to track winsys-supplied data |
| - virgl: honor winsys supplied metadata |
| |
| Haihao Xiang (1): |
| |
| - i965: support AYUV/XYUV for external import only |
| |
| Hal Gentz (11): |
| |
| - glx: Fix SEGV due to dereferencing a NULL ptr from XCB-GLX. |
| - clover: Fix build after clang r370122. |
| - gallium/osmesa: Fix the inability to set no context as current. |
| - egl: Add EGL_CONFIG_SELECT_GROUP_MESA ext. |
| - egl: Fixes transparency with EGL and X11. |
| - egl: Puts RGBA visuals in the second config selection group. |
| - egl: Configs w/o double buffering support have no \`EGL_WINDOW_BIT`. |
| - Revert "egl: Configs w/o double buffering support have no |
| \`EGL_WINDOW_BIT`." |
| - Revert "egl: Puts RGBA visuals in the second config selection group." |
| - Revert "egl: Fixes transparency with EGL and X11." |
| - Revert "egl: Add EGL_CONFIG_SELECT_GROUP_MESA ext." |
| |
| Heinrich Fink (8): |
| |
| - include: sync GL headers with registry |
| - specs: Sync framebuffer_flip_y text with GL registry |
| - headers: remove redundant GL token from GL wrapper |
| - specs: Add GL_MESA_EGL_sync |
| - registry: update gl.xml with GL_MESA_EGL_sync token |
| - headers: Add GL_MESA_EGL_sync token to GL |
| - egl: Add GL_MESA_EGL_sync support |
| - mesa/gl: Sync with Khronos registry |
| |
| Hyunjun Ko (3): |
| |
| - freedreno/ir3: Add data structures to support texture pre-fetch |
| - freedreno/ir3: Add support for texture sampling pre-dispatch |
| - freedreno/ir3: fix printing output registers of FS. |
| |
| Iago Toral (1): |
| |
| - v3d: drop unused shader_rec_count member from context |
| |
| Iago Toral Quiroga (13): |
| |
| - prog_to_nir: VARYING_SLOT_PSIZ is a scalar |
| - gallium/ttn: VARYING_SLOT_PSIZ and VARYING_SLOT_FOGC are scalar |
| - nir/lower_point_size: assume scalar PSIZ |
| - v3d: add missing line break for performance debug message |
| - v3d: make sure we have enough space in the CL for the primitive |
| counts packet |
| - v3d: remove redundant update of queued draw calls |
| - v3d: fix TF primitive counts for resume without draw |
| - mesa/main: GL_GEOMETRY_SHADER_INVOCATIONS exists in |
| GL_OES_geometry_shader |
| - v3d: trivial update to obsolete comment |
| - v3d: add new flag dirty TMU cache at v3d_compiler |
| - broadcom: document known hardware issues for L2T flush command |
| - v3d: request the kernel to flush caches when TMU is dirty |
| - st/mesa: only require ESSL 3.1 for geometry shaders |
| |
| Ian Romanick (22): |
| |
| - nir/algrbraic: Don't optimize open-coded bitfield reverse when |
| lowering is enabled |
| - intel/compiler: Request bitfield_reverse lowering on pre-Gen7 |
| hardware |
| - nir/algebraic: Mark some value range analysis-based optimizations |
| imprecise |
| - nir/algebraic: Clean up value range analysis-based optimizations |
| - nir/range-analysis: Adjust result range of exp2 to account for |
| flush-to-zero |
| - nir/range-analysis: Adjust result range of multiplication to account |
| for flush-to-zero |
| - nir/range-analysis: Fix incorrect fadd range result for (ne_zero, |
| ne_zero) |
| - nir/range-analysis: Handle constants in nir_op_mov just like |
| nir_op_bcsel |
| - nir/range-analysis: Range tracking for fpow |
| - nir/range-analysis: Add a lot more assertions about the contents of |
| tables |
| - nir/algebraic: Do not apply late DPH optimization in vertex |
| processing stages |
| - nir/algebraic: Additional D3D Boolean optimization |
| - nir/range-analysis: Bail if the types don't match |
| - nir/range-analysis: Use types in the hash key |
| - nir/range-analysis: Use types to provide better ranges from bcsel and |
| mov |
| - nir/search: Fix possible NULL dereference in is_fsign |
| - intel/vec4: Don't try both sources as immediates for DPH |
| - intel/compiler: Report the number of non-spill/fill SEND messages on |
| vec4 too |
| - nir/algebraic: Add the ability to mark a replacement as exact |
| - nir/algebraic: Mark other comparison exact when removing a == a |
| - intel/fs: Disable conditional discard optimization on Gen4 and Gen5 |
| - intel/compiler: Fix 'comparison is always true' warning |
| |
| Icenowy Zheng (4): |
| |
| - lima: reset scissor state if scissor test is disabled |
| - lima: fix PLBU viewport configuration |
| - lima: support rectangle texture |
| - lima: do not set the PP uniforms address lowest bits |
| |
| Ilia Mirkin (6): |
| |
| - gallium/vl: use compute preference for all multimedia, not just blit |
| - teximage: ensure that Tex*SubImage\* checks format |
| - gallium/tgsi: add support for DEMOTE and READ_HELPER opcodes |
| - nvc0: add support for GL_EXT_demote_to_helper_invocation |
| - gm107/ir: fix loading z offset for layered 3d image bindings |
| - nv50/ir: mark STORE destination inputs as used |
| |
| Illia Iorin (2): |
| |
| - Revert "mesa/main: Fix multisample texture initialize" |
| - mesa/main: Ignore filter state for MS texture completeness |
| |
| Indrajit Das (1): |
| |
| - radeon/vcn: exclude raven2 from vcn 2.0 encode initialization |
| |
| James Xiong (5): |
| |
| - gallium: simplify throttle implementation |
| - gallium: rename PIPE_CAP_MAX_FRAMES_IN_FLIGHT to PIPE_CAP_THROTTLE |
| - iris: finish aux import on get_param |
| - gallium: do not increase ref count of the new throttle fence |
| - iris: try to set the specified tiling when importing a dmabuf |
| |
| Jan Beich (6): |
| |
| - gallium/hud: add CPU usage support for DragonFly/NetBSD/OpenBSD |
| - util: skip NEON detection if built with -mfpu=neon |
| - util: detect NEON at runtime on FreeBSD |
| - util: skip AltiVec detection if built with -maltivec |
| - util: detect AltiVec at runtime on BSDs |
| - util: simplify BSD includes |
| |
| Jan Zielinski (3): |
| |
| - swr/rasterizer: Enable ARB_fragment_layer_viewport |
| - swr/rasterizer: Fix GS attributes processing |
| - gallium/swr: Fix depth values for blit scenario |
| |
| Jason Ekstrand (57): |
| |
| - nir: Add explicit signs to image min/max intrinsics |
| - intel/nir: Add a helper for getting BRW_AOP from an intrinsic |
| - v3d: Use the correct opcodes for signed image min/max |
| - intel/fs: Drop the gl_program from fs_visitor |
| - intel/fs: Fix FB write inst groups |
| - Revert "intel/fs: Move the scalar-region conversion to the |
| generator." |
| - anv: Bump maxComputeWorkgroupSize |
| - intel/tools: Decode 3DSTATE_BINDING_TABLE_POINTERS on SNB |
| - intel/tools: Decode PS kernels on SNB |
| - blorp: Memset surface info to zero when initializing it |
| - intel/blorp: Expose surf_retile_w_to_y internally |
| - intel/blorp: Expose surf_fake_interleaved_msaa internally |
| - intel/blorp: Use wide formats for nicely aligned stencil clears |
| - nir: Handle complex derefs in nir_split_array_vars |
| - nir: Don't infinitely recurse in lower_ssa_defs_to_regs_block |
| - nir: Add a block_is_unreachable helper |
| - nir/repair_ssa: Repair dominance for unreachable blocks |
| - nir/repair_ssa: Insert deref casts when needed |
| - nir/dead_cf: Repair SSA if the pass makes progress |
| - intel/fs: Handle UNDEF in split_virtual_grfs |
| - vulkan: Update the XML and headers to 1.1.123 |
| - Move blob from compiler/ to util/ |
| - util/rb_tree: Add the unit tests |
| - util/rb_tree: Reverse the order of comparison functions |
| - intel/fs: Allow UB, B, and HF types in brw_nir_reduction_op_identity |
| - intel/fs: Allow CLUSTER_BROADCAST to do type conversion |
| - intel/fs: Do 8-bit subgroup scan operations in 16 bits |
| - anv: Advertise VK_KHR_shader_subgroup_extended_types |
| - nir/repair_ssa: Replace the unreachable check with the phi builder |
| - util/rb_tree: Replace useless ifs with asserts |
| - util/rb_tree: Also test \_safe iterators |
| - util/rb_tree: Stop relying on &iter->field != NULL |
| - intel/fs: Fix fs_inst::flags_read for ANY/ALL predicates |
| - anv/pipeline: Capture serialized NIR |
| - intel/eu/validate/gen12: Don't blow up on indirect src0. |
| - intel/fs/gen12: Implement gl_FrontFacing on gen12+. |
| - intel/genxml: Remove W-tiling on gen12 |
| - intel/isl: Select Y-tiling for stencil on gen12 |
| - intel/isl: Add isl_aux_usage_has_ccs |
| - spirv/info: Add a memorymodel_to_string helper |
| - Revert "mapi: Inline call x86_current_tls." |
| - intel/blorp: Use surf instead of aux_surf for image dimensions |
| - intel/isl: Add new aux modes available on gen12 |
| - intel/isl/fill_state: Separate aux_mode handling from aux_surf |
| - intel/isl: Update surf_fill_state for gen12 |
| - intel/isl: Support HIZ_CCS in emit_depth_stencil_hiz |
| - anv: Delay allocation of relocation lists |
| - anv: Reduce the minimum number of relocations |
| - intel/vec4: Set brw_stage_prog_data::has_ubo_pull |
| - anv: Avoid emitting UBO surface states that won't be used |
| - anv: Fix a potential BO handle leak |
| - anv/tests: Zero-initialize instances |
| - anv: Set the batch allocator for compute pipelines |
| - anv: Stop bounds-checking pushed UBOs |
| - anv: Set up SBE_SWIZ properly for gl_Viewport |
| - anv: Re-emit all compute state on pipeline switch |
| - anv: Don't leak when set_tiling fails |
| |
| Jean Hertel (1): |
| |
| - Fix missing dri2_load_driver on platform_drm |
| |
| Jiadong Zhu (1): |
| |
| - mesa: fix texStore for FORMAT_Z32_FLOAT_S8X24_UINT |
| |
| Jiang, Sonny (1): |
| |
| - loader: always map the "amdgpu" kernel driver name to radeonsi (v2) |
| |
| John Stultz (1): |
| |
| - Android.mk: Fix missing \\ from recent llvm change |
| |
| Jon Turney (2): |
| |
| - Fix timespec_from_nsec test for 32-bit time_t |
| - rbug: Fix use of alloca() without #include "c99_alloca.h" |
| |
| Jonathan Gray (3): |
| |
| - mapi: Adapted libglvnd x86 tsd changes |
| - winsys/amdgpu: avoid double simple_mtx_unlock() |
| - i965: update Makefile.sources for perf changes |
| |
| Jonathan Marek (90): |
| |
| - freedreno/a2xx: ir2: fix lowering of instructions after float |
| lowering |
| - freedreno/a2xx: ir2: remove pointcoord y invert |
| - freedreno/a2xx: ir2: set lower_fdph |
| - freedreno/a2xx: ir2: fix saturate in cp |
| - freedreno/a2xx: ir2: check opcode on the right instruction in export |
| cp |
| - freedreno/a2xx: ir2: fix incorrect instruction reordering |
| - freedreno/a2xx: ir2: update register state in scalar insert |
| - freedreno/a2xx: fix SRC_ALPHA_SATURATE for alpha blend function |
| - freedreno/a2xx: implement polygon offset |
| - freedreno/a2xx: fix depth gmem restore |
| - freedreno/a2xx: formats update |
| - u_format: add ETC2 to util_format_srgb/util_format_linear |
| - u_format: float type for R11G11B10_FLOAT/R9G9B9E5_FLOAT |
| - etnaviv: fix two-sided stencil |
| - turnip: fix binning shader compilation |
| - turnip: use image tile_mode for gmem configuration |
| - turnip: emit shader immediates |
| - turnip: fix vertex_id |
| - turnip: implement sampler state |
| - turnip: implement image view descriptor |
| - turnip: use linear tiling for scanout image |
| - turnip: align layer_size |
| - turnip: enable linear filtering |
| - turnip: basic descriptor sets (uniform buffer and samplers) |
| - turnip: lower samplers and uniform buffer indices |
| - turnip: use nir_opt_copy_prop_vars |
| - turnip: add some shader information in pipeline state |
| - turnip: emit texture and uniform state |
| - etnaviv: nir: fix gl_FrontFacing |
| - etnaviv: nir: allocate contiguous components for LOAD destination |
| - etnaviv: nir: set num_components for inputs/outputs |
| - qetnaviv: nir: use new immediates when possible |
| - etnaviv: nir: add native integers (HALTI2+) |
| - etnaviv: nir: use store_deref instead of store_output |
| - etnaviv: nir: remove "options" struct |
| - etnaviv: remove extra allocation for shader code |
| - etnaviv: nir: make lower_alu easier to follow |
| - etnaviv: disable earlyZ when shader writes fragment depth |
| - etnaviv: nir: fix gl_FragDepth |
| - etnaviv: update headers from rnndb |
| - etnaviv: implement texture comparator |
| - etnaviv: set texture INT_FILTER bit |
| - etnaviv: clear texture cache and flush ts when texture is modified |
| - etnaviv: get addressing mode from tiling layout |
| - etnaviv: rework compatible render base |
| - etnaviv: rework etna_resource_create tiling choice |
| - freedreno/ir3: remove input ncomp field |
| - freedreno/ir3: increase size of inputs/outputs arrays |
| - freedreno/ir3: implement fdd{x,y}_coarse opcodes |
| - freedreno/ir3: fix GETLOD for negative LODs |
| - freedreno/ir3: implement texop_texture_samples |
| - freedreno/ir3: implement fquantize2f16 |
| - freedreno/regs: update a6xx 2d blit bits |
| - turnip: fix triangle strip |
| - turnip: fix 32 vertex attributes case |
| - turnip: fix segmentation fault in events |
| - turnip: fix segmentation fault with compute pipeline |
| - turnip: fix assert failing for 0 color attachments |
| - turnip: add astc format layout |
| - turnip: add format_is_uint/format_is_sint |
| - turnip: format table fixes |
| - turnip: add more 2d_ifmt translations |
| - turnip: improve view descriptor |
| - turnip: improve sampler descriptor |
| - turnip: add black border color |
| - turnip: add VK_KHR_sampler_mirror_clamp_to_edge |
| - turnip: update setup_slices |
| - turnip: disable tiling as necessary |
| - turnip: add anisotropy and compressed formats to device features |
| - turnip: update some shader state bits from GL driver |
| - turnip: fixup consts |
| - turnip: add code to lower indirect samplers |
| - turnip: add missing nir passes |
| - turnip: use nir_assign_io_var_locations instead of |
| nir_assign_var_locations |
| - turnip: improve CmdCopyImage and implement CmdBlitImage |
| - turnip: basic msaa working |
| - turnip: depth/stencil |
| - turnip: push constants |
| - turnip: more descriptor sets |
| - spirv: set correct dest_type for texture query ops |
| - etnaviv: fix linear_nearest / nearest_linear filters on GC7000Lite |
| - etnaviv: fix TS samplers on GC7000L |
| - etnaviv: check NO_ASTC feature bit |
| - freedreno/a2xx: use sysval for pointcoord |
| - freedreno/a2xx: add missing vertex formats (SSCALE/USCALE/FIXED) |
| - etnaviv: fix depth bias |
| - etnaviv: stencil fix |
| - etnaviv: fix non-pointsprite points on GC7000L |
| - freedreno/ir3: disable texture prefetch for 1d array textures |
| - freedreno/registers: fix a6xx_2d_blit_cntl ROTATE |
| |
| Jordan Justen (42): |
| |
| - intel/genxml: Handle field names with different spacing/hyphen |
| - intel/genxml/gen11: Add spaces in EnableUnormPathInColorPipe |
| - intel/genxml: Run sort_xml.sh to tidy gen9.xml and gen11.xml |
| - intel/genxml: Add gen12.xml as a copy of gen11.xml |
| - intel/genxml: Build gen12 genxml |
| - intel/isl: Build gen12 using gen11 code paths |
| - intel/compiler: Disable compaction on gen12 for now |
| - intel/l3: Don't assert on gen12 (use gen11 config temporarily) |
| - iris: Build for gen12 |
| - anv: Build for gen12 |
| - i965: Exit with error if gen12+ is detected |
| - pci_id_driver_map: Support preferring iris over i965 |
| - anv,iris: L3ALLOC register replaces L3CNTLREG for gen12 |
| - iris/state: Move reg/mem load/store functions earlier in file |
| - intel/ir: Lower fpow on Gen12. |
| - intel/genxml,isl: Add gen12 render surface state changes |
| - intel/genxml,isl: Add gen12 depth buffer changes |
| - intel/genxml,isl: Add gen12 stencil buffer changes |
| - intel/isl: Add gen12 depth/stencil surface alignments |
| - iris: Let isl decide the supported tiling in more situations |
| - intel/isl: Add R10G10B10_FLOAT_A2_UNORM format |
| - iris/resource: Use isl surface alignment during bo allocation |
| - intel/common: Add interface to allocate device buffers |
| - anv: Implement aux-map allocator interface |
| - intel/common: Add surface to aux map translation table support |
| - anv/gen12: Initialize aux map context |
| - genxml/gen12: Add AUX MAP register definitions |
| - anv/gen12: Write GFX_AUX_TABLE base address register |
| - iris/bufmgr: Initialize aux map context for gen12 |
| - isl/gen12: 64k surface alignment |
| - iris: Map each surf to it's aux-surf in the aux-map tables |
| - iris/gen12: Write GFX_AUX_TABLE base address register |
| - iris: Mark aux-map BO as used by all batches |
| - intel: Update alignment restrictions for HiZ surfaces. |
| - iris: Set MOCS for external surfaces to uncached |
| - intel/genxml: Add gen12 tile cache flush bit |
| - intel/dev: Add preliminary device info for Tigerlake |
| - intel/eu/validate/gen12: Add TGL to eu_validate tests. |
| - docs/relnotes/new_features.txt: Add note about gen12 support |
| - iris: Add IRIS_DIRTY_RENDER_BUFFER state flag |
| - iris/gen11+: Move flush for render target change |
| - iris: Allow max dynamic pool size of 2GB for gen12 |
| |
| Jose Maria Casanova Crespo (5): |
| |
| - mesa: recover target_check before get_current_tex_objects |
| - v3d: writes to magic registers aren't RF writes after THREND |
| - v3d: flag dirty state when binding compute states |
| - v3d: Explicitly expose OpenGL ES Shading Language 3.1 |
| - v3d: Fix predication with atomic image operations |
| |
| José Fonseca (5): |
| |
| - glx: Fix incompatible function pointer types. |
| - util: Prevent implicit declaration of function getenv. |
| - util: Prevent strcasecmp macro redefinion. |
| - scons: Make GCC builds stricter. |
| - scons: Fix force_scons parsing. |
| |
| Juan A. Suarez Romero (14): |
| |
| - docs: add release notes for 19.1.5 |
| - docs: add sha256 checksums for 19.1.5 |
| - docs: update calendar, add news item and link release notes for |
| 19.1.5 |
| - docs: add release notes for 19.1.6 |
| - docs: add sha256 checksums for 19.1.6 |
| - docs: update calendar, add news item and link release notes for |
| 19.1.6 |
| - docs: extend 19.1.x releases |
| - docs: add release notes for 19.1.7 |
| - docs: add sha256 checksums for 19.1.7 |
| - docs: update calendar, add news item and link release notes for |
| 19.1.7 |
| - bin/get-pick-list.sh: sha1 commits can be smaller than 8 chars |
| - docs: add release notes for 19.1.8 |
| - docs: add release notes for 19.1.8 |
| - docs: update calendar, add news item and link release notes for |
| 19.1.8 |
| |
| Karol Herbst (15): |
| |
| - gallium: add blob field to pipe_llvm_program_header |
| - rename pipe_llvm_program_header to pipe_binary_program_header |
| - clover/functional: add id_equals helper |
| - clover: add support for drivers having no proper binary format |
| - clover: prepare supporting multiple IRs |
| - clover: add support for passing kernels as nir to the driver |
| - nvc0: expose spirv support |
| - clover/nir: fix compilation with g++-5.5 and maybe earlier |
| - nv50/ir: fix unnecessary parentheses warning |
| - nv50/ir/nir: comparison of integer expressions of different |
| signedness warning |
| - clover/llvm: remove harmful std::move call |
| - clover/codegen: remove unused get_symbol_offsets function |
| - clover: eliminate "ignoring attributes on template argument" warning |
| - st/mesa: fix crash for drivers supporting nir defaulting to tgsi |
| - nv50/ir: remove DUMMY edge type |
| |
| Ken Mays (1): |
| |
| - haiku: fix Mesa build |
| |
| Kenneth Graunke (86): |
| |
| - gallium/ddebug: Wrap resource_get_param if available |
| - gallium/trace: Wrap resource_get_param if available |
| - gallium/rbug: Wrap resource_get_param if available |
| - gallium/noop: Implement resource_get_param |
| - iris: Replace devinfo->gen with GEN_GEN |
| - iris: Fix broken aux.possible/sampler_usages bitmask handling |
| - iris: Update fast clear colors on Gen9 with direct immediate writes. |
| - iris: Drop copy format hacks from copy region based transfer path. |
| - iris: Avoid unnecessary resolves on transfer maps |
| - iris: Set MOCS in all STATE_BASE_ADDRESS commands |
| - iris: Fix large timeout handling in rel2abs() |
| - isl: Drop UnormPathInColorPipe for buffer surfaces. |
| - isl: Don't set UnormPathInColorPipe for integer surfaces. |
| - iris: Delete dead prototype |
| - intel/compiler: Fix src0/desc setter ordering |
| - intel/compiler: Handle bits 15:12 in |
| brw_send_indirect_split_message() |
| - intel/compiler: Refactor FB write message control setup into a |
| helper. |
| - intel/compiler: Use generic SEND for Gen7+ FB writes |
| - intel/compiler: Use new Gen11 headerless RT writes for MRT cases |
| - util: Add a \_mesa_i64roundevenf() helper. |
| - mesa: Fix \_mesa_float_to_unorm() on 32-bit systems. |
| - iris: Drop swizzling parameter from s8_offset. |
| - iris: Don't auto-flush/dirty on transfer unmap for coherent buffers |
| - iris: Actually describe bo_reuse driconf option |
| - iris: Fix partial fast clear checks to account for miplevel. |
| - iris: Lessen texture cache hack flush for blits/copies on Icelake. |
| - iris: Report correct number of planes for planar images |
| - iris: Invalidate state/texture/constant caches after |
| STATE_BASE_ADDRESS |
| - intel: Stop redirecting state cache to command streamer cache section |
| - iris: Support the disable_throttling=true driconf option. |
| - iris: Ignore line stipple information if it's disabled |
| - iris: Add support for the always_flush_cache=true debug option. |
| - iris: Optimize out redundant sampler state binds |
| - iris: Avoid flushing for cache history on transfer range flushes |
| - iris: Fix constant buffer sizes for non-UBOs |
| - gallium: Fix util_format_get_depth_only |
| - iris: Finish initializing the BO before stuffing it in the hash table |
| - iris: Set bo->reusable = false in iris_bo_make_external_locked |
| - st/mesa: Only pause queries if there are any active queries to pause. |
| - iris: trivial whitespace fixes |
| - iris: Initialize ice->state.prim_mode to an invalid value |
| - st/mesa: Prefer 5551 formats for GL_UNSIGNED_SHORT_5_5_5_1. |
| - st/mesa: Increase GL_POINT_SIZE_RANGE minimum to 1.0 |
| - intel/compiler: Set "Null Render Target" ex_desc bit on Gen11 |
| - iris: Skip allocating a null surface when there are 0 color regions. |
| - iris: Flag IRIS_DIRTY_BINDINGS_XS on constant buffer rebinds |
| - iris: Explicitly emit 3DSTATE_BTP_XS on Gen9 with DIRTY_CONSTANTS_XS |
| - iris: Don't flag IRIS_DIRTY_BINDINGS for constant usage history |
| - iris: Track per-stage bind history, reduce work accordingly |
| - intel/compiler: Record whether any pull constant loads occur |
| - iris: Avoid uploading SURFACE_STATE descriptors for UBOs if possible |
| - iris: Use state_refs for draw parameters. |
| - iris: Rework iris_update_draw_parameters to be more efficient |
| - iris: Skip double-disabling TCS/TES/GS after BLORP operations |
| - isl: Drop WaDisableSamplerL2BypassForTextureCompressedFormats on |
| Gen11 |
| - st/mesa: Bail on incomplete attachments in discard_framebuffer |
| - intel/genxml: Stop manually scrubbing 'α' -> "alpha" |
| - broadcom/genxml: Stop manually scrubbing 'α' -> "alpha" |
| - Revert "intel/gen11+: Enable Hardware filtering of Semi-Pipelined |
| State in WM" |
| - intel: Increase Gen11 compute shader scratch IDs to 64. |
| - iris: Only resolve for image levels/layers which are actually in use. |
| - iris: Disable CCS_E for 32-bit floating point textures. |
| - iris: Fix iris_rebind_buffer() for VBOs with non-zero offsets. |
| - st/dri: Perform MSAA downsampling for \__DRI2_THROTTLE_COPYSUBBUFFER |
| - dri: Avoid swapbuffer throttling in glXCopySubBufferMESA |
| - iris: Refactor push constant allocation so we can reuse it |
| - iris: Hack up a SKL/Gen9LP PS push constant fifo depth workaround |
| - Revert "iris: Hack up a SKL/Gen9LP PS push constant fifo depth |
| workaround" |
| - iris: Drop bonus parameters from iris_init_*_context() |
| - iris: Drop vtbl usage for some load_register calls |
| - iris: Update comment about 3-component formats and buffer textures |
| - iris: Properly unreference extra VBOs for draw parameters |
| - st/mesa: Fix inverted polygon stipple condition |
| - iris: Implement the Broadwell NP Z PMA Stall Fix |
| - intel/fs/gen12: Use TCS 8_PATCH mode. |
| - iris: Implement the Gen < 9 tessellation quads workaround |
| - mesa: Use ctx->ReadBuffer in glReadBuffer back-to-front tests |
| - mesa: Make back_to_front_if_single_buffered non-static |
| - mesa: Handle pbuffers in desktop GL framebuffer attachment queries |
| - intel/compiler: Report the number of non-spill/fill SEND messages |
| - st/mesa: Silence chatty debug printf |
| - iris: Rework edgeflag handling |
| - nir: Use VARYING_SLOT_TESS_MAX to size indirect bitmasks |
| - iris: Fix "Force Zero RTA Index Enable" setting again |
| - driconf, glsl: Add a vs_position_always_invariant option |
| - drirc: Set vs_position_always_invariant for Shadow of Mordor on Intel |
| |
| Kevin Strasser (14): |
| |
| - drm-uapi: Update headers for fp16 formats |
| - i965: Add helper function for allowed config formats |
| - gallium: Use consistent approach for config format filtering |
| - dri: Add config attributes for color channel shift |
| - util: move bitcount to bitscan.h |
| - egl: Convert configs to use shifts and sizes instead of masks |
| - glx: Add fields for color shifts |
| - dri: Handle configs with floating point pixel data |
| - egl: Handle dri configs with floating point pixel data |
| - dri: Add fp16 formats |
| - gbm: Add buffer handling and visuals for fp16 formats |
| - i965: Add handling for fp16 configs |
| - gallium: Add buffer and configs handling or fp16 formats |
| - egl: Fix implicit declaration of ffs |
| |
| Khaled Emara (2): |
| |
| - freedreno/a3xx: fix texture tiling parameters |
| - freedreno/a3xx: fix sysmem <-> gmem tiles transfer |
| |
| Kristian Høgsberg (40): |
| |
| - freedreno/a6xx: Let the GPU track streamout offsets |
| - freedreno/a6xx: Implement primitive count queries on GPU |
| - freedreno/a6xx: Track location of gl_Position out as we link it |
| - freedreno/a6xx: Share shader state constructor and destructor |
| - freedreno/a6xx: Turn on vectorize_io |
| - freedreno/a6xx: Write multiple regs for SP_VS_OUT_REG and |
| SP_VS_VPC_DST_REG |
| - freedreno/regs: Fix CP_DRAW_INDX_OFFSET command |
| - freedreno/regs: A couple of tess updates |
| - freedreno/a6xx: Factor out const state setup |
| - freedreno: Rename vp and fp to vs and fs in fd_program_stateobj |
| - freedreno: Add state binding functions for HS/DS/GS |
| - freedreno: Move fs functions after geometry pipeline stages |
| - freedreno/a6xx: Add generic program stateobj support for HS/DS/GS |
| - freedreno/ir3: Add HS/DS/GS to shader key and cache |
| - freedreno/a6xx: Emit const and texture state for HS/DS/GS |
| - freedreno/a6xx: Move instrlen and obj_start writes to fd6_emit_shader |
| - freedreno/registers: Update with GS, HS and DS registers |
| - freedreno/a6xx: Trim a few regs from fd6_emit_restore() |
| - freedreno/ir3: Add support for CHSH and CHMASK instructions |
| - freedreno/ir3: Use third register for offset for LDL and LDLV |
| - freedreno/ir3: Extend RA with mechanism for pre-coloring registers |
| - freedreno/ir3: Add new LDLW/STLW instructions |
| - freedreno/ir3: Add intrinsics that map to LDLW/STLW |
| - freedreno/a6xx: Add missing adjacency primitives to table |
| - freedreno/ir3: Add has_gs flag to shader key |
| - freedreno/ir3: Implement lowering passes for VS and GS |
| - freedreno/ir3: Implement primitive layout intrinsics |
| - freedreno/ir3: Setup ir3 inputs and outputs for GS |
| - freedreno/ir3: Pre-color GS header and primitive ID |
| - freedreno/ir3: Start GS with (ss) and (sy) |
| - freedreno/ir3: End VS with CHMASK and CHSH in GS pipelines |
| - freedreno/a6xx: Emit program state for GS |
| - freedreno/a6xx: Support layered render targets |
| - st/mesa: Also enable GS when ESSLVersion > 320 |
| - freedreno/blitter: Save GS state |
| - freedreno/a6xx: Implement PIPE_QUERY_PRIMITIVES_GENERATED for GS |
| - freedreno/ci: Add failing tests to skip list |
| - freedreno/a6xx: Turn on geometry shaders |
| - nir: Use BITSET for tracking varyings in lower_io_arrays |
| - freedreno/a6xx: Disable geometry shaders for release |
| |
| Krzysztof Raszkowski (2): |
| |
| - util: Add unreachable() definition for clang compiler. |
| - gallium/swr: Enable GL_ARB_gpu_shader5: multiple streams |
| |
| Laurent Carlier (1): |
| |
| - egl: avoid local modifications for eglext.h Khronos standard header |
| file |
| |
| Leo Liu (3): |
| |
| - radeon/vcn: add RENOIR VCN decode support |
| - radeon/vcn: Add VP9 8K decode support |
| - radeonsi: enable 8K video decode support for HEVC and VP9 |
| |
| Lepton Wu (14): |
| |
| - st/mesa: Allow zero as [level|layer]_override |
| - virgl: Fix pipe_resource leaks under multi-sample. |
| - egl/android: Only keep BGRA EGL configs as fallback |
| - virgl: replace fprintf with \_debug_printf |
| - virgl: Remove wrong EAGAIN handling for drmIoctl |
| - gbm: Add GBM_MAX_PLANES definition |
| - egl/android: Remove our own reference to buffers. |
| - virgl: Remove formats with unusual sample count. |
| - mapi: Inline call x86_current_tls. |
| - mapi: split entry_generate_or_patch for x86 tls |
| - mapi: Clean up entry_patch_public for x86 tls |
| - mapi: Inline call x86_current_tls. |
| - mapi: Improve the x86 tsd stubs performance. |
| - gallium: dri2: Use index as plane number. |
| |
| Lionel Landwerlin (59): |
| |
| - glsl/tests: take refs on glsl types |
| - nir/tests: take reference on glsl types |
| - compiler: ensure glsl types are not created without a reference |
| - mesa/compiler: rework tear down of builtin/types |
| - radeonsi: take reference glsl types for compile threads |
| - i965: honor scanout requirement from DRI |
| - util/timespec: use unsigned 64 bit integers for nsec values |
| - util: fix compilation on macos |
| - egl: fix platform selection |
| - vulkan/overlay: bounce image back to present layout |
| - intel: update product names for WHL |
| - radv: store engine name |
| - driconfig: add a new engine name/version parameter |
| - vulkan: add vk_x11_strict_image_count option |
| - util/xmlconfig: fix regexp compile failure check |
| - drirc: include unreal engine version 0 to 23 |
| - anv: gem-stubs: return a valid fd got anv_gem_userptr() |
| - intel: use proper label for Comet Lake skus |
| - intel: Add new Comet Lake PCI-ids |
| - mesa: don't forget to clear \_Layer field on texture unit |
| - intel: fix topology query |
| - intel/error2aub: add support for platforms without PPGTT |
| - intel: fix subslice computation from topology data |
| - intel/isl: Set null surface format to R32_UINT |
| - intel/isl: set surface array appropriately |
| - intel/isl: set vertical surface alignment on null surfaces |
| - etnaviv: remove variable from global namespace |
| - anv: fix vkUpdateDescriptorSets with inline uniform blocks |
| - anv: fix memory leak on device destroy |
| - anv: fix unwind of vkCreateDevice fail |
| - intel/perf: add mdapi maker helper |
| - intel/perf: expose some utility functions |
| - intel/perf: extract register configuration |
| - intel/perf: move registers to their own header |
| - drm-uapi: Update headers from drm-next |
| - intel/perf: add support for querying kernel loaded configurations |
| - intel/genxml: add generic perf counters registers |
| - intel/genxml: add RPSTAT register for core frequency |
| - intel/perf: add mdapi writes for register perf counters |
| - anv: implement VK_INTEL_performance_query |
| - docs: Add new Intel extension |
| - intel/dev: store whether the device uses an aux map tables on devinfo |
| - anv: Add aux-map translation for gen12+ |
| - intel/perf: update ICL configurations |
| - intel/dump_gpu: handle context create extended ioctl |
| - intel/dev: set default num_eu_per_subslice on gen12 |
| - mesa: check draw buffer completeness on |
| glClearBufferfi/glClearBufferiv |
| - anv: Properly handle host query reset of performance queries |
| - mesa: check framebuffer completeness only after state update |
| - anv: invalidate file descriptor of semaphore sync fd at vkQueueSubmit |
| - anv: remove list items on batch fini |
| - anv/wsi: signal the semaphore in the acquireNextImage |
| - intel/perf: fix invalid hw_id in query results |
| - intel/perf: set read buffer len to 0 to identify empty buffer |
| - intel/perf: take into account that reports read can be fairly old |
| - intel/perf: simplify the processing of OA reports |
| - intel/perf: fix improper pointer access |
| - anv: fix missing gen12 handling |
| - anv: fix incorrect VMA alignment for CCS main surfaces |
| |
| Lucas Stach (17): |
| |
| - etnaviv: fix vertex buffer state emission for single stream GPUs |
| - gallium/util: don't depend on implementation defined behavior in |
| listen() |
| - rbug: fix transmitted texture sizes |
| - rbug: unwrap index buffer resource |
| - rbug: move flush_resource initialization |
| - rbug: implement missing explicit sync related fence functions |
| - rbug: forward texture_barrier to pipe driver |
| - rbug: forward can_create_resource to pipe driver |
| - rbug: implement resource creation with modifier |
| - rbug: remove superfluous NULL check |
| - etnaviv: keep references to pending resources |
| - etnaviv: drm: remove unused etna_cmd_stream_finish |
| - etnaviv: rework the stream flush to always go through the context |
| flush |
| - etnaviv: drm: add softpin interface |
| - etnaviv: check for softpin availability on Halti5 devices |
| - etnaviv: add linear texture support on GC7000 |
| - etnaviv: GC7000: flush TX descriptor and instruction cache |
| |
| Marek Olšák (161): |
| |
| - radeonsi/gfx10: fix the legacy pipeline by storing as_ngg in the |
| shader cache |
| - radeonsi: move some global shader cache flags to per-binary flags |
| - radeonsi/gfx10: fix tessellation for the legacy pipeline |
| - radeonsi/gfx10: fix the PRIMITIVES_GENERATED query if using legacy |
| streamout |
| - radeonsi/gfx10: create the GS copy shader if using legacy streamout |
| - radeonsi/gfx10: add as_ngg variant for VS as ES to select Wave32/64 |
| - radeonsi/gfx10: fix InstanceID for legacy VS+GS |
| - radeonsi/gfx10: don't initialize VGT_INSTANCE_STEP_RATE_0 |
| - radeonsi/gfx10: always use the legacy pipeline for streamout |
| - radeonsi/gfx10: finish up Navi14, add PCI ID |
| - radeonsi/gfx10: add AMD_DEBUG=nongg |
| - winsys/amdgpu+radeon: process AMD_DEBUG in addition to R600_DEBUG |
| - radeonsi: add PKT3_CONTEXT_REG_RMW |
| - radeonsi/gfx10: remove incorrect ngg/pos_writes_edgeflag variables |
| - radeonsi/gfx10: set PA_CL_VS_OUT_CNTL with CONTEXT_REG_RMW to fix |
| edge flags |
| - radeonsi: consolidate determining VGPR_COMP_CNT for API VS |
| - radeonsi: align scratch and ring buffer allocations for faster memory |
| access |
| - radeonsi: unbind blend/DSA/rasterizer state correctly in delete |
| functions |
| - radeonsi: fix scratch buffer WAVESIZE setting leading to corruption |
| - ac: enable LLVM atomic optimizations |
| - ac: use fma on gfx10 |
| - radeonsi/gfx10: use fma for TGSI_OPCODE_FMA |
| - radeonsi/gfx10: don't call gfx10_destroy_query with compute-only |
| contexts |
| - radeonsi: disable DCC when importing a texture from an incompatible |
| driver |
| - radeonsi: only support at most 1024 threads per block |
| - radeonsi/gfx10: fix wave occupancy computations |
| - r300,r600,radeonsi: read winsys_handle::stride,offset in drivers, not |
| winsyses |
| - r300,r600,radeonsi: set winsys_handle::stride,offset in drivers, not |
| winsyses |
| - ac/surface: add RADEON_SURF_NO_FMASK |
| - radeonsi: handle NO_DCC early |
| - radeonsi: move HTILE allocation outside of radeonsi |
| - radeonsi: move texture storage allocation outside of radeonsi |
| - radeonsi: remove redundant si_texture offset and size fields |
| - ac: replace HAVE_LLVM with LLVM_VERSION_MAJOR for |
| atomic-optimizations |
| - prog_to_nir, tgsi_to_nir: make sure kill doesn't discard NaNs |
| - radeonsi/gfx9: honor user stride for imported buffers |
| - radeonsi: add Navi12 PCI ID |
| - ac: move PBB MAX_ALLOC_COUNT into radeon_info |
| - ac: move num_sdp_interfaces into radeon_info |
| - ac: move ac_get_max_wave64_per_simd into radeon_info |
| - ac: move ac_get_num_physical_sgprs into radeon_info |
| - ac: move ac_get_num_physical_vgprs into radeon_info |
| - gallium: extend resource_get_param to be as capable as |
| resource_get_handle |
| - radeonsi: implement pipe_screen::resource_get_param |
| - radeonsi: include drm_fourcc.h to fix the build |
| - amd: add more PCI IDs for Navi14 |
| - ac/addrlib: fix chip identification for Vega10, Arcturus, Raven2, |
| Renoir |
| - ac: stop using PCI IDs for chip identification |
| - amd: remove all PCI IDs supported by amdgpu |
| - nir: don't add bindless variables to num_textures and num_images |
| - nir: define 8-byte size and alignment for bindless variables |
| - tgsi_to_nir: fix masked out image loads |
| - tgsi_to_nir: fix 2-component system values like |
| tess_level_inner_default |
| - ac/nir: port Z compare value clamping from radeonsi |
| - ac/nir: force unnormalized coordinates for RECT |
| - radeonsi: initialize displayable DCC using the retile blit to prevent |
| hangs |
| - gallium/vl: don't set PIPE_HANDLE_USAGE_EXPLICIT_FLUSH |
| - radeonsi/gfx10: fix L2 cache rinse programming |
| - ac: fix incorrect vram_size reported by the kernel |
| - ac: add radeon_info::tcc_harvested |
| - radeonsi/gfx10: fix corruption for chips with harvested TCCs |
| - ac: fix num_good_cu_per_sh for harvested chips |
| - ac: set the number of SDPs same as the number of TCCs |
| - ac: reorder and print all radeon_info fields |
| - tgsi_to_nir: handle PIPE_FORMAT_NONE in image opcodes |
| - ac/surface: don't allocate FMASK if there is no graphics |
| - ac: add ac_build_image_get_sample_count from radeonsi |
| - ac/nir: fix GLSL imageSamples() |
| - winsys/radeon: initialize SIMD properties in radeon_info |
| - util: use simple_mtx_t for util_range |
| - gallium: add PIPE_RESOURCE_FLAG_SINGLE_THREAD_USE to skip util_range |
| lock |
| - st/mesa: use simple_mtx_t instead of mtx_t |
| - radeonsi: use simple_mtx_t instead of mtx_t |
| - amd: don't use AMD_FAMILY definitions from amdgpu_drm.h |
| - gallium/util: remove enum numbering from util_format_layout |
| - gallium/util: add planar format layouts and helpers |
| - gallium/u_tests: test NV12 allocation and export |
| - vl: use u_format in vl_video_buffer_formats |
| - radeonsi: allocate planar multimedia formats in 1 buffer |
| - radeonsi: remove si_vid_join_surfaces and use combined planar |
| allocations |
| - radeonsi: ignore metadata for non-zero planes |
| - radeonsi: don't set BO metadata for non-zero planes |
| - nir: add shader_info::last_msaa_image |
| - tgsi/scan: add tgsi_shader_info::msaa_images_declared |
| - radeonsi: fix GLSL imageSamples() |
| - radeonsi: set the sample index for shader images correctly |
| - radeonsi: add FMASK slots for shader images (for MSAA images) |
| - radeonsi: clean up image_fetch_rsrc |
| - radeonsi: apply FMASK to MSAA image loads |
| - radeonsi: expand FMASK before MSAA image stores are used |
| - radeonsi: enable MSAA shader images |
| - nir: add a strip parameter to nir_serialize |
| - nir: move gl_nir_opt_access from glsl directory |
| - nir/drawpixels: handle load_color0, load_input, |
| load_interpolated_input |
| - nir/drawpixels: fix what appears to be a copy-paste bug in |
| get_texcoord_const |
| - tgsi_to_nir: add #ifdef header guards |
| - nir: add nir_shader_compiler_options::lower_to_scalar |
| - st/mesa: use nir_shader_compiler_options::lower_to_scalar |
| - tgsi_to_nir: use nir_shader_compiler_options::lower_to_scalar |
| - gallium: remove PIPE_SHADER_CAP_SCALAR_ISA |
| - ac/nir: add back nir_op_fmod |
| - clover: fix the nir_serialize build failure |
| - st/mesa: always allocate pack/unpack buffers as staging |
| - radeonsi/nir: simplify si_lower_nir signature |
| - st/mesa: use \*prog at the end of st_link_nir |
| - st/mesa: deduplicate code for ATI fs in st_program_string_notify |
| - st/mesa: simplify the signature of st_release_basic_variants |
| - st/mesa: don't store stream output info to shader cache for tess ctrl |
| shaders |
| - st/mesa: remove st_compute_program in favor of st_common_program |
| - st/mesa: deduplicate cases in st_deserialise_ir_program |
| - st/mesa: sink TCS/TES/GS/CS translate code into |
| st_translate_common_program |
| - st/mesa: deduplicate st_common_program code in |
| st_program_string_notify |
| - st/mesa: clean up more after the removal of st_compute_program |
| - st/mesa: move vertex program preparation code into |
| st_prepare_vertex_program |
| - st/mesa: unify transform feedback info translation code |
| - st/mesa: finalize NIR after shader variant passes for TCS/TES/GS/CS |
| - st/mesa: don't call translate_*_program functions for NIR |
| - st/mesa: call prog_to_nir sooner for ARB_fp |
| - st/mesa: reorder and document code in st_translate_vertex_program |
| - st/mesa: call the reset callback if glGetGraphicsResetStatus returns |
| a failure |
| - radeonsi: call the reset callback if get_device_reset_status returns |
| a failure |
| - radeonsi: recreate aux_context after a GPU reset |
| - gallium/u_blitter: remove an unused variable |
| - st/mesa: silence a warning in st_nir_lower_tex_src_plane |
| - st/mesa: call st_nir_opts for linked shaders only once |
| - st/mesa: lower doubles for NIR after linking |
| - st/mesa: rename st_xxx_program::tgsi to state |
| - st/mesa: rename basic -> common for st_common_program |
| - st/mesa: remove num_tgsi_tokens from st_xx_program |
| - st/mesa: remove st_vp_variant_key in favor of st_common_variant_key |
| - st/mesa: remove unused st_xxx_program::sha1 |
| - st/mesa: remove redundant function st_reference_compprog |
| - st/mesa: merge st_fragment_program into st_common_program |
| - st/mesa: don't call variables "tgsi" when they can reference NIR |
| - nir: allow nir_lower_uniforms_to_ubo to be run repeatedly |
| - st/mesa: replace pipe_shader_state with tgsi_token\* in st_vp_variant |
| - gallium/noop: implement get_disk_shader_cache and |
| get_compiler_options |
| - util/disk_cache: finish all queue jobs in destroy instead of killing |
| them |
| - util/u_queue: skip util_queue_finish if num_threads is 0 |
| - st/mesa: move some NIR lowering before shader caching |
| - st/mesa: don't lower_global_vars_to_local for VS if there are no dead |
| inputs |
| - st/mesa: assign driver locations for VS inputs for NIR before caching |
| - st/mesa: update VS shader_info for NIR after lowering passes |
| - gallium: add pipe_screen::finalize_nir |
| - tgsi_to_nir: use pipe_screen::finalize_nir |
| - st/mesa: use pipe_screen::finalize_nir |
| - radeonsi/nir: implement pipe_screen::finalize_nir |
| - glsl/serialize: restructure remap table code |
| - glsl/serialize: optimize for equal offsets in uniform remap tables |
| - include: add the definition of EGL_EXT_image_flush_external |
| - dri_interface: add interface for EGL_EXT_image_flush_external |
| - st/dri: assume external consumers of back buffers can write to the |
| buffers |
| - st/dri: add support for EGL_EXT_image_flush_external |
| - egl: handle EGL_IMAGE_EXTERNAL_FLUSH_EXT |
| - egl: implement new functions from EGL_EXT_image_flush_external |
| - docs: document new feature EGL_EXT_image_flush_external |
| - radeonsi: don't print diagnostic LLVM remarks and notes |
| - radeonsi: initialize shader compilers in threads on demand |
| - ac: get tcc_harvested from the kernel |
| - winsys/amdgpu: use the new GPU reset query |
| - st/mesa: fix Sanctuary and Tropics by disabling ARB_gpu_shader5 for |
| them |
| |
| Marek Vasut (4): |
| |
| - etnaviv: Make contexts track resources |
| - etnaviv: Rework resource status tracking |
| - etnaviv: Command buffer realloc |
| - etnaviv: Rework locking |
| |
| Marijn Suijten (2): |
| |
| - freedreno/a5xx: enable a510 |
| - freedreno/ir3: Add missing ir3_nir_lower_tex_prefetch.c to Android.mk |
| |
| Matt Turner (6): |
| |
| - clover: Remove unused code |
| - intel/compiler: Remove unreachable() from brw_reg_type.c |
| - intel/compiler: Restructure instruction compaction in preparation for |
| Gen12 |
| - intel/compiler: Inline get_src_index() |
| - intel/compiler: Make separate src0/src1 index tables |
| - intel/compiler: Add instruction compaction support on Gen12 |
| |
| Mauro Rossi (8): |
| |
| - android: mesa: revert "Enable asm unconditionally" |
| - android: anv: libmesa_vulkan_common: add libmesa_util static |
| dependency |
| - android: aco: fix undefined template 'std::__1::array' build errors |
| - android: compiler/nir: build nir_divergence_analysis.c |
| - android: aco: add support for libmesa_aco |
| - android: amd/common: export amd/llvm headers |
| - android: aco: fix Lower to CSSA |
| - android: radeonsi: fix build after vl refactoring (v2) |
| |
| Maya Rashish (3): |
| |
| - intel/compiler: avoid truncating int64_t to int |
| - meson: Test for -Wl,--build-id=sha1 |
| - llvmpipe: avoid left-shifting a negative number. |
| |
| Michael Schellenberger Costa (1): |
| |
| - aco: Cleanup insert_before_logical_end |
| |
| Michel Dänzer (48): |
| |
| - gitlab-ci: Move up meson-main job definition |
| - gitlab-ci: Use new needs: keyword |
| - gitlab-ci: Explicitly install linux-libc-dev for foreign |
| architectures |
| - gitlab-ci: Keep g++ from stretch when installing foreign toolchains |
| - gitlab-ci: Add needs stanza to arm64_a306_gles2 job definition |
| - gitlab-ci: Use multiple inheritance instead of YAML references |
| - gitlab-ci: Simplify some job definitions by extending more similar |
| jobs |
| - gitlab-ci: Move dependencies/needs for meson-main job to .deqp-test |
| - gitlab-ci: Move up meson-arm64 job definition |
| - gallivm: Limit DEBUG workaround to LLVM < 7 |
| - swr: Limit DEBUG workaround to LLVM < 7 |
| - ac: Remove DEBUG workaround |
| - gitlab-ci: Reference full ci-templates commit hash |
| - gitlab-ci: Pass --no-remove to apt-get where possible |
| - gitlab-ci: Create separate docker images for Debian stretch & buster |
| - gitlab-ci: Use newer packages from backports by default |
| - gitlab-ci: Use crossbuild-essential-\* packages |
| - gitlab-ci: Move scons build/test commands to a separate shell script |
| - gitlab-ci: Test scons with all LLVM versions |
| - gitlab-ci: Merge scons-nollvm and scons-llvm jobs |
| - radeonsi: fix VAAPI segfault due to various bugs |
| - loader: Avoid use-after-free / use of uninitialized local variables |
| - gitlab-ci: Declare needs: for stretch docker image |
| - gitlab-ci: Add needs: for x86 buster docker image |
| - gitlab-ci: Add test-container:arm64 to needs: for arm64 test jobs |
| - gitlab-ci: Set ccache path for cross compilers in meson cross file |
| - gitlab-ci: Use per-job ccache |
| - dri3: Pass \__DRI2_THROTTLE_COPYSUBBUFFER from |
| loader_dri3_copy_drawable |
| - loader: Simplify handling of the radeonsi driver |
| - gitlab-ci/lava: Add needs: for container image to test jobs |
| - gitlab-ci: Remove redundant .meson-cross template script |
| - gitlab-ci: Add .use-debian-10 template |
| - gitlab-ci: Disable meson-mingw32-x86_64 job again for now |
| - gitlab-ci: Sort ARM docker image packages in alphabetical order |
| - gitlab-ci: Bring ARM docker image install script in line with x86_64 |
| - gitlab-ci: Explicitly list debian-10 in needs: for .deqp-test |
| template |
| - gitlab-ci: Use native aarch64 runner for ARM build jobs |
| - gitlab-ci: Update the meson cross file for LLVM_VERSION as well |
| - gitlab-ci: Enable llvmpipe in ARM build jobs |
| - intel/compiler: Don't left-shift by >= the number of bits of the type |
| - intel/compiler: Cast to target type before shifting left |
| - intel/fs: Check for NULL key in fs_visitor constructor |
| - gallium/util: Cast to target type before shifting left |
| - util: Use uint64_t for shifting left in sign_extend and strunc |
| - util/tests: Avoid int64_t overflow issues in fast_idiv_by_const test |
| - gitlab-ci: Enable UBSan for the meson-vulkan job |
| - gitlab-ci: Only run the pipeline if any files affecting it have |
| changed |
| - gitlab-ci: Disable meson-windows job for the time being |
| |
| Michel Zou (1): |
| |
| - scons: add py3 support |
| |
| Nanley Chery (47): |
| |
| - anv/blorp: Use BLORP_BATCH_NO_UPDATE_CLEAR_COLOR |
| - anv: Properly allocate aux-tracking space for CCS_E |
| - anv/formats: Disable I915_FORMAT_MOD_Y_TILED_CCS on TGL+ |
| - iris: Drop support for I915_FORMAT_MOD_Y_TILED_CCS on TGL+ |
| - isl: Disable CCS_D on Gen12+ |
| - anv/image: Disable CCS_D on Gen12+ |
| - anv/cmd_buffer: Don't assume CCS_E includes CCS_D |
| - iris: Don't assume CCS_E includes CCS_D |
| - isl: Round up some pitches to 512B for Gen12's CCS |
| - intel/blorp: Halve the Gen12 fast-clear/resolve rectangle |
| - intel/blorp: Don't assert aux slices match main slices |
| - anv/private: Modify aux slice helpers for Gen12 CCS |
| - i965/miptree: Avoid -Wswitch for the Gen12 aux modes |
| - isl/drm: Map HiZ and CCS tilings to Y |
| - iris: Allow for non-Y-tiled aux allocation |
| - isl: Add and use isl_tiling_flag_to_enum() |
| - isl: Redefine the CCS layout for Gen12 |
| - intel: Enable CCS_E for some formats on Gen12 |
| - intel/blorp: Disable depth testing for slow depth clears |
| - iris: Clear ::has_hiz when disabling aux |
| - intel: Use RENDER_SURFACE_STATE::DepthStencilResource |
| - intel: Use 3DSTATE_DEPTH_BUFFER::ControlSurfaceEnable |
| - intel: Enable CCS_E for R24_UNORM_X8_TYPELESS on TGL+ |
| - isl: Reduce assertions during aux surf creation |
| - intel: Support HIZ_CCS in isl_surf_get_ccs_surf |
| - intel/blorp: Assert against HiZ in surface states |
| - intel/blorp: Treat HIZ_CCS like HiZ |
| - iris: Don't guess the aux_usage |
| - iris: Create an unusable secondary aux surface |
| - iris: Define initial HIZ_CCS state and transitions |
| - iris: Enable HIZ_CCS in depth buffer instructions |
| - isl: Add isl_surf_supports_hiz_ccs_wt() |
| - intel: Refactor blorp_can_hiz_clear_depth() |
| - intel/blorp: Satisfy HIZ_CCS fast-clear alignments |
| - iris: Start using blorp_can_hiz_clear_depth() |
| - intel: Fix and use HIZ_CCS write through mode |
| - intel/blorp: Satisfy clear color rules for HIZ_CCS |
| - iris: Enable HIZ_CCS sampling |
| - iris: Don't leak the resource for unsupported modifier |
| - iris: Disallow incomplete resource creation |
| - iris: Drop iris_resource::aux::extra_aux::bo |
| - iris: Bail resource creation upon aux creation error |
| - iris: Determine aux offsets within configure_aux |
| - iris: Allocate main and aux surfaces together |
| - gallium/dri2: Fix creation of multi-planar modifier images |
| - gallium: Store the image format in winsys_handle |
| - iris: Fix import of multi-planar surfaces with modifiers |
| |
| Nataraj Deshpande (1): |
| |
| - egl/android: Enable HAL_PIXEL_FORMAT_RGBA_FP16 format |
| |
| Neil Armstrong (1): |
| |
| - Revert "ci: Disable lima until its farm can get fixed." |
| |
| Neil Roberts (6): |
| |
| - glsl: Store the precision for a function return type |
| - nir/builder: Move nir_atan and nir_atan2 from SPIR-V translator |
| - nir/builtin: Add #include u_math.h to the header |
| - nir/builtin: Add extern "C" guards to nir_builtin_builder.h |
| - glsl: Add opcodes for atan and atan2 |
| - glsl/builtin: Add alternate versions of atan using new ops |
| |
| OBATA Akio (1): |
| |
| - util: fix to detect NetBSD properly |
| |
| Paulo Zanoni (8): |
| |
| - intel/fs: grab fail_msg from v32 instead of v16 when v32->run_cs |
| fails |
| - intel/fs: make scan/reduce work with SIMD32 when it fits 2 registers |
| - intel/fs: roll the loop with the <0,1,0> additions in emit_scan() |
| - intel/fs: the maximum supported stride width is 16 |
| - intel/fs: fix SHADER_OPCODE_CLUSTER_BROADCAST for SIMD32 |
| - intel/fs: don't forget the stride at generate_shuffle |
| - intel/compiler: remove the operand restriction for src1 on GLK |
| - intel/compiler: fix nir_op_{i,u}*32 on ICL |
| |
| Pierre Moreau (5): |
| |
| - meson: Check for SPIRV-Tools and llvm-spirv |
| - clover/spirv: Add functions for validating SPIR-V binaries |
| - clover/spirv: Add functions for parsing arguments, linking programs, |
| etc. |
| - clover/llvm: Add options for dumping SPIR-V binaries |
| - clover/llvm: Add functions for compiling from source to SPIR-V |
| |
| Pierre-Eric Pelloux Prayer (1): |
| |
| - mesa: implement glTextureStorageNDEXT functions |
| |
| Pierre-Eric Pelloux-Prayer (23): |
| |
| - glsl: replace 'x + (-x)' with constant 0 |
| - mesa: fix invalid target error handling for teximage |
| - mesa: add EXT_dsa glNamedRenderbufferStorageEXT and |
| glGetNamedRenderbufferParameterivEXT |
| - mesa: add EXT_dsa glClientAttribDefaultEXT / |
| glPushClientAttribDefaultEXT |
| - mesa: add EXT_dsa NamedProgram functions |
| - mesa: add EXT_dsa glProgramUniform*EXT functions |
| - mesa: add EXT_dsa + EXT_texture_buffer_object functions |
| - mesa: add EXT_dsa + EXT_texture_integer functions |
| - mesa: add EXT_dsa + EXT_gpu_shader4 functions |
| - mesa: add EXT_dsa + EXT_gpu_program_parameters functions |
| - mesa: add EXT_dsa glGetFloati_vEXT/glGetDoublei_vEXT |
| - mesa: refactor GenerateTextureMipmap handling |
| - mesa: add EXT_dsa Generate*MipmapEXT functions |
| - mesa: add EXT_dsa NamedRenderbufferStorageMultisampleEXT function |
| - mesa: add EXT_dsa NamedCopyBufferSubDataEXT function |
| - radeonsi: align sdma byte count to dw |
| - radeonsi: sdma misc fixes |
| - radeonsi: disable sdma for gfx10 |
| - radeonsi: tell the shader disk cache what IR is used |
| - mesa: enable msaa in clear_with_quad if needed |
| - radeonsi: fix shader disk cache key |
| - radeonsi: fix multi plane buffers creation |
| - radeonsi: use gfx9.surf_offset to compute texture offset |
| |
| Plamena Manolova (8): |
| |
| - genxml: Add 3DSTATE_DEPTH_BOUNDS instruction. |
| - iris: Add support for depth bounds testing. |
| - anv: Add support for depth bounds testing. |
| - genxml: Change 3DSTATE_DEPTH_BOUNDS bias. |
| - anv: Set depthBounds to true in anv_GetPhysicalDeviceFeatures. |
| - genxml: Add 3DSTATE_SO_BUFFER_INDEX\_\* instructions |
| - iris: Implement new way for setting streamout buffers. |
| - anv: Implement new way for setting streamout buffers. |
| |
| Prodea Alexandru-Liviu (4): |
| |
| - scons/windows: Fix build with LLVM>=8 |
| - scons/MSYS2-MinGW-W64: Fix build options defaults Signed-off-by: |
| Prodea Alexandru-Liviu <liviuprodea@yahoo.com> Reviewed-by: Jose |
| Fonseca <jfonseca@vmware.com> Cc: <mesa-stable@lists.freedesktop.org> |
| - Appveyor/Meson: Add build test of osmesa gallium Signed-off-by: |
| Prodea Alexandru-Liviu <liviuprodea@yahoo.com> Acked-by: Eric |
| Engestrom <eric@engestrom.ch> Reviewed-by: Dylan Baker |
| <dylan@pnwbakers.com> |
| - Meson: Remove lib prefix from graw and osmesa when building with |
| Mingw. Also remove version sufix from osmesa swrast on Windows. |
| |
| Qiang Yu (4): |
| |
| - lima: move format handling to unified place |
| - lima: implement EGL_KHR_partial_update |
| - lima: don't use damage system when full damage |
| - lima: move damage bound build to resource |
| |
| Rafael Antognolli (13): |
| |
| - anv: Only re-emit non-dynamic state that has changed. |
| - intel/tools: Fix aubinator usage of rb_tree. |
| - anv/block_pool: Align anv_block_pool state to 64 bits. |
| - intel/tools: Factor out GGTT allocation. |
| - intel/tools: Use common code for GGTT address allocation. |
| - intel/tools: Add basic aub_context code and helpers. |
| - intel/tools: Support multiple contexts in intel_dump_gpu. |
| - intel/blorp/gen12: Set FWCC when storing the clear color. |
| - anv: Align fast clear color state buffer to a page. |
| - iris: Align fast clear color state buffer to a page. |
| - iris: Add Tile Cache Flush for Unified Cache. |
| - blorp: Add Tile Cache Flush for Unified Cache. |
| - anv: Add Tile Cache Flush for Unified Cache. |
| |
| Rhys Perry (84): |
| |
| - nir/lower_io_to_vector: allow FS outputs to be vectorized |
| - nir/lower_io_to_vector: add flat mode |
| - util: include u_endian.h in u_math.h |
| - nir/lower_io_to_vector: don't merge compact varyings |
| - radv: keep GS threads with excessive emissions which could write to |
| memory |
| - radv: always emit a position export in gs copy shaders |
| - radv: never kill a NGG GS shader |
| - nir/opt_remove_phis: handle phis with no sources |
| - aco: run nir_lower_int64() before nir_lower_idiv() |
| - aco: implement 64-bit ineg |
| - aco: fix GFX9 opcode for v_xad_u32 |
| - aco: fix v_subrev_co_u32_e64 opcode |
| - aco: fix opcode for s_mul_hi_i32 |
| - aco: check for duplicate opcode numbers |
| - radv/aco: actually disable ACO when unsupported |
| - aco,radv/aco: get dissassembly for release builds if requested |
| - aco: store printed backend IR in binary |
| - radv/aco: return a correct name and description for the backend IR |
| - aco,radv: rename record_llvm_ir/llvm_ir_string to record_ir/ir_string |
| - aco: don't CSE v_readlane_b32/v_readfirstlane_b32 |
| - aco: CSE readlane/readfirstlane/permute/reduce with the same exec |
| mask |
| - aco: set loop_info::has_discard for demotes |
| - aco: don't remove the loop exec mask in transition_to_Exact() |
| - radv/aco,aco: set lower_fmod |
| - nir/print: always use the right FILE \* |
| - aco: fix load_constant with multiple arrays |
| - nir/constant_folding: add back and use constant_fold_state |
| - nir/constant_folding: fold load_constant intrinsics |
| - aco: move s_andn2_b64 instructions out of the p_discard_if |
| - aco: enable nir_opt_sink |
| - aco: Allow literals on VOP3 instructions. |
| - aco: Assemble opsel in VOP3 instructions. |
| - aco: workaround GFX10 0x3f branch bug |
| - aco: pad code with s_code_end on GFX10 |
| - aco: Initial work to avoid GFX10 hazards. |
| - aco: Use the VOP3-only add/sub GFX10 instructions if needed. |
| - aco: Have s_waitcnt_vscnt write to NULL. |
| - radv/aco: disable NGG when ACO is used |
| - aco/gfx10: fix inline uniform blocks |
| - aco/gfx10: disable GFX9 1D texture workarounds |
| - aco: rework scratch resource code |
| - aco: update print_ir |
| - nir/lower_non_uniform: lower image/texture instructions taking derefs |
| - nir/lower_input_attachments: pass on non-uniform access flag |
| - aco: don't apply sgprs/constants to read/write lane instructions |
| - aco: use can_accept_constant in valu_can_accept_literal |
| - aco: readfirstlane vgpr pointers in convert_pointer_to_64_bit() |
| - aco: implement divergent vulkan_resource_index |
| - aco: don't use p_as_uniform for vgpr sampler/image indices |
| - aco: fix scheduling with s_memtime/s_memrealtime |
| - aco: don't CSE s_memtime |
| - aco: emit_split_vector() s_memtime results |
| - nir/lower_idiv: add new llvm-based path |
| - aco: use nir_lower_idiv_precise |
| - aco: run opt_algebraic in a loop |
| - aco: small stage corrections |
| - aco: fix 64-bit p_extract_vector on 32-bit p_create_vector |
| - aco: create load_lds/store_lds helpers |
| - aco: fix sparse store_lds() |
| - aco: properly combine additions into ds_write2_b64/ds_read2_b64 |
| - aco: use ds_read2_b64/ds_write2_b64 |
| - aco: add a few missing checks in value numbering |
| - aco: keep can_reorder/barrier when combining addition into SMEM |
| - aco: add missing bld.scc() |
| - Revert "aco: only emit waitcnt on loop continues if we there was some |
| load or export" |
| - radv: round vgprs/sgprs before calculating max_waves |
| - aco: increase accuracy of SGPR limits |
| - aco: take LDS into account when calculating num_waves |
| - aco: Fix reductions on GFX10. |
| - aco: Remove dead code in reduction lowering. |
| - aco: try to group together VMEM loads of the same resource |
| - aco: a couple loop handling fixes for GFX10 hazard pass |
| - aco: rename README to README.md |
| - aco: fix new_demand calculation for first instructions |
| - aco: fix shuffle with uniform operands |
| - aco: fix read_invocation with VGPR lane index |
| - aco: don't propagate vgprs into v_readlane/v_writelane |
| - aco: don't combine literals into v_cndmask_b32/v_subb/v_addc |
| - aco: fix 64-bit fsign with 0 |
| - aco: propagate p_wqm on an image_sample's coordinate p_create_vector |
| - aco: fix i2i64 |
| - aco: add v_nop inbetween exec write and VMEM/DS/FLAT |
| - radv: set writes_memory for global memory stores/atomics |
| - nir/lower_io_to_vector: don't create arrays when not needed |
| |
| Rob Clark (60): |
| |
| - freedreno/ir3: convert block->predecessors to set |
| - freedreno/ir3: maintain predecessors/successors |
| - freedreno/ir3: do better job of marking convergence points |
| - nir: remove unused constant_fold_state |
| - freedreno/drm: fix 64b iova shifts |
| - freedreno/ir3: use uniform base |
| - freedreno/ir3: cleanup "partially const" ubo srcs |
| - freedreno/ir3: fix addr/pred spilling |
| - freedreno/ir3: fix mad copy propagation special case |
| - freedreno/ir3: assert that only single address |
| - freedreno/ir3: fix cp cmps.s opt |
| - freedreno/ir3: allow copy propagation for relative |
| - util: android logging support |
| - freedreno/a6xx: don't tile things that are too small |
| - freedreno/a6xx: fix 3d tex layout |
| - freedreno: fix compiler warning |
| - freedreno/a6xx: pre-calculate userconst stateobj size |
| - gitlab-ci/a630: skip |
| dEQP-GLES3.functional.fbo.msaa.2_samples.stencil_index8 |
| - freedreno/a6xx: un-open-code PC_PRIMITIVE_CNTL_1.PSIZE |
| - freedreno/a6xx: fix binning pass vs. xfb |
| - freedreno/a6xx: do streamout only in binning pass |
| - freedreno/ir3: drop unused param |
| - freedreno/ir3: handle multi component alu src when propagating shifts |
| - freedreno: update registers |
| - freedreno/ir3: remove unused ir3_instruction::inout |
| - freedreno/ir3: track sysval slot for inputs |
| - freedreno/ir3: don't DCE ij_pix if used for pre-fs-texture-fetch |
| - freedreno/ir3: add meta instruction for pre-fs texture fetch |
| - freedreno/ir3: fixup register footprint to account for prefetch |
| - freedreno/ir3: add dummy bary.f(ei) for pre-fs-fetch |
| - freedreno/ir3: add pre-dispatch tex fetch to disasm |
| - freedreno/ir3: force i/j pixel to r0.x |
| - freedreno/a6xx: add support for pre-fs texture fetch |
| - turnip: add support for pre-fs texture fetch |
| - freedreno/ir3: enable pre-fs texture fetch for a6xx |
| - nir/search: fix the PoT helpers |
| - freedreno/ir3: rename mul.s/mul.u |
| - nir: Add a new ALU nir_op_imul24 |
| - nir: add amul instruction |
| - nir: add address calc related opt rules |
| - nir: add nir_lower_amul pass |
| - freedreno/ir3: add rule to generate imad24 |
| - freedreno/ir3: optimize immed 2nd src to mad |
| - freedreno/ir3: add imul24 opcode |
| - freedreno/ir3: handle imad24_ir3 case in UBO lowering |
| - freedreno/ir3: handle scalarized varying inputs |
| - freedreno/ir3: fixup register footprint fixup |
| - freedreno/ir3: debug cleanup |
| - freedreno/ir3: make high regs easier to see in IR dumps |
| - freedreno/ir3: propagate dest flags for collect/fanin |
| - freedreno/ir3: treat high vs low reg as conversion |
| - freedreno/ir3: allow copy-propagate out of fanout |
| - freedreno/ir3: remove restrictions on const + (abs)/(neg) |
| - freedreno/ir3: handle the progress case |
| - freedreno/a6xx: remove some left over dead code |
| - freedreno/a6xx: cleanup magic registers |
| - freedreno/a6xx: add a618 support |
| - freedreno/ir3: fix gpu hang with pre-fs-tex-fetch |
| - Revert "freedreno/ir3: enable pre-fs texture fetch for a6xx" |
| - nir/lower_clip: Fix incorrect driver loc for clipdist outputs |
| |
| Robin Murphy (1): |
| |
| - egl/gbm: Fix config validation |
| |
| Rohan Garg (3): |
| |
| - panfrost: Remove unused argument from panfrost_drm_submit_vs_fs_job() |
| - panfrost: Jobs must be per context, not per screen |
| - panfrost: protect access to shared bo cache and transient pool |
| |
| Roland Scheidegger (4): |
| |
| - gallivm: use fallback code for mul_hi with llvm >= 7.0 |
| - llvmpipe: fix CALLOC vs. free mismatches |
| - llvmpipe: increase max texture size to 2GB |
| - gallivm: Fix saturated signed psub/padd intrinsics on llvm 8 |
| |
| Roman Stratiienko (1): |
| |
| - lima: Return fence unconditionally |
| |
| Sagar Ghuge (26): |
| |
| - intel/eu/gen12: Implement immediate 64 bit constant encoding. |
| - nir: Add alpha_to_coverage lowering pass |
| - intel/compiler: Remove emit_alpha_to_coverage workaround from backend |
| - intel: Add missing entry for brw_nir_lower_alpha_to_coverage in |
| Makefile |
| - intel/compiler: Add Immediate support for 3 source instruction |
| - intel/compiler: Set bits according to source file |
| - intel/compiler: Don't move immediate in register |
| - intel/compiler: Refactor disassembly of sources in 3src instruction |
| - intel/isl: Don't reconfigure aux surfaces for MCS |
| - iris: Initialize CCS to fast clear while using with MCS |
| - iris: Define MCS_CCS state transitions and usages |
| - intel/blorp: Use isl_aux_usage_has_mcs instead of comparing |
| - iris: Get correct resource aux usage for copy |
| - intel/isl: Support lossless compression with multisamples |
| - iris: Create resource with aux_usage MCS_CCS |
| - genxml/gen12: Add Stencil Buffer Resolve Enable bit |
| - intel/blorp: Assign correct view while clearing depth stencil |
| - intel/blorp: Add helper function for stencil buffer resolve |
| - intel: Track stencil aux usage on Gen12+ |
| - intel/blorp: Set stencil resolve enable bit |
| - iris: Resolve stencil buffer lossless compression with WM_HZ_OP |
| packet |
| - iris: Prepare stencil resource before clear depth stencil |
| - iris: Prepare depth resource if clear_depth enable |
| - iris: Prepare resources before stencil blit operation |
| - iris: Resolve stencil resource prior to copy or used by CPU |
| - intel/isl: Allow stencil buffer to support compression on Gen12+ |
| |
| Samuel Iglesias Gonsálvez (26): |
| |
| - spirv: check support for SPV_KHR_float_controls capabilities |
| - spirv/nir: keep track of SPV_KHR_float_controls execution modes |
| - nir: add auxiliary functions to detect if a mode is enabled |
| - nir: add support for flushing to zero denorm constants |
| - util: add softfloat functions to operate with doubles and floats |
| - util: add float to float16 conversions with RTZ and RTNE |
| - util: add fp64 -> fp32 conversion support for RTNE and RTZ rounding |
| modes |
| - nir: add support for round to zero rounding mode to nir_op_f2f32 |
| - nir: mind rounding mode on fadd, fsub, fmul and fma opcodes |
| - nir/opcodes: make sure f2f16_rtz and f2f16_rtne behavior is not |
| overriden by the float controls execution mode |
| - nir/constant_expressions: mind rounding mode converting from float to |
| float16 destinations |
| - nir/algebraic: disable inexact optimizations depending on float |
| controls execution mode |
| - nir: fix denorms in unpack_half_1x16() |
| - nir: fix denorm flush-to-zero in sqrt's lowering at |
| nir_lower_double_ops |
| - nir: fix fmin/fmax support for doubles |
| - intel/nir: do not apply the fsin and fcos trig workarounds for consts |
| - i965/fs/nir: add nir_op_unpack_half_2x16_split_*_flush_to_zero |
| - i965/fs/generator: refactor rounding mode helper in preparation for |
| float controls |
| - i965/fs/generator: add new opcode to set float controls modes in |
| control register |
| - i965/fs: add emit_shader_float_controls_execution_mode() and aux |
| functions |
| - i965/fs: set rounding mode when emitting fadd, fmul and ffma |
| instructions |
| - i965/fs: set rounding mode when emitting nir_op_f2f32 or nir_op_f2f16 |
| - i965/fs: add support for shader float control to |
| remove_extra_rounding_modes() |
| - anv: enable VK_KHR_shader_float_controls and SPV_KHR_float_controls |
| - docs/relnotes: add support for VK_KHR_shader_float_controls on Intel |
| - nir/algebraic: refactor inexact opcode restrictions |
| |
| Samuel Pitoiset (136): |
| |
| - radv/gfx10: tidy up gfx10_format_table.py |
| - radv/gfx10: hardcode some depth+stencil formats in the format table |
| - radv: allow to enable VK_AMD_shader_ballot only on GFX8+ |
| - radv: add a new debug option called RADV_DEBUG=noshaderballot |
| - radv: force enable VK_AMD_shader_ballot for Wolfenstein Youngblood |
| - radv: implement VK_AMD_shader_core_properties2 |
| - ac: fix exclusive scans on GFX8-GFX9 |
| - ac,radv,radeonsi: remove LLVM 7 support |
| - gitlab-ci: bump LLVM to 8 for meson-vulkan and meson-clover |
| - radv/gfx10: don't initialize VGT_INSTANCE_STEP_RATE_0 |
| - radv/gfx10: do not use NGG with NAVI14 |
| - radv: fix getting the index type size for uint8_t |
| - radv: add radv_process_depth_image_layer() helper |
| - radv: add mipmaps support for decompress/resummarize |
| - radv: decompress mipmapped depth/stencil images during transitions |
| - radv: allocate metadata space for mipmapped depth/stencil images |
| - radv: add mipmap support for the TC-compat zrange bug |
| - radv: add mipmap support for the clear depth/stencil values |
| - ac: drop llvm8 from some load/store helpers |
| - ac: add has_clear_state to ac_gpu_info |
| - ac: add has_distributed_tess to ac_gpu_info |
| - ac: add has_dcc_constant_encode to ac_gpu_info |
| - ac: add has_rbplus to ac_gpu_info |
| - ac: add has_load_ctx_reg_pkt to ac_gpu_info |
| - ac: add has_out_of_order_rast to ac_gpu_info |
| - ac: add cpdma_prefetch_writes_memory to ac_gpu_info |
| - ac: add has_gfx9_scissor_bug to ac_gpu_info |
| - ac: add has_tc_compat_zrange_bug to ac_gpu_info |
| - ac: add rbplus_allowed to ac_gpu_info |
| - ac: add has_msaa_sample_loc_bug to ac_gpu_info |
| - ac: add has_ls_vgpr_init_bug to ac_gpu_info |
| - radv: make use of has_ls_vgpr_init_bug |
| - radv/gfx10: compute the LDS size for exporting PrimID for VS |
| - ac: import linear/perspective PS input parameters from radv/radeonsi |
| - ac: drop now useless lookup_interp_param from ABI |
| - radv: gather info about PS inputs in the shader info pass |
| - radv: move lowering PS inputs/outputs at the right place |
| - radv: remove some unused fields from radv_shader_context |
| - radv: remove unused shader_info parameter in ac_compile_llvm_module() |
| - radv: remove useless ac_llvm_util.h include from the WSI code |
| - radv: remove radv_init_llvm_target() helper |
| - radv: replace ac_nir_build_if by ac_build_ifcc |
| - radv: move setting can_discard to ac_fill_shader_info() |
| - radv: keep a pointer to a NIR shader into radv_shader_context |
| - nir: do not assume that the result of fexp2(a) is always an integral |
| - radv/gfx10: always set ballot_mask_bits to 64 |
| - radv: merge radv_shader_variant_info into radv_shader_info |
| - radv: move ac_fill_shader_info() to radv_nir_shader_info_pass() |
| - radv: gather clip/cull distances in the shader info pass |
| - radv: gather pointsize in the shader info pass |
| - radv: gather viewport in the shader info pass |
| - radv: gather layer in the shader info pass |
| - radv: gather primitive ID in the shader info pass |
| - radv: calculate the GSVS vertex size in the shader info pass |
| - radv: calculate esgs_itemsize in the shader info pass |
| - radv/gfx10: account for the subpass view for the NGG GS storage |
| - radv/gfx10: make use the output usage mask when exporting NGG GS |
| params |
| - radv/gfx10: determine the number of vertices per primitive for TES |
| - radv: do not pass all compiler options to the shader info pass |
| - radv: fill shader info for all stages in the pipeline |
| - radv: store GFX9 GS state as part of the shader info |
| - radv: store GFX10 NGG state as part of the shader info |
| - radv: store the ESGS ring size as part of gfx10_ngg_info |
| - radv: calculate GFX9 GS and GFX10 NGG states before compiling shader |
| variants |
| - radv/gfx10: declare a LDS symbol for the NGG emit space |
| - radv: fix allocating number of user sgprs if streamout is used |
| - radv/winsys: add support for GS and OA domains |
| - radv/gfx10: add an option to switch from legacy to NGG streamout |
| - radv/gfx10: implement NGG streamout begin/end functions |
| - radv/gfx10: allocate GDS/OA buffer objects for NGG streamout |
| - radv/gfx10: adjust the GS NGG scratch size for streamout |
| - radv/gfx10: unconditionally declare scratch space for NGG streamout |
| without GS |
| - radv/gfx10: adjust the LDS size for VS/TES NGG streamout |
| - radv/gfx10: fix unnecessary LDS overallocation for NGG GS |
| - radv/gfx10: compute the correct buffer size for NGG streamout |
| - radv/gfx10: gather GS output for VS as NGG |
| - radv/gfx10: enable NGG_WAVE_ID_EN for NGG streamout |
| - radv/gfx10: make GDS idle when leaving the IB |
| - radv/gfx10: make sure to wait for idle before clearing GDS |
| - radv/gfx10: implement NGG streamout |
| - radv/gfx10: disable unsupported transform feedback features for NGG |
| - radv: fix writing depth/stencil clear values to image |
| - radv: fix loading 64-bit GS inputs |
| - radv/gfx10: fix VK_KHR_pipeline_executable_properties with NGG GS |
| - radv/gfx10: add radv_device::use_ngg |
| - radv/gfx10: add missing counter buffer to the BO list |
| - radv/gfx10: fix storing/loading NGG stream outputs for VS and TES |
| - radv/gfx10: use the component mask when storing/loading NGG stream |
| outputs |
| - radv/gfx10: fix storing/loading NGG stream outputs for GS |
| - radv/gfx10: fix NGG streamout with triangle strips for VS |
| - radv: rework the slow depthstencil clear to write depth from PS |
| - Revert "radv: disable viewport clamping even if FS doesn't write Z" |
| - radv: fix build |
| - radv/gfx10: fix the ESGS ring size symbol |
| - radv: enable lower_fmod for the LLVM path |
| - ac/nir: remove unused code for nir_op_{fmod,frem} |
| - radv: implement VK_KHR_shader_clock |
| - drirc: enable vk_x11_override_min_image_count for DOOM |
| - radv: bump minTexelBufferOffsetAlignment to 4 |
| - radv: get the device name from radeon_info::name |
| - radv: sync before resetting query pools if timestamps have been |
| written |
| - radv: use a compute shader for copying timestamp query results |
| - radv: fix DCC fast clear code for intensity formats |
| - radv: rename VK_KHR_shader_float16_int8 structs/constants |
| - Revert "radv: do not emit PKT3_CONTEXT_CONTROL with AMDGPU 3.6.0+" |
| - radv: fix DCC fast clear code for intensity formats (correctly) |
| - ac/llvm: add ac_build_canonicalize() helper |
| - ac/llvm: add AC_FLOAT_MODE_ROUND_TO_ZERO |
| - ac/llvm: force fneg/fabs to flush denorms to zero if requested |
| - radv: implement VK_KHR_shader_float_controls |
| - radv: enable VK_KHR_shader_float_controls on GFX6-GFX7 |
| - radv: do not print useless descriptors info in hang reports |
| - radv: print which ring is dumped in hang reports |
| - radv: dump trace files earlier if a GPU hang is detected |
| - radv: do not dump descriptors twice in hang reports |
| - radv: advertise VK_KHR_spirv_1_4 |
| - ac/llvm: fix ac_to_integer_type() for 32-bit const addr space |
| pointers |
| - radv: fix updating bound fast ds clear values with different aspects |
| - radv: do not create meta pipelines with 16 samples |
| - radv: add an assertion in radv_gfx10_compute_bin_size() |
| - radv: do not emit rbplus if attachments are undefined |
| - radv/gfx10: re-enable fast depth/stencil clears with separate aspects |
| - radv/gfx10: fix 3D images |
| - radv: fix vkUpdateDescriptorSets with inline uniform blocks |
| - radv: fix a performance regression with graphics depth/stencil clears |
| - radv: compute the number of records correctly for vertex buffers |
| - radv: fix VK_KHR_shader_float_controls dependency on GFX6-7 |
| - radv: enable fast depth/stencil clears with separate aspects on GFX8 |
| - radv: fix OpQuantizeToF16 for NaN on GFX6-7 |
| - radv: fix dumping SPIR-V into hang reports |
| - radv: move nomemorycache debug option at the right palce |
| - radv: fix perftest options |
| - radv: fix compute pipeline keys when optimizations are disabled |
| - radv: fix enabling sample shading with SampleID/SamplePosition |
| - radv/gfx10: fix implementation of exclusive scans |
| - ac/nir: fix out-of-bound access when loading constants from global |
| |
| Sergii Romantsov (4): |
| |
| - intel/dri: finish proper glthread |
| - nir/large_constants: more careful data copying |
| - nir/large_constants: pass after lowering copy_deref |
| - meta: leak of shader program when decompressing tex-images |
| |
| Stephen Barber (1): |
| |
| - nouveau: add idep_nir_headers as dep for libnouveau |
| |
| Tapani Pälli (23): |
| |
| - util: fix os_create_anonymous_file on android |
| - iris/android: fix build and link with libmesa_intel_perf |
| - egl: reset blob cache set/get functions on terminate |
| - intel/genxml: generate pack files for gen12 on android builds |
| - intel/isl: build android libmesa_isl for gen12 |
| - iris: build android libmesa_iris for gen12 |
| - anv: build libanv for gen12 in android build |
| - i965: initialize bo_reuse when creating brw_bufmgr |
| - iris: use driconf for 'bo_reuse' parameter |
| - android: fix linking issues with liblog |
| - iris: close screen fd on iris_destroy_screen |
| - egl: check for NULL value like eglGetSyncAttribKHR does |
| - iris: disable aux on first get_param if not created with aux |
| - mesa/st: calculate texture size based on EGLImage miplevel |
| - anv/android: fix images created with external format support |
| - i965: setup sized internalformat for MESA_FORMAT_R10G10B10A2_UNORM |
| - mesa: add [Program]Uniform*64ARB display list support |
| - mesa: enable ARB_gpu_shader_int64 in compat profile |
| - Revert "egl: implement new functions from |
| EGL_EXT_image_flush_external" |
| - Revert "egl: handle EGL_IMAGE_EXTERNAL_FLUSH_EXT" |
| - Revert "st/dri: add support for EGL_EXT_image_flush_external" |
| - Revert "st/dri: assume external consumers of back buffers can write |
| to the buffers" |
| - Revert "dri_interface: add interface for |
| EGL_EXT_image_flush_external" |
| |
| Thomas Hellstrom (2): |
| |
| - svga: Fix banded DMA upload unmap |
| - winsys/svga: Limit the maximum DMA hardware buffer size |
| |
| Thong Thai (2): |
| |
| - Revert "radeonsi: don't emit PKT3_CONTEXT_CONTROL on amdgpu" |
| - radeonsi: add JPEG decode support for VCN 2.0 devices |
| |
| Timothy Arceri (35): |
| |
| - radeonsi/nir: fix number of used samplers |
| - util/disk_cache: bump thread count assigned to disk cache queue |
| - util/u_queue: track job size and limit the size of queue growth |
| - util/disk_cache: make use of the total job size limiting feature |
| - radeonsi/nir: lower load constants to scalar |
| - glsl: fix crash compiling bindless samplers inside unnamed UBOs |
| - nir: fix nir_variable_data packing |
| - nir: improve nir_variable packing |
| - glsl: remove propagate_invariance() call from the linker |
| - radv: get topology from pipeline key rather than |
| VkGraphicsPipelineCreateInfo |
| - radv: add debug option to turn off in memory cache |
| - radv: add radv_create_shaders() to radv_shader.h |
| - radv: add radv_secure_compile_type enum |
| - radv: add some new members to radv device and instance for secure |
| compile |
| - radv: add radv_device_use_secure_compile() helper |
| - radv: allow the secure process to read and write from disk cache |
| - radv: for secure compile exit early from radv_shader_variant_create() |
| - radv: add radv_secure_compile() |
| - radv: a support for a secure compile fork at device creation |
| - radv: enable secure compile support |
| - util: remove LIST_INITHEAD macro |
| - util: remove LIST_ADDTAIL macro |
| - util: remove LIST_ADD macro |
| - util: remove LIST_REPLACE macro |
| - util: remove LIST_DELINIT macro |
| - util: remove LIST_DEL macro |
| - util: rename list_empty() to list_is_empty() |
| - util: remove LIST_IS_EMPTY macro |
| - radv: allow select() calls in secure compile |
| - radv: add radv_sc_read() helper |
| - radv: make use of radv_sc_read() |
| - radv: add some infrastructure for fresh forks for each secure compile |
| - radv: add a secure_compile_open_fifo_fds() helper |
| - radv: create a fresh fork for each pipeline compile |
| - glsl/nir: iterate the system values list when adding varyings |
| |
| Timur Kristóf (48): |
| |
| - st/nine: Properly initialize GLSL types for NIR shaders. |
| - nir: Carve out nir_lower_samplers from GLSL code. |
| - tgsi_to_nir: Remove dependency on libglsl. |
| - amd/common: Move ac_export_mrt_z to ac_llvm_build. |
| - amd/common: Extract some helper functions to ac_shader_util. |
| - amd/common: Add num_shared_vgprs to ac_shader_config for GFX10. |
| - radv: Set shared VGPR count in radv_postprocess_config. |
| - amd/common: Introduce ac_get_fs_input_vgpr_cnt. |
| - radv: Add debug option to dump meta shaders. |
| - radv: Fix L2 cache rinse programming. |
| - amd: Move all amd/common code that depends on LLVM to amd/llvm. |
| - aco: Set +wavefrontsize64 for LLVM disassembler in GFX10 wave64 mode. |
| - aco: Add missing GFX10 specific fields and some README notes. |
| - aco: Support GFX10 SMEM in aco_assembler. |
| - aco: Support GFX10 VINTRP in aco_assembler. |
| - aco: Support GFX10 DS in aco_assembler. |
| - aco: Support GFX10 MUBUF in aco_assembler. |
| - amd/common: Add extern "C" to some headers that were missing it. |
| - aco: Link ACO with amd/common. |
| - aco: Support GFX10 MTBUF in aco_assembler. |
| - aco: Support GFX10 MIMG and GFX9 D16 in aco_assembler. |
| - aco: Fix GFX9 FLAT, SCRATCH, GLOBAL instructions, add GFX10 support. |
| - aco: Support GFX10 EXP in aco_assembler. |
| - aco: Support GFX10 VOP3 and VOP1 as VOP3 in aco_assembler. |
| - aco: Set GFX10 DLC bit properly. |
| - aco: Use ac_get_sampler_dim, delete duplicate code. |
| - aco: Set GFX10 dimensionality on the instructions that need it. |
| - aco: Support subvector loops in aco_assembler. |
| - aco: Fix VS input VGPRs on GFX10. |
| - aco: Fix s_dcache_wb on GFX10. |
| - aco: Add extra assertion for number of FS input VGPRs. |
| - aco: Clean up usages of PhysReg::reg from aco_assembler. |
| - aco/gfx10: Wait for pending SMEM stores before loads |
| - aco/gfx10: Fix PS exports for SPI_SHADER_32_AR. |
| - aco/gfx10: Update constant addresses in fix_branches_gfx10. |
| - aco/gfx10: Add notes about some GFX10 hazards. |
| - aco/gfx10: Mitigate VcmpxPermlaneHazard. |
| - aco/gfx10: Mitigate VcmpxExecWARHazard. |
| - aco/gfx10: Mitigate SMEMtoVectorWriteHazard. |
| - aco/gfx10: Mitigate LdsBranchVmemWARHazard. |
| - aco/gfx10: Fix mitigation of VMEMtoScalarWriteHazard. |
| - aco: Refactor hazard mitigations, separate pass for GFX10. |
| - st/nine: Fix build with -Werror=empty-body |
| - st/nine: Fix unused variable warnings in release build. |
| - aco: Implement subgroup shuffle in GFX10 wave64 mode. |
| - aco: Introduce vgpr_limit to keep track of available VGPRs. |
| - radv: Enable ACO on Navi. |
| - ac: Handle invalid GFX10 format correctly in ac_get_tbuffer_format. |
| |
| Tomeu Vizoso (19): |
| |
| - panfrost/ci: Use Volt-based runner for dEQP tests |
| - panfrost/ci: Print bootstrap log |
| - panfrost/ci: Build kernel with CONFIG_DETECT_HUNG_TASK |
| - panfrost/ci: Install qemu-arm-static into chroot |
| - panfrost/ci: Print load stats |
| - panfrost/ci: Print only regressions |
| - panfrost/ci: Re-add support for armhf |
| - panfrost/ci: Use special runner for LAVA jobs |
| - panfrost/ci: Increase timeouts |
| - panfrost/ci: Run dEQP with the surfaceless platform |
| - panfrost/ci: Update kernel to 5.3-rc8 |
| - panfrost/ci: Use releases for Volt dEQP |
| - gitlab-ci: Run dEQP on devices with Panfrost |
| - gitlab-ci: Move LAVA-related files into top-level ci dir |
| - gitlab-ci/lava: Fix image to use in test jobs |
| - gitlab-ci/lava: Use files to list tests to skip |
| - gitlab-ci/lava: Test Lima driver with dEQP |
| - panfrost: Keep track of active BOs |
| - gitlab-ci: Update kernel for LAVA jobs to 5.4-rc4 |
| |
| Urja Rannikko (1): |
| |
| - panfrost: allocate bo for occlusion query results |
| |
| Vasily Khoruzhick (35): |
| |
| - lima/ppir: refactor const lowering |
| - lima/ppir: clone ld_{uni,tex,var} into each block |
| - lima/ppir: add support for unconditional branches and condition |
| negation |
| - lima/ppir: set write mask for texture loads if dest is reg |
| - lima/ppir: fix ordering deps |
| - lima/ppir: add write after read deps for registers |
| - lima/ppir: add dummy op |
| - lima/ppir: create ppir block for each corresponding NIR block |
| - lima/ppir: turn store_color into ALU node |
| - lima/ppir: validate shader outputs |
| - lima/ppir: add better liveness analysis |
| - lima/ppir: add control flow support |
| - lima/ppir: print register index and components number for spilled |
| register |
| - lima: fix texture descriptor issues |
| - lima/ppir: add common helper for creating movs |
| - lima/ppir: don't assume that load coords gets value from register |
| - lima/ppir: clone uniforms and load_coords into each successor |
| - nir: allow specifying filter callback in lower_alu_to_scalar |
| - lima/ppir: don't lower vector {b,f}csel to scalar if condition is |
| scalar |
| - lima/ppir: don't lower phis to scalar |
| - lima/gpir: lower fceil |
| - lima/gpir: fix warning in gpir disassembler |
| - lima: run opt_algebraic between int_to_float and boot_to_float for vs |
| - lima/ppir: drop fge/flt/feq/fne options |
| - lima: set .out_sync field of req in lima_submit_start() |
| - lima: add standalone disassembler with primitive MBS parser |
| - lima: use 0 to poll if BO is busy in lima_bo_wait() |
| - lima: implement BO cache |
| - lima/ppir: don't attempt to clone tex coords if it's not varying |
| - lima/ppir: add node dependency types |
| - lima/ppir: add support for indirect load of uniforms and varyings |
| - lima/ppir: add NIR pass to split varying loads |
| - lima: set uniforms_address lower bits properly |
| - lima/ppir: don't clone texture loads |
| - lima: fix PP stack size |
| |
| Vinson Lee (7): |
| |
| - glx: Fix up glXQueryGLXPbufferSGIX on macOS. |
| - swr: Fix build with llvm-9.0 again. |
| - travis: Fail build if any command in if statement fails. |
| - util: Define strchrnul on macOS. |
| - swr: Fix make_unique build error. |
| - scons: Add coroutines component to build. |
| - meson: Add coroutines component to llvmpipe build. |
| |
| Wladimir J. van der Laan (1): |
| |
| - etnaviv: GC7000: Texture descriptors |
| |
| Yevhenii Kolesnikov (2): |
| |
| - glsl: Enable textureSize for samplerExternalOES |
| - meson: Fix linkage of libgallium_nine with libgalliumvl |
| |
| Zebediah Figura (1): |
| |
| - Revert "draw: revert using correct order for prim decomposition." |
| |
| Zhaowei Yuan (1): |
| |
| - broadcom/vc4: Expand width of dst surface |
| |
| Zhu, James (1): |
| |
| - radeon: Fix mjpeg issue for ARCTURUS |
| |
| nia (1): |
| |
| - loader: include limits.h for PATH_MAX |
| |
| pal1000 (3): |
| |
| - scons/windows: Support build with LLVM 9. |
| - scons: Fix MSYS2 Mingw-w64 build. |
| - scons/windows: Enable compute shaders when possible. |
| |
| renchenglei (1): |
| |
| - egl/android: Enable HAL_PIXEL_FORMAT_RGBA_1010102 format |