A week in Openverse: 2024-06-17 – 2024-06-24

openverse

Merged PRs

Analytics

  • #4330: Add catalog indexer worker

API

  • #4330: Add catalog indexer worker
  • #4500: Publish changelog for api-2024.06.17.15.33.56
  • #4508: Log dead link verification request timings

Catalog

  • #4330: Add catalog indexer worker
  • #4473: Fix trim and deduplicate tags deduplication
  • #4483: Changes all sensible occurrences of the just commands to have them run using ov
  • #4488: Publish changelog for catalog-2024.06.13.17.07.54
  • #4501: Publish changelog for catalog-2024.06.17.15.33.56
  • #4502: Update dependency apache-airflow to v2.9.2 [SECURITY]
  • #4524: Explicitly include Filter Data step in ingestion server removal IP
  • #4532: Bump requests from 2.31.0 to 2.32.2 in /indexer_worker

Documentation

  • #4465: Add data flow diagram for various ETL steps in pipelines
  • #4483: Changes all sensible occurrences of the just commands to have them run using ov
  • #4486: Update works count on the frontend
  • #4488: Publish changelog for catalog-2024.06.13.17.07.54
  • #4498: Publish changelog for frontend-2024.06.17.15.33.55
  • #4500: Publish changelog for api-2024.06.17.15.33.56
  • #4501: Publish changelog for catalog-2024.06.17.15.33.56
  • #4518: Update current_maintainers.md to add @zackkrida
  • #4524: Explicitly include Filter Data step in ingestion server removal IP

Frontend

  • #4446: Stop opening links in a new tab
  • #4483: Changes all sensible occurrences of the just commands to have them run using ov
  • #4486: Update works count on the frontend
  • #4498: Publish changelog for frontend-2024.06.17.15.33.55

Infra

  • #4491: Use a persistent container for `ov`
  • #4508: Log dead link verification request timings
  • #4527: Set `ov` workdir to current working directory

Ingestion Server

  • #4330: Add catalog indexer worker
  • #4519: Bump urllib3 from 2.2.1 to 2.2.2 in /ingestion_server
  • #4524: Explicitly include Filter Data step in ingestion server removal IP

Management

  • #4330: Add catalog indexer worker
  • #4483: Changes all sensible occurrences of the just commands to have them run using ov
  • #4496: Fix ov reference in hooks
  • #4503: Bump urllib3 from 2.1.0 to 2.2.2 in /utilities/project_planning
  • #4504: Make read contents permission explicit for PR automations
  • #4506: Prevent concurrency between release app and draft releases
  • #4511: Bump urllib3 from 2.2.1 to 2.2.2 in /automations/python
  • #4525: Make `ov clean` work when a container, image or volume does not exist
  • #4537: Sync the dependencies for PR automation init workflow to infra repo

Closed issues

API

  • #3199: Avoid API failure when requests URL params aren't fully encoded
  • #3480: Bad Request error for url from Europeana when requesting thumbnail

Catalog

  • #4147: Implement new catalog indexer-worker
  • #4456: Update ingestion server removal IP to include plan for filtering tags

Documentation

  • #4455: Document current & desired ETL steps and data flow
  • #4480: Update the record count on the homepage
  • #4482: Update references to our developer tools to have the `./ov` prefix

Frontend

  • #496: Do not open external links in new tabs
  • #519: `Unable to get property 'name' of undefined or null reference` in useMatchRoute on Edge
  • #520: `TypeMismatchError` on search in Edge
  • #4480: Update the record count on the homepage

Infra

  • #4490: Refactor `ov` to create a persistent container

Ingestion Server

  • #4456: Update ingestion server removal IP to include plan for filtering tags

Management

  • #4422: `ov` hooks should reference the `ov` script directly, rather than relying on it being in the PATH
  • #4505: Prevent race condition with "Draft release" and "Release app" workflows

openverse-infrastructure

Merged PRs

Catalog

  • #937: Bump airflow to rel-2024.06.17.15.33.56

Frontend

  • #938: Fix duplicate nuxt alarms clashing

Infra

  • #930: Remove unnecessary policy from ECS task role
  • #936: Fix ansible/exec recipe
  • #938: Fix duplicate nuxt alarms clashing

Ingestion Server

  • #931: Bump ingestion server to rel-2024.06.13.17.07.56

Management

  • #939: 🔄 synced file(s) with WordPress/openverse
  • #946: 🔄 synced file(s) with WordPress/openverse

Closed issues

Infra

  • #216: Remove the execution role from the task role
  • #844: Exclude `/api/event` endpoint from Nuxt HTTP 5XX response alarm

#openverse, #week-in-openverse