A week in Openverse: 2024-06-24 – 2024-07-01

openverse

Merged PRs

Analytics

  • #4550: Fix Plausible setup after domain was already set
  • #4568: Specify pull policy for `openverseOpenverse Openverse is a search engine for openly-licensed media, including images and audio. Find Openverse on GitHub and at https://openverse.org.-` images

APIAPI An API or Application Programming Interface is a software intermediary that allows programs to interact with each other and share data in limited, clearly defined ways.

  • #4499: Include xml in frontend attribution options
  • #4530: Drop FK constraint on media_obj in MediaDecisionThrough, update backfillmoderationdecision command
  • #4536: Do not use unstable pook reference for API
  • #4540: Add linting for Dockerfiles
  • #4544: Create sensitive and deleted media models for decisions
  • #4547: Publish changelog for api-2024.06.24.18.01.42
  • #4551: Shorten PDM hash to first 8 characters
  • #4554: Remove backfillmoderationdecision management command after production run
  • #4568: Specify pull policy for `openverse-` images

Catalog

  • #4475: Add DAG to decode and deduplicate image tags with escaped literal unicode sequences
  • #4495: Fix placing test S3 data into MinIO
  • #4497: Add CI/CD and PDM to new indexer worker
  • #4526: Fix separators in catalog and dev-env images and dev-env volume
  • #4540: Add linting for Dockerfiles
  • #4555: Ensure plpython3u exists in live db when using it
  • #4557: Remove `trim_and_deduplicate_tags` DAG after successful run
  • #4568: Specify pull policy for `openverse-` images

Documentation

  • #4417: Implementation Plan: Augment the catalog database with suitable Rekognition tags
  • #4475: Add DAG to decode and deduplicate image tags with escaped literal unicode sequences
  • #4546: combine frontend testing documentation pages
  • #4547: Publish changelog for api-2024.06.24.18.01.42
  • #4548: Publish changelog for frontend-2024.06.24.18.01.44
  • #4557: Remove `trim_and_deduplicate_tags` DAG after successful run
  • #4562: Publish changelog for frontend-2024.06.26.17.18.17

Frontend

  • #4291: Display generated tags separately
  • #4497: Add CI/CD and PDM to new indexer worker
  • #4499: Include xml in frontend attribution options
  • #4509: Replace "Over…" language with more precise "Top…"
  • #4516: Add caching to frontend NginxNGINX NGINX is open source software for web serving, reverse proxying, caching, load balancing, media streaming, and more. It started out as a web server designed for maximum performance and stability. In addition to its HTTP server capabilities, NGINX can also function as a proxy server for email (IMAP, POP3, and SMTP) and a reverse proxy and load balancer for HTTP, TCP, and UDP servers. https://www.nginx.com/. configuration
  • #4523: Fix possible TypeError when accessing properties of `route.value`
  • #4540: Add linting for Dockerfiles
  • #4548: Publish changelog for frontend-2024.06.24.18.01.44
  • #4549: Re-add tags page text
  • #4559: Fix flaky VCollectionHeader snapshot tests
  • #4562: Publish changelog for frontend-2024.06.26.17.18.17

Infra

  • #4516: Add caching to frontend Nginx configuration

Ingestion Server

  • #4471: Remove single quotes in values of Ingestion Server's TSV files
  • #4529: Upload Ingestion Server's TSV files to AWS S3 (skip tags)

Management

  • #4497: Add CI/CD and PDM to new indexer worker
  • #4526: Fix separators in catalog and dev-env images and dev-env volume
  • #4539: Add dev tools jq and HTTPie to `ov`
  • #4540: Add linting for Dockerfiles
  • #4546: combine frontend testing documentation pages
  • #4568: Specify pull policy for `openverse-` images

Closed issues

API

  • #4430: Attribution: XML/RDF/Turtle please.
  • #4454: Determine if all tags in the catalog database have an associated provider
  • #4512: The `AbstractMediaDecisionThrough` class and its inheriting classes shouldn't use actual foreign keys to media tables
  • #4513: Creating `MediaDecision` has no effect on deindexed actions

Catalog

  • #663: Upgrade catalog to Python 3.11
  • #1464: Create a DAG to log and report code review response times
  • #4199: Remove and de-duplicate tags with leading/trailing whitespace
  • #4454: Determine if all tags in the catalog database have an associated provider
  • #4494: Test S3 inaturalist files are not found in MinIO

Documentation

  • #4040: Implementation Plan: Augment the catalog database with suitable Rekognition tags
  • #4514: Combine frontend testing documentation pages

Frontend

  • #461: Add a message to inform the user about more filters when one media type is chosen
  • #2130: Update sensitive browsing designs to allow re-blurring of search results
  • #2213: Frontend local dev error `Cannot convert undefined or null to object`
  • #4192: Displaying machine-generated content
  • #4379: Write a page describing the machine-generated tags for the frontend
  • #4430: Attribution: XML/RDF/Turtle please.
  • #4470: Add caching of static assets to frontend Nginx
  • #4522: TypeError: Cannot read properties of undefined (reading 'name') in `useMatchRoute()`
  • #4558: vcollectionheader storybook visual regression test broken

Infra

  • #4470: Add caching of static assets to frontend Nginx

Ingestion Server

  • #3912: Upload Ingestion Server's TSV files to AWS S3

openverse-infrastructure

Merged PRs

API

  • #951: Fix and improve api-management-command script

Frontend

  • #940: Cache frontend assets at edge for 3 days

Infra

  • #924: Add `StatusCheckFailed` alarms for EC2 services
  • #940: Cache frontend assets at edge for 3 days
  • #942: Add Grafana PDC
  • #944: Touch up indexer worker pools to match IP requirements
  • #947: Bypass WAF for Cloudflare Access services
  • #948: Ignore changes to `actions_enabled` on externally controlled alarm
  • #951: Fix and improve api-management-command script
  • #956: BlockBlock Block is the abstract term used to describe units of markup that, composed together, form the content or layout of a webpage using the WordPress editor. The idea combines concepts of what in the past may have achieved with shortcodes, custom HTML, and embed discovery into a single consistent API and user experience. malicious ASNs and UA string pattern 2024-06-27/28 incident

Management

  • #955: 🔄 synced file(s) with WordPress/openverse

Closed issues

API

  • #950: Disable migrationMigration Moving the code, database and media files for a website site from one server to another. Most typically done when changing hosting companies. running during management command executions

Frontend

  • #927: Change frontend edge caching rules

Infra

  • #254: Audit logging costs and find savings
  • #792: Add EC2 instance state change monitor
  • #927: Change frontend edge caching rules
  • #941: Wire up Grafana PDC
  • #943: Add Cloudflare WAF skip rule for Airflow
  • #950: Disable migration running during management command executions

#openverse, #week-in-openverse