Issue 8124 8126 fix 7660 7682 2 6266 after review by melton-jason · Pull Request #8154 · specify/specify7

melton-jason · 2026-06-01T17:29:53Z

Taken from #8142 (comment) by @CarolineDenis:

Fixes #8124
Fixes #8126
Fixes #8101
Fixes #8065

Checklist

Self-review the PR after opening it to make sure the changes look good and
self-explanatory (or properly documented)
Add relevant issue to release milestone
Add pr to documentation list
Add automated tests
Add a reverse migration if a migration is present in the PR
Add migration function to

specify7/specifyweb/specify/management/commands/run_key_migration_functions.py

Line 50 in ea04665

def fix_schema_config(stdout: WriteToStdOut | None = None):

Testing instructions

Summary by CodeRabbit

Bug Fixes
- Corrected schema configuration references in data migrations
- Fixed discipline and tree definition linking issues
- Improved admin and permission initialization logic
- Enhanced query logging controls based on debug settings
Refactor
- Streamlined migration utilities for improved efficiency
- Simplified multi-database routing during migrations
- Optimized permission seeding with per-user checks
- Restructured role assignment logic using set-based operations
Chores
- Removed legacy debugging code and unused migration helpers
- Updated function signatures to remove unnecessary parameters

(cherry picked from commit ab5b3e3)

(cherry picked from commit 60a6797)

(cherry picked from commit f58c785)

(cherry picked from commit 2c3b7d5)

(cherry picked from commit c452f7d)

Fix: compare full rule definitions before deleting anything in uniqueness (cherry picked from commit c61a530)

(cherry picked from commit ef0eed0)

(cherry picked from commit b1f0346)

(cherry picked from commit 00fe897)

(cherry picked from commit c633180)

(cherry picked from commit d4c0231)

(cherry picked from commit 9953d6a)

(cherry picked from commit 8c9cc13)

(cherry picked from commit 6381382)

(cherry picked from commit bab5083)

(cherry picked from commit 28b9f5b)

(cherry picked from commit 11b6608)

(cherry picked from commit af4c963)

(cherry picked from commit 57bca64)

(cherry picked from commit 00a21ef)

(cherry picked from commit 3da5761)

(cherry picked from commit e07aaeb)

(cherry picked from commit 36b307c)

(cherry picked from commit d97acbd)

(cherry picked from commit d3c6c98)

(cherry picked from commit b183031)

(cherry picked from commit 2e9e4ad)

(cherry picked from commit 2254fc4)

(cherry picked from commit 39b001a)

(cherry picked from commit d8bd82d)

…ng links

melton-jason · 2026-06-04T16:25:08Z

@coderabbitai review

coderabbitai · 2026-06-04T16:25:15Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

specifyweb/backend/permissions/initialize.py (1)

89-92: ⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Duplicate UserPolicy assignment.

UserPolicy is assigned on line 89 and again on line 91. The second assignment is redundant.

🧹 Suggested fix

 def assign_users_to_roles(apps=apps) -> None:
     Role = apps.get_model('permissions', 'Role')
     UserPolicy = apps.get_model('permissions', 'UserPolicy')
     Agent = apps.get_model('specify', 'Agent')
-    UserPolicy = apps.get_model('permissions', 'UserPolicy')
     UserRole = apps.get_model('permissions', 'UserRole')

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@specifyweb/backend/permissions/initialize.py` around lines 89 - 92, Remove
the redundant reassignment of UserPolicy: the code calls
apps.get_model('permissions', 'UserPolicy') twice (once assigned to UserPolicy
on the same block alongside Agent and UserRole); delete the duplicate line so
UserPolicy is only assigned once and keep the existing Agent and UserRole
assignments intact to avoid shadowing or confusion.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@specifyweb/specify/migration_utils/tectonic_ranks.py`:
- Around line 97-106: Tighten root-node detection by requiring both parent=None
and rankid=0 when identifying roots: update the Exists filter on TectonicUnit
(used in the TectonicUnitTreeDef.objects.annotate root_node_exists) to include
rankid=0 alongside parent=None and definition=OuterRef("pk"), and make the same
change in the revert/delete path that currently selects root candidates by name
(the logic around lines referenced for deletion) so both forward and revert
paths use parent=None AND rankid=0 when locating root TectonicUnit rows.

In `@specifyweb/specify/migrations/0002_geo.py`:
- Line 75: The migration call to create_default_collection_types currently omits
the migration DB alias and thus writes to the default DB; change the call in the
RunPython migration in 0002_geo.py to pass the migration alias (use
schema_editor.connection.alias, e.g. create_default_collection_types(apps,
schema_editor.connection.alias)), then update the
create_default_collection_types function signature to accept a db_alias
parameter and ensure all DB operations inside (queries/creates/updates)
explicitly use .using(db_alias) or the equivalent to avoid cross-database
writes.

---

Outside diff comments:
In `@specifyweb/backend/permissions/initialize.py`:
- Around line 89-92: Remove the redundant reassignment of UserPolicy: the code
calls apps.get_model('permissions', 'UserPolicy') twice (once assigned to
UserPolicy on the same block alongside Agent and UserRole); delete the duplicate
line so UserPolicy is only assigned once and keep the existing Agent and
UserRole assignments intact to avoid shadowing or confusion.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 1694a881-7df6-4fad-8cb1-edb41d3c353b

📥 Commits

Reviewing files that changed from the base of the PR and between 3202d36 and 39ec012.

📒 Files selected for processing (16)

specifyweb/backend/businessrules/migration_utils.py
specifyweb/backend/businessrules/migrations/0004_catnum_uniquerule.py
specifyweb/backend/businessrules/uniqueness_rules.py
specifyweb/backend/patches/migration_utils.py
specifyweb/backend/permissions/initialize.py
specifyweb/backend/stored_queries/execution.py
specifyweb/backend/stored_queries/utils.py
specifyweb/specify/api/utils.py
specifyweb/specify/management/commands/run_key_migration_functions.py
specifyweb/specify/migration_utils/default_cots.py
specifyweb/specify/migration_utils/sp7_schemaconfig.py
specifyweb/specify/migration_utils/tectonic_ranks.py
specifyweb/specify/migration_utils/update_schema_config.py
specifyweb/specify/migrations/0002_geo.py
specifyweb/specify/migrations/0008_ageCitations_fix.py
specifyweb/specify/migrations/0009_tectonic_ranks.py

coderabbitai · 2026-06-04T16:35:02Z

    def consolidated_python_django_migration_operations(apps, schema_editor):
        db_alias = schema_editor.connection.alias or 'migrator'
-        create_default_collection_types(apps, using=db_alias)
+        create_default_collection_types(apps)


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Pass the migration DB alias into create_default_collection_types to avoid cross-database writes.

This call now executes on the default connection, while the migration is alias-aware elsewhere; on non-default aliases, this can populate/update records in the wrong database.

Proposed fix (cross-file)

--- a/specifyweb/specify/migrations/0002_geo.py +++ b/specifyweb/specify/migrations/0002_geo.py @@ - create_default_collection_types(apps) + create_default_collection_types(apps, using=db_alias)

--- a/specifyweb/specify/migration_utils/default_cots.py +++ b/specifyweb/specify/migration_utils/default_cots.py @@ -def create_default_collection_types(apps): +def create_default_collection_types(apps, using='default'): @@ - for collection in Collection.objects.filter(collectionobjecttype__isnull=True): + for collection in Collection.objects.using(using).filter(collectionobjecttype__isnull=True): @@ - cot, created = Collectionobjecttype.objects.get_or_create( + cot, created = Collectionobjecttype.objects.using(using).get_or_create( @@ - Collectionobject.objects.filter( + Collectionobject.objects.using(using).filter( collection=collection).update(collectionobjecttype=cot) collection.collectionobjecttype = cot - collection.save() + collection.save(using=using)

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@specifyweb/specify/migrations/0002_geo.py` at line 75, The migration call to create_default_collection_types currently omits the migration DB alias and thus writes to the default DB; change the call in the RunPython migration in 0002_geo.py to pass the migration alias (use schema_editor.connection.alias, e.g. create_default_collection_types(apps, schema_editor.connection.alias)), then update the create_default_collection_types function signature to accept a db_alias parameter and ensure all DB operations inside (queries/creates/updates) explicitly use .using(db_alias) or the equivalent to avoid cross-database writes.

I've implemented a more general way of ensuring all current and future functions within run_key_migration_functions will be executed with the same database connection that Migrations are run in.
The important changes are included in
9f9df79 (this PR), can you take a look and give me your thoughts?

The only difference between the migrations database and the default database is the user that is used to connect.
Ideally, the functions executed as part of run_key_migration_functions should never require the elevated level of permissions that is provided to the migrations user (the ability to perform DDL statements like dropping and creating tables, indices, etc.).
With this in mind, I am generally on the fence about whether we should even allow the migrator user to perform run_key_migration_functions: they should only be data movements, and never DDL statements.

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash gh api repos/specify/specify7/commits/9f9df797b631157353253296059e5cb7f2ec73fd --jq '.files[] | "=== \(.filename) ===\n\(.patch)"'

Length of output: 10197

🏁 Script executed:

#!/bin/bash # Check how 'migrations' alias is set up in settings rg -n "migrations" specifyweb/settings/__init__.py -A 3 -B 3 # Check if there's a fallback/guard for the 'migrations' alias rg -n "migrations" specifyweb/specify/migration_utils/router.py

Length of output: 748

@melton-jason The approach is elegant — a ContextVar-based router is the right tool here: it's thread-safe, requires no signature changes across utility functions, and scales automatically to future functions added to run_key_migration_functions. A few thoughts:

What works well:

The ContextVar/MigrationRouter combination is clean and idiomatic Django. Returning None implicitly when the flag is unset means it has zero impact on normal ORM calls outside of run_key_migration_functions.

The 'migrations' alias is always defined in settings/__init__.py, so there's no risk of ConnectionDoesNotExist.

The @contextmanager decorator makes _GeneratorContextManager a ContextDecorator (since Python 3.2), so the decorator-style example in the docstring (@use_migration_connection()) is valid.

One cosmetic nit:
The module-level docstring in router.py appears between the use_migration_connection function and the MigrationRouter class, rather than at the top or directly above the class. It would read more clearly if placed directly above class MigrationRouter: as a class docstring or a # comment block.

On the design question you raised:
Your instinct is reasonable. Since run_key_migration_functions is intended to be DML-only, unconditionally routing it through the 'migrations' connection does hand it DDL-level credentials unnecessarily. A few options:

Keep current approach — favors parity with migration execution context; acceptable if the migrator user's extra privileges don't pose a security concern in your deployment.

Remove use_migration_connection() from run_key_migration_functions — it would then use 'default' (app user, DML-only credentials), which matches the principle of least privilege. The risk is that future contributors accidentally add DDL-requiring calls.

Add an opt-in flag (e.g. --use-migration-connection) — makes the intent explicit and allows both cases.

Option 2 seems like the safer long-term default given your stated goal, but it's your call based on how the command is invoked in practice.

Overall the original concern about cross-database writes is now properly resolved via this approach.

melton-jason · 2026-06-03T15:03:55Z

+        # REFACTOR: See if we can simplify this even further. We should be able
+        # to collapse this query -> iteration -> check workflow to a single
+        # query.
+        # That would eliminate the N + 1 problem with this current approach,
+        # where every scoped rule needs to be evaluated.


(note for discussion)

I left this comment in the code, but I'll leave this as a comment on the PR for visibility.
There was a noticeable change in the asymptotic growth (time complexity) with this change.

Noticeably, the previous implementation of fix_global_default_rules:

Iterated over each distinct non-null DisciplineID

For each DisciplineID, performed a minimum of 2 queries to identify the "duplicate" scoped Uniqueness Rule and another 2 queries to delete them

Importantly, the number of Queries and iteration only grew on the number of Disciplines. So assuming the number of Disciplines remained the same, there could be any number of added Uniqueness Rules within those Disciplines without seeing a slowdown or additional database queries.
(This is only partly/theoretically true: there will be a slowdown for those queries if there is a sufficiently large amount of Uniqueness Rules and the queries are performed on unindexed fields or in a way that can't utilize the indexes, though I'm holding that slowdown as negligible or constant between the two approaches).

I've cleaned up the new approach and eliminated extra queries in 6e37e1e (this PR), but there still is a change in the approach:

Now we make a DB query to fetch all Uniqueness Rules that are scoped (have a non-null DisciplineID)

For each rule, we check whether there is an identical unscoped Uniqueness Rule

If there is an identified duplicate, we delete the Uniqueness Rule

Our "bottleneck" is now largely on the amount of scoped Uniqueness Rules, rather than the number of Disciplines.
Practically, I'm not sure how much of a slowdown this would introduce.
If we have the time, we can try and resolve this slowdown (I imagine we can use and filter on annotations on the query for scoped rules for duplicate rules, or just revert back to something like the past approach if there are no bugs), and/or we can evaluate how much of a slowdown this introduces.

melton-jason · 2026-06-03T15:58:54Z

+        # The ordering here for checks here is intentional: it's more likely a
+        # user has Sp 7 permissions than being an admin, so we do the former
+        # check first
+        if is_sp6_user_permissions_migrated(user=user, apps=apps):


(note here on why I brought back is_sp6_user_permissions_migrated)

I brought back is_sp6_user_permissions_migrated here.
It was needed for the following workflow:

A user is an admin in Specify 6, and should not be an admin in Specify 7

In this scenario, the user's Specify 7 Institution Admin privileges have been explicitly revoked

Previously, because this is a part of Run Key Migration Functions, the user in question would be re-made into an Institutional Admin, even if they were explicitly removed as an admin.

With is_sp6_user_permissions_migrated, we assume that users that don't have Sp 7 permissions can't have been intentionally removed as Specify 7 Institution Admins, so we can safely migrate their Admin preference. Otherwise, we can't tell if the user was intentionally removed as an admin

melton-jason · 2026-06-05T15:44:31Z

    def consolidated_python_django_migration_operations(apps, schema_editor):
        db_alias = schema_editor.connection.alias or 'migrator'
-        create_default_collection_types(apps, using=db_alias)
+        create_default_collection_types(apps)


I've implemented a more general way of ensuring all current and future functions within run_key_migration_functions will be executed with the same database connection that Migrations are run in.
The important changes are included in
9f9df79 (this PR), can you take a look and give me your thoughts?

The only difference between the migrations database and the default database is the user that is used to connect.
Ideally, the functions executed as part of run_key_migration_functions should never require the elevated level of permissions that is provided to the migrations user (the ability to perform DDL statements like dropping and creating tables, indices, etc.).
With this in mind, I am generally on the fence about whether we should even allow the migrator user to perform run_key_migration_functions: they should only be data movements, and never DDL statements.

…e applied

melton-jason · 2026-06-05T21:04:40Z

@coderabbitai review

coderabbitai · 2026-06-05T21:04:46Z

✅ Action performed

Review finished.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

specifyweb/backend/businessrules/uniqueness_rules.py (1)

531-548: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Bind this cleanup transaction to the same DB alias as the ORM writes.

With the migration-router changes in this PR, UniquenessRule can be written on a non-default alias, but transaction.atomic() here still opens a transaction on the default connection. If an error occurs mid-cleanup, the deletes on the routed DB are no longer protected by this atomic block.

Suggested fix

+    db_alias = router.db_for_write(UniquenessRule)
-    with transaction.atomic():
+    with transaction.atomic(using=db_alias):
         # REFACTOR: See if we can simplify this even further. We should be able
         # to collapse this query -> iteration -> check workflow to a single
         # query.
         # That would eliminate the N + 1 problem with this current approach,
         # where every scoped rule needs to be evaluated.
-        for rule in UniquenessRule.objects.exclude(discipline__isnull=True).prefetch_related("uniquenessrulefield_set"):
+        for rule in (
+            UniquenessRule.objects.using(db_alias)
+            .exclude(discipline__isnull=True)
+            .prefetch_related("uniquenessrulefield_set")
+        ):

If this helper is migration-only, passing schema_editor.connection.alias down from the caller would be safer still.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@specifyweb/backend/businessrules/uniqueness_rules.py` around lines 531 - 548,
The transaction.atomic() block must be bound to the same DB alias used for
writing UniquenessRule records; change the cleanup to accept an explicit DB
alias (e.g. pass schema_editor.connection.alias from the caller if
migration-only) and use transaction.atomic(using=alias) and operate on the
routed DB by calling UniquenessRule.objects.using(alias).exclude(...) and use
.using(alias) for the queryset deletes
(rule.uniquenessrulefield_set.all().using(alias).delete() and
rule.delete(using=alias) or delete via the queryset) so all reads/deletes are
executed and rolled back on the same connection.

♻️ Duplicate comments (1)

specifyweb/specify/migration_utils/default_cots.py (1)

33-63: ⚠️ Potential issue | 🟠 Major

Keep these migration helpers alias-aware.

Dropping the explicit DB alias here means 0002_geo now runs these reads/writes on Django’s default connection unless every caller re-routes them some other way. On non-default migration runs, that seeds and backfills the wrong database.

In Django 4.2 `RunPython` migrations, do queryset reads/writes and `model.save()` automatically use `schema_editor.connection.alias`, or must migrations explicitly call `.using(schema_editor.connection.alias)` / `save(using=...)`?

Also applies to: 65-86, 111-123

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@specifyweb/specify/migration_utils/default_cots.py` around lines 33 - 63, The
helper create_default_discipline_for_tree_defs (and the other helpers at the
noted ranges) currently perform queries and .save() calls without honoring the
migration DB alias; change their signatures to accept (apps, schema_editor), get
alias = schema_editor.connection.alias, and perform all queries as
Model.objects.using(alias).all() (e.g., Discipline.objects.using(alias).all(),
Institution.objects.using(alias).all()) and call .save(using=alias) when saving
tree definition instances; update any other helpers mentioned (lines 65-86,
111-123) to follow the same pattern so reads/writes run on
schema_editor.connection.alias rather than the default DB.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@specifyweb/backend/businessrules/uniqueness_rules.py`:
- Around line 198-206: The migration gate and cleanup hard-code the default DB
and cache key, causing DB-alias leakage; update
_initial_businessrules_migration_applied and
_cached_businessrules_migration_applied to accept a connection or alias (or
derive it from the registry/registry.apps) instead of using
connections["default"], and use a cache key that includes the connection.alias;
update validate_unique to pass the right alias/connection when calling
_cached_businessrules_migration_applied. In fix_global_default_rules (migration
0008_fix_global_default_rules.py) run cleanup against schema_editor.connection
by wrapping operations in
transaction.atomic(using=schema_editor.connection.alias) and perform
queries/deletes with Model.objects.using(schema_editor.connection.alias) (or
QuerySet.using) so all reads/writes occur on the migration-provided
connection/transaction. Ensure all references mention the functions
_initial_businessrules_migration_applied,
_cached_businessrules_migration_applied, validate_unique, and
fix_global_default_rules/schema_editor.connection.alias.

In `@specifyweb/specify/migrations/0009_tectonic_ranks.py`:
- Around line 17-46: Narrow the reverse deletion to only remove the exact
default chain created by this migration by adding checks for the exact
parent/child chain and names instead of deleting any "Root" or leaf with that
name: update the TectonicUnit queryset to require a Root node whose single child
is named "Tectonic Unit" (and that child’s single child is "Tectonic Subunit" if
applicable), ensure counts/no extra siblings (e.g. annotate child_count via
Subquery/Count and require expected counts), and similarly tighten the
TectonicUnitTreeDefItem filter to assert the exact tree shape (Root -> "Tectonic
Unit" -> "Tectonic Subunit") via Exists/Subquery checks against parent and
definitionitem relationships before calling .delete() so only the specific chain
created by this migration is removed.

---

Outside diff comments:
In `@specifyweb/backend/businessrules/uniqueness_rules.py`:
- Around line 531-548: The transaction.atomic() block must be bound to the same
DB alias used for writing UniquenessRule records; change the cleanup to accept
an explicit DB alias (e.g. pass schema_editor.connection.alias from the caller
if migration-only) and use transaction.atomic(using=alias) and operate on the
routed DB by calling UniquenessRule.objects.using(alias).exclude(...) and use
.using(alias) for the queryset deletes
(rule.uniquenessrulefield_set.all().using(alias).delete() and
rule.delete(using=alias) or delete via the queryset) so all reads/deletes are
executed and rolled back on the same connection.

---

Duplicate comments:
In `@specifyweb/specify/migration_utils/default_cots.py`:
- Around line 33-63: The helper create_default_discipline_for_tree_defs (and the
other helpers at the noted ranges) currently perform queries and .save() calls
without honoring the migration DB alias; change their signatures to accept
(apps, schema_editor), get alias = schema_editor.connection.alias, and perform
all queries as Model.objects.using(alias).all() (e.g.,
Discipline.objects.using(alias).all(), Institution.objects.using(alias).all())
and call .save(using=alias) when saving tree definition instances; update any
other helpers mentioned (lines 65-86, 111-123) to follow the same pattern so
reads/writes run on schema_editor.connection.alias rather than the default DB.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 823940ce-81d2-4931-804a-bacdc15b9a66

📥 Commits

Reviewing files that changed from the base of the PR and between 39ec012 and 9ddf13e.

📒 Files selected for processing (9)

specifyweb/backend/businessrules/uniqueness_rules.py
specifyweb/settings/__init__.py
specifyweb/specify/management/commands/run_key_migration_functions.py
specifyweb/specify/migration_utils/default_cots.py
specifyweb/specify/migration_utils/misc_migrations.py
specifyweb/specify/migration_utils/router.py
specifyweb/specify/migration_utils/tectonic_ranks.py
specifyweb/specify/migrations/0002_geo.py
specifyweb/specify/migrations/0009_tectonic_ranks.py

💤 Files with no reviewable changes (1)

specifyweb/specify/migration_utils/misc_migrations.py

✅ Files skipped from review due to trivial changes (1)

specifyweb/settings/init.py

🚧 Files skipped from review as they are similar to previous changes (1)

specifyweb/specify/management/commands/run_key_migration_functions.py

coderabbitai · 2026-06-05T21:19:20Z

+    # If we're in a migration where businessrules have not been loaded and/or
+    # the initial businessrule migration has not been applied, then skip
+    # checking the rule for now.
+    # Note that the former can exist where the latter does: if we're reversing
+    # a migration which does not have a dependency on businessrules (so the
+    # businessrules app does not need to be loaded) but the businessrule
+    # migration is still applied
+    if not businessrule_app_is_ready(registry) or not _cached_businessrules_migration_applied():
+        return


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash set -euo pipefail FILE="specifyweb/backend/businessrules/uniqueness_rules.py" echo "== File: $FILE ==" wc -l "$FILE" echo echo "== Function locations ==" rg -n "def (_initial_businessrules_migration_applied|_cached_businessrules_migration_applied|businessrule_app_is_ready)\b" "$FILE" || true echo echo "== Cache definition / usage (near _uniqueness_migration_cache) ==" rg -n "_uniqueness_migration_cache" "$FILE" || true echo echo "== validate_unique snippet around lines 170-240 ==" nl -ba "$FILE" | sed -n '160,240p' # Print around the helper defs (by ranges around the matched line numbers) echo echo "== Top of file imports/context (first 120 lines) ==" nl -ba "$FILE" | sed -n '1,120p'

Repository: specify/specify7

Length of output: 810

🏁 Script executed:

#!/bin/bash set -euo pipefail FILE="specifyweb/backend/businessrules/uniqueness_rules.py" echo "== Cache definition block ==" sed -n '40,140p' "$FILE" | cat -n echo echo "== _initial_businessrules_migration_applied() body ==" sed -n '80,120p' "$FILE" | cat -n echo echo "== businessrule_app_is_ready(registry) body ==" sed -n '96,140p' "$FILE" | cat -n echo echo "== _cached_businessrules_migration_applied() body ==" sed -n '100,140p' "$FILE" | cat -n echo echo "== validate_unique early-return block (context) ==" sed -n '160,220p' "$FILE" | cat -n

Repository: specify/specify7

Length of output: 13770

🏁 Script executed:

#!/bin/bash set -euo pipefail FILE="specifyweb/backend/businessrules/uniqueness_rules.py" rg -n "def fix_global_default_rules" "$FILE" # Print around the function sed -n '220,340p' "$FILE" | cat -n

Repository: specify/specify7

Length of output: 5308

🏁 Script executed:

#!/bin/bash set -euo pipefail FILE="specifyweb/backend/businessrules/uniqueness_rules.py" echo "== fix_global_default_rules() around its definition ==" sed -n '480,560p' "$FILE" | cat -n echo echo "== Any transaction.atomic usage in the file ==" rg -n "transaction\.atomic\(" "$FILE"

Repository: specify/specify7

Length of output: 3446

🏁 Script executed:

#!/bin/bash set -euo pipefail rg -n "fix_global_default_rules\(" -S specifyweb | head -n 50

Repository: specify/specify7

Length of output: 276

🏁 Script executed:

#!/bin/bash set -euo pipefail FILE="specifyweb/backend/businessrules/migrations/0008_fix_global_default_rules.py" echo "== File ==" wc -l "$FILE" echo echo "== Contents ==" cat -n "$FILE"

Repository: specify/specify7

Length of output: 781

🏁 Script executed:

#!/bin/bash set -euo pipefail FILE="specifyweb/backend/businessrules/uniqueness_rules.py" rg -n "router" "$FILE" | head -n 50 rg -n "from django\.db import .*router" "$FILE" || true

Repository: specify/specify7

Length of output: 176

Fix DB-alias leakage in the businessrules migration gate and the global-rule cleanup.

validate_unique() calls _cached_businessrules_migration_applied(), but both _initial_businessrules_migration_applied() and _cached_businessrules_migration_applied() hard-code connections["default"] and cache key "default". On non-default DB aliases this can check the wrong migration state and incorrectly skip uniqueness enforcement.

fix_global_default_rules() wraps cleanup in with transaction.atomic(): (default connection) and performs deletes/reads without tying them to schema_editor.connection.alias. Migration 0008_fix_global_default_rules.py ignores schema_editor, so the cleanup may run against the wrong database/transaction boundary.

Suggested fix (migration gate)

-def _initial_businessrules_migration_applied(): +def _initial_businessrules_migration_applied(db_alias: str): return any( app == "businessrules" and migration_name == "0001_initial" for app, migration_name in MigrationRecorder( - connections["default"] + connections[db_alias] ).applied_migrations() ) -def _cached_businessrules_migration_applied() -> bool: - cache_key = "default" +def _cached_businessrules_migration_applied(db_alias: str) -> bool: + cache_key = db_alias cache_is_active, is_set = _uniqueness_migration_cache.get(cache_key, default=False) if cache_is_active and is_set: return True is_applied = _initial_businessrules_migration_applied(db_alias) if cache_is_active and is_applied: _uniqueness_migration_cache.set(cache_key, is_applied) return is_applied + db_alias = instance._state.db or router.db_for_write(model, instance=instance) - if not businessrule_app_is_ready(registry) or not _cached_businessrules_migration_applied(): + if not businessrule_app_is_ready(registry) or not _cached_businessrules_migration_applied(db_alias): return

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@specifyweb/backend/businessrules/uniqueness_rules.py` around lines 198 - 206, The migration gate and cleanup hard-code the default DB and cache key, causing DB-alias leakage; update _initial_businessrules_migration_applied and _cached_businessrules_migration_applied to accept a connection or alias (or derive it from the registry/registry.apps) instead of using connections["default"], and use a cache key that includes the connection.alias; update validate_unique to pass the right alias/connection when calling _cached_businessrules_migration_applied. In fix_global_default_rules (migration 0008_fix_global_default_rules.py) run cleanup against schema_editor.connection by wrapping operations in transaction.atomic(using=schema_editor.connection.alias) and perform queries/deletes with Model.objects.using(schema_editor.connection.alias) (or QuerySet.using) so all reads/writes occur on the migration-provided connection/transaction. Ensure all references mention the functions _initial_businessrules_migration_applied, _cached_businessrules_migration_applied, validate_unique, and fix_global_default_rules/schema_editor.connection.alias.

coderabbitai · 2026-06-05T21:19:20Z

+    TectonicUnit.objects.annotate(
+        has_children_nodes=Exists(
+            TectonicUnit.objects.filter(
+                parent=OuterRef("pk")
+            )
+        )
+    ).filter(
+        parent__isnull=True,
+        has_children_nodes=False,
+        name="Root"
+    ).delete()
+
+    # Delete the Root TectonicUnit rank if there are no nodes in the tree and
+    # no children rank reference the Root rank
+    TectonicUnitTreeDefItem.objects.annotate(
+        has_nodes=Exists(
+            TectonicUnit.objects.filter(
+                definitionitem=OuterRef("pk")
+            )
+        ),
+        has_child_rank=Exists(
+            TectonicUnitTreeDefItem.objects.filter(
+                parent=OuterRef("pk")
+            )
+        )
+    ).filter(
+        has_nodes=False,
+        has_child_rank=False,
+        name="Root"
+    ).delete()


⚠️ Potential issue | 🟠 Major | 🏗️ Heavy lift

Tighten the rollback scope before deleting tectonic data.

These reverse helpers still match on generic names/leaf-ness, not on records this migration can prove it created. A rollback can therefore strip pre-existing custom trees that happen to use Root, Tectonic Unit, Tectonic Subunit, etc. Please scope the deletes to the exact default chain/tree shape created by the forward migration instead of pruning by name alone.

Also applies to: 55-103

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@specifyweb/specify/migrations/0009_tectonic_ranks.py` around lines 17 - 46, Narrow the reverse deletion to only remove the exact default chain created by this migration by adding checks for the exact parent/child chain and names instead of deleting any "Root" or leaf with that name: update the TectonicUnit queryset to require a Root node whose single child is named "Tectonic Unit" (and that child’s single child is "Tectonic Subunit" if applicable), ensure counts/no extra siblings (e.g. annotate child_count via Subquery/Count and require expected counts), and similarly tighten the TectonicUnitTreeDefItem filter to assert the exact tree shape (Root -> "Tectonic Unit" -> "Tectonic Subunit") via Exists/Subquery checks against parent and definitionitem relationships before calling .delete() so only the specific chain created by this migration is removed.

CarolineDenis added 30 commits June 1, 2026 12:27

fix:reverse_hide_component_fields to be set to Falses

2a5eda8

(cherry picked from commit ab5b3e3)

fix: Do not dedupe localized strings across all languages

e8466af

(cherry picked from commit 60a6797)

fix: Simplify MultipleObjectsReturned

25bf8b2

(cherry picked from commit f58c785)

Fix: Use db_alias for migration db connexion

063a7c6

(cherry picked from commit 2c3b7d5)

Fix: flips existing row back to isDatabaseConstraint=True

6cc5ffa

(cherry picked from commit c452f7d)

@CarolineDenis

1ee0f1e

Fix: compare full rule definitions before deleting anything in uniqueness (cherry picked from commit c61a530)

Fix: verify existing user permissions

f0dd766

(cherry picked from commit ef0eed0)

Fix: Remove the use of f-string

6dfe4eb

(cherry picked from commit b1f0346)

fix: prevent incorrect reuse when pairing tree defs and disciplines

8893687

(cherry picked from commit 00fe897)

fix: remove exclusion so partially migrated disciplines are repaired

32e2972

(cherry picked from commit c633180)

Fix: Update splocalecontainer items

6d1403c

(cherry picked from commit d4c0231)

Fix: Remove shadowing import in geo migration

d60939d

(cherry picked from commit 9953d6a)

Fix: Revert relative age migration instead of applying again

470d99d

(cherry picked from commit 8c9cc13)

Fix fix order of revert migration in tectonic migration

befb73b

(cherry picked from commit 6381382)

Fix: Indentation

8a1e35b

(cherry picked from commit bab5083)

Fix: Use deterministic ordering before positional pairing

82bb520

(cherry picked from commit 28b9f5b)

Fix: Log only on debug

8edb112

(cherry picked from commit 11b6608)

Fix: Fix age type

efc8228

(cherry picked from commit af4c963)

Fix: Improve logger

2366664

(cherry picked from commit 57bca64)

Fix: Remove unecessary param in def migration

4c0f9f8

(cherry picked from commit 00a21ef)

Fix: Remove projects from legacy tests

fac87e9

(cherry picked from commit 3da5761)

Fix: Change settings import

c8663e9

(cherry picked from commit e07aaeb)

Fix:Incomplete field initialization in conditional createt

a940612

(cherry picked from commit 36b307c)

Fix: Incomplete field initialization in conditional create

4e1d45a

(cherry picked from commit d97acbd)

Fix: Put model name in lower case for consistency and reversability

e228051

(cherry picked from commit d3c6c98)

Fix: Revert chnages

aff97b8

(cherry picked from commit b183031)

Fix: Remove double id rank

1e13637

(cherry picked from commit 2e9e4ad)

Fix: Indentation

624deee

(cherry picked from commit 2254fc4)

Fix: Add strict=False to allow diff length

cd740a7

(cherry picked from commit 39b001a)

Fix: Remove duplicate

05f6298

(cherry picked from commit d8bd82d)

CarolineDenis mentioned this pull request Jun 3, 2026

Integrate sync_schema_config_fields fixes into main #8067

Closed

7 tasks

CarolineDenis linked an issue Jun 3, 2026 that may be closed by this pull request

Prevent sync_schema_config_fields from creating schema records for {table}Id fields #8065

Open

melton-jason added 13 commits June 3, 2026 09:28

fix: use manager over base manager for tree patches

48face1

fix: remove check for old newly created DBs in specify 7

fdfe21b

fix: bring back is_sp6_user_permissions_migrated

96bf0cc

fix: change ordering of admin checks to evaluate more common first

cfad41f

fix: remove unused query

2e1835e

refactor: use defaults in get_or_create for Roles

f4d6491

chore: add refactor note

f792d5e

refactor: move DEBUG check into log_sqlalchemy_query

91aaf38

fix: remove unused log_sqlalchemy_query function

6b4d796

fix: incorrect Discipline -> tectonicunittreedef pairing when resolvi…

ab8320b

…ng links

refactor: optimize fix_taxon_treedef_discipline_links to one query

4196c1e

refactor: move fix_tectonic_links to TectonicUnit file

6e1902e

fix: forward tectonic unit migration functions

39ec012

coderabbitai Bot requested changes Jun 4, 2026

View reviewed changes

github-project-automation Bot moved this from 📋Back Log to Dev Attention Needed in General Tester Board Jun 4, 2026

melton-jason added 2 commits June 5, 2026 10:21

fix: use Migrator user for all run_key_migration operations

9f9df79

perf: reduce set_discipline_for_taxon_treedefs to single DB hit

a95bc71

melton-jason commented Jun 5, 2026

View reviewed changes

melton-jason added 5 commits June 5, 2026 11:58

refactor: collapse default tectonicunit ranks to tuple

1136e52

fix: simplify and move reverse tectonic root node migration

5418c64

fix: handle case when businessrule app is not ready but migrations ar…

419d69b

…e applied

refactor: move reverse migration to migration file

19f5c1a

fix: account for custom Tectonic Trees when reverting migration

9ddf13e

coderabbitai Bot requested changes Jun 5, 2026

View reviewed changes

Conversation

melton-jason commented Jun 1, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Testing instructions

Summary by CodeRabbit

Uh oh!

melton-jason commented Jun 4, 2026

Uh oh!

coderabbitai Bot commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

melton-jason Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

melton-jason Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

melton-jason Jun 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

melton-jason Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

melton-jason commented Jun 5, 2026

Uh oh!

coderabbitai Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

melton-jason commented Jun 1, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 4, 2026 •

edited

Loading

coderabbitai Bot Jun 4, 2026 •

edited

Loading

coderabbitai Bot commented Jun 5, 2026 •

edited

Loading