worker: pin Jython-era Ghidra (11.2.1) for headless .py post-script
Ghidra 11.4+/12.x dropped the bundled Jython, so the .py extractor fails headless with "Ghidra was not started with PyGhidra. Python is not available" — analysis succeeds but the post-script never runs, so no snapshot is produced. Default GHIDRA_URL now points at 11.2.1 (Jython); README documents the constraint and the PyGhidra path for staying on 12.x. Keeps the local Dockerfile fixes (pip upgrade, non-editable install). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
This commit is contained in:
12
README.md
12
README.md
@@ -63,11 +63,19 @@ Ghidra headless + `extract_engine_surface.py` → snapshot → import do Postgre
|
|||||||
Worker (`docker/worker.Dockerfile`, `eclipse-temurin:21-jdk`) pobiera Ghidrę i ustawia
|
Worker (`docker/worker.Dockerfile`, `eclipse-temurin:21-jdk`) pobiera Ghidrę i ustawia
|
||||||
`GHIDRA_HOME=/opt/ghidra`. Wersja jest przypięta w `ARG GHIDRA_URL`. Jeśli build padnie na pobieraniu,
|
`GHIDRA_HOME=/opt/ghidra`. Wersja jest przypięta w `ARG GHIDRA_URL`. Jeśli build padnie na pobieraniu,
|
||||||
nadpisz URL realnym wydaniem z [releases NSA](https://github.com/NationalSecurityAgency/ghidra/releases)
|
nadpisz URL realnym wydaniem z [releases NSA](https://github.com/NationalSecurityAgency/ghidra/releases)
|
||||||
(nazwa pliku: `ghidra_<wer>_PUBLIC_<data>.zip`):
|
(nazwa pliku: `ghidra_<wer>_PUBLIC_<data>.zip`).
|
||||||
|
|
||||||
|
> **Musi to być Ghidra ≤ 11.3.x.** Ekstraktor to skrypt **Pythona (`.py`)**, który Ghidra w trybie
|
||||||
|
> headless uruchamia przez wbudowanego **Jythona**. Ghidra **11.4+ / 12.x usunęły Jythona** — tam
|
||||||
|
> `.py` headless wymaga **PyGhidry** (CPython), której ten obraz nie inicjalizuje, i dostaniesz
|
||||||
|
> `Ghidra was not started with PyGhidra. Python is not available` (analiza przejdzie, ale post-skrypt
|
||||||
|
> nie wyemituje snapshotu). Domyślny `GHIDRA_URL` celuje w 11.2.1 (z Jythonem). Chcesz zostać na 12.x?
|
||||||
|
> Trzeba doinstalować `pyghidra` i odpalać headless przez PyGhidrę — sam skrypt jest CPython-kompatybilny,
|
||||||
|
> więc zadziała, gdy interpreter wstanie (patrz dokumentacja PyGhidra w danej wersji Ghidry).
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
docker compose build worker \
|
docker compose build worker \
|
||||||
--build-arg GHIDRA_URL=https://github.com/NationalSecurityAgency/ghidra/releases/download/Ghidra_11.3.2_build/ghidra_11.3.2_PUBLIC_20250415.zip
|
--build-arg GHIDRA_URL=https://github.com/NationalSecurityAgency/ghidra/releases/download/Ghidra_11.2.1_build/ghidra_11.2.1_PUBLIC_20241105.zip
|
||||||
docker compose up
|
docker compose up
|
||||||
```
|
```
|
||||||
|
|
||||||
|
|||||||
@@ -9,9 +9,8 @@ COPY ams ./ams
|
|||||||
COPY ghidra_scripts ./ghidra_scripts
|
COPY ghidra_scripts ./ghidra_scripts
|
||||||
COPY snapshots ./snapshots
|
COPY snapshots ./snapshots
|
||||||
|
|
||||||
# Editable install keeps ams + ghidra_scripts co-located (the worker resolves the script
|
# The API needs the queue client too, to enqueue jobs.
|
||||||
# path relative to the package). The API needs the queue client too, to enqueue jobs.
|
RUN pip install --no-cache-dir ".[api]" rq redis "psycopg[binary]>=3.1"
|
||||||
RUN pip install --no-cache-dir -e ".[api]" rq redis "psycopg[binary]>=3.1"
|
|
||||||
|
|
||||||
ENV AMS_UPLOAD_DIR=/data/uploads
|
ENV AMS_UPLOAD_DIR=/data/uploads
|
||||||
EXPOSE 8000
|
EXPOSE 8000
|
||||||
|
|||||||
@@ -3,9 +3,16 @@
|
|||||||
#
|
#
|
||||||
# Override the Ghidra build without editing this file:
|
# Override the Ghidra build without editing this file:
|
||||||
# docker build --build-arg GHIDRA_URL=https://github.com/.../ghidra_X_PUBLIC_DATE.zip ...
|
# docker build --build-arg GHIDRA_URL=https://github.com/.../ghidra_X_PUBLIC_DATE.zip ...
|
||||||
|
#
|
||||||
|
# IMPORTANT: the extractor is a Python (.py) headless post-script, which Ghidra runs via its
|
||||||
|
# bundled **Jython**. Ghidra 11.4+ / 12.x REMOVED Jython - there `.py` headless needs PyGhidra
|
||||||
|
# (CPython), which this image doesn't initialise, and you'll get:
|
||||||
|
# "Ghidra was not started with PyGhidra. Python is not available"
|
||||||
|
# So pin a Jython-era release (<= 11.3.x). If this URL 404s, copy the exact filename from
|
||||||
|
# https://github.com/NationalSecurityAgency/ghidra/releases (form: ghidra_<ver>_PUBLIC_<date>.zip).
|
||||||
FROM eclipse-temurin:21-jdk-jammy
|
FROM eclipse-temurin:21-jdk-jammy
|
||||||
|
|
||||||
ARG GHIDRA_URL=https://github.com/NationalSecurityAgency/ghidra/releases/download/Ghidra_11.3_build/ghidra_11.3_PUBLIC_20250205.zip
|
ARG GHIDRA_URL=https://github.com/NationalSecurityAgency/ghidra/releases/download/Ghidra_11.2.1_build/ghidra_11.2.1_PUBLIC_20241105.zip
|
||||||
|
|
||||||
# Runtime deps: python (the package), unzip/wget (fetch Ghidra), libarchive-tools (bsdtar:
|
# Runtime deps: python (the package), unzip/wget (fetch Ghidra), libarchive-tools (bsdtar:
|
||||||
# unpacks ISO9660 + ZIP game archives).
|
# unpacks ISO9660 + ZIP game archives).
|
||||||
@@ -19,6 +26,8 @@ RUN wget -q "$GHIDRA_URL" -O /tmp/ghidra.zip \
|
|||||||
&& mv /opt/ghidra_* /opt/ghidra \
|
&& mv /opt/ghidra_* /opt/ghidra \
|
||||||
&& rm /tmp/ghidra.zip
|
&& rm /tmp/ghidra.zip
|
||||||
|
|
||||||
|
RUN pip3 install --no-cache-dir --upgrade pip setuptools wheel
|
||||||
|
|
||||||
ENV GHIDRA_HOME=/opt/ghidra
|
ENV GHIDRA_HOME=/opt/ghidra
|
||||||
ENV AMS_GHIDRA_SCRIPTS=/app/ghidra_scripts
|
ENV AMS_GHIDRA_SCRIPTS=/app/ghidra_scripts
|
||||||
ENV AMS_UPLOAD_DIR=/data/uploads
|
ENV AMS_UPLOAD_DIR=/data/uploads
|
||||||
@@ -29,7 +38,7 @@ COPY ams ./ams
|
|||||||
COPY ghidra_scripts ./ghidra_scripts
|
COPY ghidra_scripts ./ghidra_scripts
|
||||||
COPY snapshots ./snapshots
|
COPY snapshots ./snapshots
|
||||||
|
|
||||||
RUN pip3 install --no-cache-dir -e ".[api,acquire,worker]"
|
RUN pip3 install --no-cache-dir ".[api,acquire,worker]"
|
||||||
|
|
||||||
# Drain the 'acquire' queue. Shell form so $REDIS_URL expands at runtime.
|
# Drain the 'acquire' queue. Shell form so $REDIS_URL expands at runtime.
|
||||||
CMD rq worker --url "${REDIS_URL:-redis://redis:6379/0}" acquire
|
CMD rq worker --url "${REDIS_URL:-redis://redis:6379/0}" acquire
|
||||||
|
|||||||
Reference in New Issue
Block a user