Solr
Solr indexing utilities for GSO.
show_active_config(ctx)
Log the effective config we are using (purely informational).
Source code in gso/cli/solr.py
_stream_write_json_array(out_path, docs)
Write a valid JSON array to disk from a stream without buffering all docs.
Returns number of documents written.
Source code in gso/cli/solr.py
ensure_copy_field()
Ensure the catch-all copy-field to _text_
exists on the configured core.
reindex_all(batch_size=500, skip_copy_field=False, dry_run_json=None)
Stream data from Postgres and index into Solr (no giant in-memory lists).
Source code in gso/cli/solr.py
dump_docs(out_json=typer.Argument(..., help='Path to write the generated JSON array (no Solr post).'))
Generate Solr docs (streaming) and write them to a JSON file for inspection.