Published

January 6, 2025

Lazy and strict reading of CEX sources

You can instantiate a DSECollection from a delimited-text (CEX) data source using the fromcex function. A source can be read in one of two ways:

  • lazy: the source is a single CITE relations block
  • strict: the source includes full metadata for one or more sources
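To make "a single cite relations block" more concrete, here is a minimal sketch in plain Julia that splits CEX text into labelled blocks and pulls the data rows out of a relations block. It does not use CitablePhysicalText at all. The helper name cexblocks, the sample URNs, and the column names are hypothetical, and the "#!citerelationset" label and pipe delimiter are assumed from common CEX conventions rather than taken from the HMT file.

```julia
# Hypothetical sample data: URNs, labels, and column names are made up
# for illustration and are not taken from hmt-2022k.cex.
cex = """
#!citerelationset
urn|urn:cite2:example:dse.v1:all
label|Example DSE records
passage|imageroi|surface
urn:cts:example:text.v1:1.1|urn:cite2:example:img.v1:img1@0.1,0.1,0.5,0.1|urn:cite2:example:pages.v1:1r
urn:cts:example:text.v1:1.2|urn:cite2:example:img.v1:img1@0.1,0.2,0.5,0.1|urn:cite2:example:pages.v1:1r
"""

# Split CEX text into label => lines pairs.
# Assumption: every block begins with a line starting with "#!".
function cexblocks(text::AbstractString)
    blocks = Pair{String,Vector{String}}[]
    for ln in split(text, '\n')
        s = strip(ln)
        isempty(s) && continue              # skip blank lines
        if startswith(s, "#!")
            # Start a new block named by the text after "#!"
            push!(blocks, String(s[3:end]) => String[])
        elseif !isempty(blocks)
            # Any other line belongs to the most recent block
            push!(last(blocks).second, String(s))
        end
    end
    blocks
end

blocks = cexblocks(cex)

# Keep only the relations blocks; this sample has exactly one.
relations = [lines for (label, lines) in blocks if label == "citerelationset"]

# Skip the urn line, the label line, and the column header, then split each row.
rows = [split(r, '|') for r in relations[1][4:end]]
```

In this sketch, each element of rows holds three fields: a text passage, an image region, and a surface. A source that contains only a block like this would be a candidate for lazy reading, whereas strict reading expects the surrounding metadata as well.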

We’ll read a single, large DSE collection from a file:

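# `root` is assumed to be the root directory of the CitablePhysicalText repository.
src = joinpath(root, "test", "assets", "hmt-2022k.cex")

using CitablePhysicalText
using CitableBase  # provides FileReader
dsev = fromcex(src, DSECollection, FileReader)
# Lazy alternative (not run here):
# lazydsev = fromcex(src, DSECollection, FileReader; strict = false)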
1-element Vector{DSECollection}:
 urn:cite2:hmt:hmtdse.v1:all Homer Multitext project indexing of digital scholarly editions

The result is a vector of collections, here with a single element:

dsev[1] |> typeof
DSECollection