Vadim Markovtsev, Waren Long
sourced.tech
pga list - download and list the index
pga get - download the siva files
siva unpack + regular Git

>>> from sourced.engine import Engine
>>> engine = Engine(spark, "/path/to/siva/files", "siva")
>>> engine.repositories.references.head_ref \
...     .commits.tree_entries.blobs \
...     .classify_languages() \
...     .filter('lang = "Python"') \
...     .extract_uasts() \
...     .query_uast('//*[@roleIdentifier]') \
...     .extract_tokens("result", "tokens") \
...     .select("blob_id", "path", "tokens")
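
To give a feel for what the identifier-extraction step of the pipeline above produces, here is a minimal stand-in sketch. It does not use sourced.engine, Spark, or UASTs: it swaps in Python's standard `ast` module, so it only handles Python source, and the helper name `identifier_tokens` is hypothetical. It should be read as an illustration of "collect the identifiers from a blob", not as how the engine does it.

```python
import ast
from typing import List


def identifier_tokens(source: str) -> List[str]:
    """Collect identifier names from Python source text.

    Assumption: this is a plain-Python approximation of selecting
    identifier nodes and extracting their tokens; the real pipeline
    works on language-agnostic UASTs instead.
    """
    tree = ast.parse(source)
    tokens = []
    for node in ast.walk(tree):
        if isinstance(node, ast.Name):
            # Variable reads and writes.
            tokens.append(node.id)
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            # Names introduced by definitions.
            tokens.append(node.name)
        elif isinstance(node, ast.arg):
            # Function parameters.
            tokens.append(node.arg)
        elif isinstance(node, ast.Attribute):
            # Attribute names such as `bar` in `foo.bar`.
            tokens.append(node.attr)
    return tokens


# Hypothetical blob content, standing in for one row of the result.
blob = "def add(x, y):\n    total = x + y\n    return total\n"
tokens = identifier_tokens(blob)
```

Each identifier appears once per occurrence in the tree, so `tokens` can be fed directly into frequency counting.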