CWB
A specialised hash for computing frequency distributions over tuples of lexicon IDs.
list of the N attributes corresponding to the keys of the hash
Referenced by main(), and scancorpus_add_key().
int buckets |
int constraint_ok[MAX_N] |
whether constraint is satisfied (initialised at start_cpos, reset at end_cpos)
Referenced by main(), and scancorpus_add_key().
int current_struc[MAX_N] |
number of current or next structure
Referenced by main(), and scancorpus_add_key().
int end_cpos[MAX_N] |
end of this structure (cpos)
Referenced by main(), and scancorpus_add_key().
Referenced by main().
int* id_list[MAX_N] |
optional regex constraint (stored as a list of matching lexicon IDs)
Referenced by main(), and scancorpus_add_key().
int id_list_size[MAX_N] |
size of this list
Referenced by main(), and scancorpus_add_key().
int is_constraint[MAX_N] |
list of flags marking constraint keys ("?...")
Referenced by main(), and scancorpus_add_key().
int is_negated[MAX_N] |
whether regex constraint is negated (!=)
Referenced by main(), and scancorpus_add_key().
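A minimal sketch of how a constraint stored as a list of matching lexicon IDs might be tested, honouring the negation flag. The helper names `cmp_int` and `constraint_matches` are hypothetical, and the sketch assumes the ID list is sorted in ascending order and that a NULL list means "no constraint"; neither assumption is confirmed by this page.

```c
#include <stdlib.h>

/* Comparison callback for bsearch() over an ascending int array. */
int cmp_int(const void *a, const void *b)
{
  int x = *(const int *)a, y = *(const int *)b;
  return (x > y) - (x < y);
}

/* Hypothetical helper: returns 1 if lexicon ID `id` satisfies the constraint.
 * Assumption: id_list is sorted ascending; a NULL list means "no constraint". */
int constraint_matches(const int *id_list, int id_list_size,
                       int is_negated, int id)
{
  int found;

  if (id_list == NULL)
    return 1;
  found = bsearch(&id, id_list, (size_t)id_list_size,
                  sizeof(int), cmp_int) != NULL;
  /* a negated constraint (!=) is satisfied when the ID is NOT in the list */
  return is_negated ? !found : found;
}
```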
int is_structural[MAX_N] |
list of flags identifying s-attributes (all others are p-attributes)
Referenced by main(), and scancorpus_add_key().
int K |
number of non-constraint keys, i.e. the actual hash table stores K-tuples
Referenced by hash_add(), hash_find(), main(), and scancorpus_add_key().
int max_offset |
largest offset of all keys (to avoid scanning past the end of the corpus)
Referenced by main(), and scancorpus_add_key().
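A rough sketch of how a largest offset could bound the scan so that no key is read past the last corpus position. The helpers `largest_offset` and `scannable_positions` are hypothetical, as is the choice to treat the maximum as never smaller than 0.

```c
/* Hypothetical helper: largest entry of offset[0..n-1], but never below 0
 * (assumption: a key without an offset counts as offset 0). */
int largest_offset(const int *offset, int n)
{
  int i, max = 0;

  for (i = 0; i < n; i++)
    if (offset[i] > max)
      max = offset[i];
  return max;
}

/* Hypothetical helper: number of corpus positions that can be scanned from
 * cpos 0 such that cpos + offset < corpus_size holds for every key. */
int scannable_positions(int corpus_size, int max_offset)
{
  int n = corpus_size - max_offset;

  return n > 0 ? n : 0;
}
```

Negative offsets would need a matching lower bound, which this sketch does not handle.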
int N |
number of keys, including constraint-only keys
Referenced by main(), and scancorpus_add_key().
int offset[MAX_N] |
list of optional corpus position offsets
Referenced by main(), and scancorpus_add_key().
optional regex constraint (compiled regular expression)
Referenced by main(), and scancorpus_add_key().
char* source_base[MAX_N] |
base pointers to compute virtual IDs (= offsets) from annotation strings
Referenced by main(), and scancorpus_add_key().
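A minimal sketch of the "virtual ID = offset" idea: the ID of an annotation string is its distance in bytes from a base pointer, and the string can be recovered by adding the ID back. The helper names are hypothetical, and the sketch assumes every annotation string lies inside one contiguous block starting at the base pointer.

```c
#include <stddef.h>

/* Hypothetical helper: virtual ID of an annotation string.
 * Assumption: `annotation` points into the block that starts at `base`. */
int virtual_id_of(const char *base, const char *annotation)
{
  return (int)(annotation - base);
}

/* Hypothetical inverse: recover the annotation string from its virtual ID. */
const char *annotation_of(const char *base, int virtual_id)
{
  return base + virtual_id;
}
```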
int start_cpos[MAX_N] |
start of this structure (cpos)
Referenced by main(), and scancorpus_add_key().
array of HashEntry pointers == buckets (initialised to NULL, i.e. empty buckets)
Referenced by hash_add(), hash_find(), and main().
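For orientation, a chained hash table that counts K-tuples of integer IDs could look roughly like the sketch below. It is not the CWB implementation: the types `SketchEntry` and `SketchHash`, the functions `sketch_init`, `sketch_find` and `sketch_add`, and the multiplicative hash function are all hypothetical stand-ins, and the sketch assumes K >= 1 and buckets >= 1.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical entry: a K-tuple of IDs, its frequency, and a chain link. */
typedef struct SketchEntry {
  struct SketchEntry *next;
  int freq;
  int *tuple;                 /* K integer IDs */
} SketchEntry;

typedef struct {
  int K;                      /* tuple length */
  int buckets;                /* number of buckets */
  SketchEntry **table;        /* bucket array, NULL == empty bucket */
} SketchHash;

/* Allocates the bucket array with all buckets empty; 0 on success, -1 on failure. */
int sketch_init(SketchHash *h, int K, int buckets)
{
  int i;

  h->K = K;
  h->buckets = buckets;
  h->table = malloc((size_t)buckets * sizeof(SketchEntry *));
  if (h->table == NULL)
    return -1;
  for (i = 0; i < buckets; i++)
    h->table[i] = NULL;
  return 0;
}

/* Illustrative hash function over the K IDs (an assumption, not CWB's). */
unsigned int sketch_hash_tuple(const int *tuple, int K)
{
  unsigned int h = 0;
  int i;

  for (i = 0; i < K; i++)
    h = h * 31u + (unsigned int)tuple[i];
  return h;
}

/* Returns the entry for `tuple`, or NULL if the tuple has not been added. */
SketchEntry *sketch_find(const SketchHash *h, const int *tuple)
{
  unsigned int b = sketch_hash_tuple(tuple, h->K) % (unsigned int)h->buckets;
  SketchEntry *e = h->table[b];

  while (e != NULL && memcmp(e->tuple, tuple, (size_t)h->K * sizeof(int)) != 0)
    e = e->next;
  return e;
}

/* Increments the frequency of `tuple`, creating its entry on first sight.
 * Returns the new frequency, or -1 if memory allocation fails. */
int sketch_add(SketchHash *h, const int *tuple)
{
  SketchEntry *e = sketch_find(h, tuple);

  if (e == NULL) {
    unsigned int b = sketch_hash_tuple(tuple, h->K) % (unsigned int)h->buckets;

    e = malloc(sizeof *e);
    if (e == NULL)
      return -1;
    e->tuple = malloc((size_t)h->K * sizeof(int));
    if (e->tuple == NULL) {
      free(e);
      return -1;
    }
    memcpy(e->tuple, tuple, (size_t)h->K * sizeof(int));
    e->freq = 0;
    e->next = h->table[b];    /* push onto the front of the bucket's chain */
    h->table[b] = e;
  }
  return ++e->freq;
}
```

Chaining keeps insertion simple and lets the table hold more tuples than it has buckets, at the cost of longer lookups when chains grow.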
int virtual_id[MAX_N] |
virtual ID of a region's annotation string (constant within region)
Referenced by main().