A Fully Reversible Data Transform Technique Enhancing Data Compression of SMILES Data

Abstract : The need to efficiently store and process SMILES data used in Chemoinformatics creates a demand for efficient techniques to compress this data. General-purpose transforms and compressors can transform and compress this type of data to a certain extent; however, these techniques are not specific to SMILES data. We develop a transform specific to SMILES data that can be used alongside general-purpose compressors as a preprocessor and post-processor to improve the compression of SMILES data. We test our transform with six general-purpose compressors on our SMILES data corpus, and compare our results both with another transform and with untransformed data.
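
The preprocessor/post-processor arrangement described in the abstract can be illustrated with a minimal sketch. This is not the authors' transform: the substitution table `TOKENS` and the helper names are hypothetical, and `zlib` merely stands in for any general-purpose compressor. The sketch only shows how a reversible, SMILES-aware substitution step can wrap an unmodified compressor.

```python
import zlib

# Hypothetical table (an assumption, not taken from the paper): frequent
# SMILES substrings mapped to single bytes outside the ASCII range, so a
# token can never collide with a character of the original data.
TOKENS = {
    b"c1ccccc1": b"\x80",  # benzene ring
    b"C(=O)": b"\x81",     # carbonyl group
    b"Cl": b"\x82",
    b"Br": b"\x83",
}

def forward(data: bytes) -> bytes:
    """Preprocessor: replace known substrings with one-byte tokens."""
    if not data.isascii():
        raise ValueError("input must be pure ASCII for the transform to be reversible")
    for pattern, token in TOKENS.items():
        data = data.replace(pattern, token)
    return data

def inverse(data: bytes) -> bytes:
    """Post-processor: expand every token back to its original substring."""
    for pattern, token in TOKENS.items():
        data = data.replace(token, pattern)
    return data

def pack(data: bytes) -> bytes:
    # Transform first, then hand the result to the general-purpose compressor.
    return zlib.compress(forward(data))

def unpack(blob: bytes) -> bytes:
    # Decompress first, then undo the transform.
    return inverse(zlib.decompress(blob))

sample = b"CC(=O)Oc1ccccc1C(=O)O\nClc1ccccc1\n"
restored = unpack(pack(sample))
```

Because the tokens lie outside the ASCII range and the input is checked to be ASCII, `inverse(forward(x))` returns `x` unchanged for any accepted input. Whether such a step actually improves the compressed size depends on the corpus and the compressor, which is what the paper evaluates.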

https://hal.inria.fr/hal-01506777
Contributor : Hal Ifip
Submitted on : Wednesday, April 12, 2017 - 11:19:09 AM
Last modification on : Thursday, April 13, 2017 - 1:06:48 AM
Long-term archiving on : Thursday, July 13, 2017 - 12:42:20 PM

File

978-3-642-40511-2_5_Chapter.pd...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-01506777, version 1

Citation

Shagufta Scanlon, Mick Ridley. A Fully Reversible Data Transform Technique Enhancing Data Compression of SMILES Data. 1st Cross-Domain Conference and Workshop on Availability, Reliability, and Security in Information Systems (CD-ARES), Sep 2013, Regensburg, Germany. pp.54-68. ⟨hal-01506777⟩
