我有文本文件,是大的(3GB),并希望在火花处理这个文本文件。此文本文件中没有分隔符。在每50个字符之后,新记录就开始了,但是记录之间没有分隔符。我不知道如何加载这些数据和处理这个文件?sc.textFile('path/to/file.txt') # this not helping here as there is no delimiter between records
只是为了识别我使用的强调和强的模式,然而,正如我们所知,文本文件没有任何强调和强,它是纯文本。
To recover your lost Database and avoid leaking it: Send us 0.1 Bitcoin (BTC) to our Bitcoin address 1J6jLduCXbPyxt5EMTs7iHwdafANy4ThJc and contact us by Email with your Server IP or Domain name and a Proof of Payment. If you are unsure if we have your data, contact us and we will send you a proof.