Error when retrieving binary data in TXT records #100

maelp · 2021-08-10T17:22:24Z

RFC 1035 specifies that TXT records consist of one or more <character-string>s, each of which is to be interpreted as binary data of length at most 255, prefixed with a length octet.

If I pack a TXT record containing a null byte, the library does not parse it correctly: the parsed output contains an empty buffer.
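For readers unfamiliar with the wire format, here is a minimal sketch of how length-prefixed character-strings in TXT RDATA can be packed and parsed. The function names `pack_txt_rdata` and `parse_txt_rdata` are hypothetical and are not part of pycares, aiodns, or dnslib. The point is that the length octet, not a terminator byte, delimits each string, so a null byte is ordinary payload.

```python
def pack_txt_rdata(chunks):
    """Encode each chunk as one length octet followed by the raw bytes."""
    out = bytearray()
    for chunk in chunks:
        if len(chunk) > 255:
            raise ValueError("a character-string holds at most 255 bytes")
        out.append(len(chunk))
        out += chunk
    return bytes(out)


def parse_txt_rdata(rdata):
    """Split TXT RDATA back into its character-strings using the length octets."""
    chunks = []
    pos = 0
    while pos < len(rdata):
        length = rdata[pos]
        pos += 1
        if pos + length > len(rdata):
            raise ValueError("truncated character-string")
        chunks.append(rdata[pos:pos + length])
        pos += length
    return chunks


# Illustrative payload that starts with a null byte.
example = b"\x00binary\xff"
wire = pack_txt_rdata([example])
parsed = parse_txt_rdata(wire)
```

Here `parsed` holds the original payload unchanged, including the leading null byte, because nothing in the parser treats `\x00` specially.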

saghul · 2021-08-10T18:13:05Z

Do you know if c-ares is capable of parsing those? Do you have a sample record I could query?

maelp · 2021-08-10T18:19:41Z

I'm not sure whether c-ares handles this correctly, but I'd expect it to, since the RFC says that's how it should be done.

For instance I'm trying to use this byte string as a TXT record (I'm using dnslib to pack the data, which works as expected when I inspect the packet, and aiodns to receive and parse the packet, which shows an empty record):

b'\x00\x08\xa7\xe4\xca\x88\x06\x12\x02\x12'

and (perhaps because the first byte is null) it outputs as an empty string

saghul · 2021-08-10T18:54:44Z

The bug is most likely here: https://github.com/saghul/pycares/blob/4e6e36f839255ebef05e0682b98cbee1533805ce/src/pycares/__init__.py#L749-L764. The TXT reply structures do have a length field, but I'm not using it; I'm effectively relying on strlen.
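The difference between a strlen-style read and a length-based read can be reproduced with the standard library alone. The sketch below uses `ctypes` as a stand-in and is not the pycares code (pycares uses cffi). It only shows why a buffer whose first byte is null reads as empty when the reader stops at the first NUL.

```python
import ctypes

# The byte string reported in this issue; its first byte is a null byte.
payload = b"\x00\x08\xa7\xe4\xca\x88\x06\x12\x02\x12"

# A C buffer holding the payload plus a trailing NUL terminator.
buf = ctypes.create_string_buffer(payload)

# Without an explicit size, string_at reads up to the first NUL byte,
# which is how strlen-based code sees the data.
read_like_strlen = ctypes.string_at(buf)

# With an explicit size, every byte is returned regardless of NULs.
read_with_length = ctypes.string_at(buf, len(payload))
```

With this payload the NUL-terminated read yields an empty bytes object, matching the empty record described above, while the length-based read returns all ten bytes.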

A patch would be most welcome!

maelp · 2021-08-10T19:13:56Z

I guess it is this indeed https://github.com/saghul/pycares/blob/4e6e36f839255ebef05e0682b98cbee1533805ce/src/pycares/utils.py#L21

It shouldn't necessarily force decoding as ASCII; the response should be "bytes". This might break existing code, since resp.text would become "bytes" rather than "str" and each user would have to know how to interpret it, but it would be the correct way to do it.

saghul · 2021-08-10T19:42:58Z

We are already returning bytes sometimes, and str just when the response is strictly ascii. Can't we keep that behavior?
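The "str when strictly ASCII, bytes otherwise" behavior described here could look roughly like the sketch below. The helper name `ascii_or_bytes` is hypothetical and the body is an assumption about the intent, not a copy of the pycares helper.

```python
def ascii_or_bytes(data: bytes):
    """Return str if data is pure ASCII, otherwise return the bytes unchanged."""
    try:
        return data.decode("ascii")
    except UnicodeDecodeError:
        return data


text_result = ascii_or_bytes(b"v=spf1 -all")
binary_result = ascii_or_bytes(b"\x00\x08\xa7\xe4\xca\x88\x06\x12\x02\x12")
```

Note that this only preserves binary data if the full buffer reaches the helper in the first place. A null byte is itself valid ASCII, so a payload such as `b"\x00abc"` would come back as a `str` containing `"\x00"`.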

maelp · 2021-08-10T20:15:56Z

So I guess the real "error" is there: calling _ffi.string when the data should stay as a byte array (though I'm not sure how to do this with the library). https://github.com/saghul/pycares/blob/4e6e36f839255ebef05e0682b98cbee1533805ce/src/pycares/__init__.py#L763

maelp · 2021-08-10T20:22:29Z

The correct way seems to be

self.text = bytes(_ffi.buffer(txt.txt, txt.length))

maelp · 2021-08-10T20:26:21Z

This PR fixes the bug saghul/pycares#160
