Paths

Table of Contentst

[Python 3] bytes-like strings decoding for saslauthd.py
ClosedPublic
Actions

Authored by ghane on Mar 23 2022, 5:54 PM.

Project Tags

None

Referenced Files

	F124724195: D3469.1778927716.diff
	Fri, May 15, 10:35 AM

	F124724194: D3469.1778927712.diff
	Fri, May 15, 10:35 AM

	F124688185: D3469.1778650512.diff
	Tue, May 12, 5:35 AM

	F122562638: D3469.1777902434.diff
	Sun, May 3, 1:47 PM

	F122400937: D3469.1777861596.diff
	Sun, May 3, 2:26 AM

	F122007477: D3469.1777710077.diff
	Fri, May 1, 8:21 AM

	F121332869: D3469.1777322582.diff
	Sun, Apr 26, 8:43 PM

	F121332215: D3469.1777321921.diff
	Sun, Apr 26, 8:32 PM

View All 46 Files

Subscribers

sicherha

vanmeeuwen

Details

Reviewers

vanmeeuwen
sicherha

Group Reviewers

PyKolab Developers

Commits

rP0638c077f14f: [Python 3] bytes-like strings decoding for saslauthd.py

Summary

byte-like strings needed to be decoded as text strings

debugging with 2.7 origin code returned unicode strings, but returned in python 3 bytse-like string variables.

Optparse gets text strings from shell.
configparse uses internally text strings.
From version 3.0, python-ldap uses text where appropriate. On Python 2, the bytes mode setting influences how text is handled.

socket.streams and db including bytes-like string code, which will be decoded with this diff, for python-ldap operations.

Diff Detail

Repository

rP pykolab

Lint

Lint Skipped

Unit

Tests Skipped

Event Timeline

ghane requested review of this revision.Mar 23 2022, 5:54 PM

ghane created this revision.

ghane edited the summary of this revision. (Show Details)

I don't understand the case or cases in which this change helps, where it would have otherwise failed.

This revision now requires changes to proceed.Mar 23 2022, 6:13 PM

this is for python 3 as there is more strict type operations between byte and text strings.
https://docs.python.org/3/howto/pyporting.html#text-versus-binary-data

you could search each unicode string in code like:

    login.append(value)

if len(login) == 4:
    realm = login[3]
elif len(login[0].split('@')) > 1:  # this will fail in python 3 as it is mixed bytes and split uses text string , but both is only str in python2 
    realm = login[0].split('@')[1]    # this will fail in python 3 as it is mixed bytes and split uses text string , but both is only str in python2 
else:
    realm = conf.get('kolab', 'primary_domain') # this is text string in python3, in auth/ you would get a mix between login [0] login [1] and realm

I tested the code against versions 2.7, 3.7, 3.8 on debian buster and ubuntu focal.

@vanmeeuwen, how should we proceed here? This is an effort to get the PyKolab codebase into a state where it works with Python 3 without breaking existing systems that are still based on legacy Python 2. Given that background, the commit looks plausible to me.

pykolab/auth/ldap/auth_cache.py
139	I know it's already present in the original code, but the second argument to `encode()` looks strange to me. Isn't that argument supposed to be a string describing the error-handling scheme? The value `'latin1'` wouldn't make any sense in that case.

streams ( Python 2 -> type { str } (bytes string) | Python 3 -> type { class bytes } ) uneven
encode() ( Python 2 -> type { str } (bytes string) | Python 3 -> type { class bytes } ) uneven
decode() ( Python 2 -> type { str } (text string) | Python 3 -> type { class string } ) even

LDAP needs string on search text string to get a result, else the result of search is 0,. <= this is an error, this case is not filtered.
LDAP gets a string on Python 2 as bytes strings are also represented as class string , on Python 3 this is more explicit and byte string are now of class bytes, as in Python 2 bytes() represents the class bytes.

setting table "entries" columns explicitly "domain" from String -> Unicode and "values, keys" from Text -> UnicodeText, would make encode() and decode() unnecessary in auth_cache.py and encoding decoding would handled by sql alchemy.
but if you need the OS locale encode().decode() would do the job. decode() uses as default the OS locale

encode and decode becomes obsolete at the return value as sql, sql alchemy should do the encoding/decoding
see also:

class Entry
    def __init__

line 70-75 checks unicode

Nice, this looks way better.

This revision was not accepted when it landed; it landed in state Needs Review.Jun 15 2022, 9:57 PM

Closed by commit rP0638c077f14f: [Python 3] bytes-like strings decoding for saslauthd.py. · Explain Why

This revision was automatically updated to reflect the committed changes.

Revision Contents
Changeset List

Path

Size

pykolab/

auth/

ldap/

auth_cache.py

lines

Diff 10400

View Options

pykolab/auth/ldap/auth_cache.py

[Python 3] bytes-like strings decoding for saslauthd.pyClosedPublicActions