Splitting a semicolon-separated string to a dictionary, in Python

I have a string that looks like this:


Is there a built-in class/function in Python that will take that string and construct a dictionary, as though I had done this:

dict = {
    "Name1": "Value1",
    "Name2": "Value2",
    "Name3": "Value3"

I have looked through the modules available but can't seem to find anything that matches.

Thanks, I do know how to make the relevant code myself, but since such smallish solutions are usually mine-fields waiting to happen (ie. someone writes: Name1='Value1=2';) etc. then I usually prefer some pre-tested function.

I'll do it myself then.

Asked by: Dominik759 | Posted: 28-01-2022

Answer 1

There's no builtin, but you can accomplish this fairly simply with a generator comprehension:

s= "Name1=Value1;Name2=Value2;Name3=Value3"
dict(item.split("=") for item in s.split(";"))

[Edit] From your update you indicate you may need to handle quoting. This does complicate things, depending on what the exact format you are looking for is (what quote chars are accepted, what escape chars etc). You may want to look at the csv module to see if it can cover your format. Here's an example: (Note that the API is a little clunky for this example, as CSV is designed to iterate through a sequence of records, hence the .next() calls I'm making to just look at the first line. Adjust to suit your needs):

>>> s = "Name1='Value=2';Name2=Value2;Name3=Value3"

>>> dict(csv.reader([item], delimiter='=', quotechar="'").next() 
         for item in csv.reader([s], delimiter=';', quotechar="'").next())

{'Name2': 'Value2', 'Name3': 'Value3', 'Name1': 'Value1=2'}

Depending on the exact structure of your format, you may need to write your own simple parser however.

Answered by: Carlos582 | Posted: 01-03-2022

Answer 2

This comes close to doing what you wanted:

>>> import urlparse
>>> urlparse.parse_qs("Name1=Value1;Name2=Value2;Name3=Value3")
{'Name2': ['Value2'], 'Name3': ['Value3'], 'Name1': ['Value1']}

Answered by: Rafael330 | Posted: 01-03-2022

Answer 3

s1 = "Name1=Value1;Name2=Value2;Name3=Value3"

dict(map(lambda x: x.split('='), s1.split(';')))

Answered by: Marcus864 | Posted: 01-03-2022

Answer 4

It can be simply done by string join and list comprehension

",".join(["%s=%s" % x for x in d.items()])

>>d = {'a':1, 'b':2}
>>','.join(['%s=%s'%x for x in d.items()])

Answered by: Rubie562 | Posted: 01-03-2022

Answer 5

easytiger $ cat test.out test.py | sed 's/^/    /'
{'Name2': 'Value2', 'Name3': 'Value3', 'Name1': 'Value1'}
{'Name2': 'Value2', 'Name3': "'Value3'", 'Name1': 'Value1'}
{'Name2': ['Value2'], 'Name3': ["'Value3'"], 'Name1': ['Value1']}
import timeit
import urlparse

s = "Name1=Value1;Name2=Value2;Name3='Value3'"

def p_easytiger_quoting(s):
    d = {}
    s = s.replace("'", "")
    for x in s.split(';'):
        k, v = x.split('=')
        d[k] = v
    return d

def p_brian(s):
    return dict(item.split("=") for item in s.split(";"))

def p_kyle(s):
    return urlparse.parse_qs(s)

print "p_easytiger_quoting:" + str(timeit.timeit(lambda: p_easytiger_quoting(s)))
print p_easytiger_quoting(s)

print "p_brian:" + str(timeit.timeit(lambda: p_brian(s)))
print p_brian(s)

print "p_kyle:" + str(timeit.timeit(lambda: p_kyle(s)))
print p_kyle(s)

Answered by: Owen550 | Posted: 01-03-2022

Answer 6

IF your Value1, Value2 are just placeholders for actual values, you can also use the dict() function in combination with eval().

>>> s= "Name1=1;Name2=2;Name3='string'"
>>> print eval('dict('+s.replace(';',',')+')')
{'Name2: 2, 'Name3': 'string', 'Name1': 1}

This is beacuse the dict() function understand the syntax dict(Name1=1, Name2=2,Name3='string'). Spaces in the string (e.g. after each semicolon) are ignored. But note the string values do require quoting.

Answered by: Fiona144 | Posted: 01-03-2022

Similar questions

Parsing/searching a comma and semicolon-separated string in python

The thing I am working on ATM has a (kinda) long strings of data like this: 56,1,0,153,0,0;56,1,0,153,0,0;56,1,0,153,0,0;5,1,2,34,B_3_1_1,0;5,1,2,34,C_9841,0; I would like to look for the values starting with 'C_' and return the number after it. I know they will always be on the fourth position of a list of values delimited by semicolon. I was thinking about using regular expression to parse the st...

python - Regex matching a string which is a semicolon-separated sequence of tokens

I am trying to produce a Python regex string to validate the values of a column which are comma-separated sequences of unique three-letter codes from a list of three-letter (upper-cased) alphanumeric codes, e.g . the list looks something like ['XA1', 'CZZ', 'BT9', 'WFF',...]. So valid column values could be XA1, XA1;CZZ, or XA1;BT9;WFF; etc. A code cannot occur in the seq...

Still can't find your answer? Check out these communities...

PySlackers | Full Stack Python | NHS Python | Pythonist Cafe | Hacker Earth | Discord Python